Monte Carlo methods are ways of solving the reinforcement learning problem based on averaging sample returns
General policy iteration (GPI)
First visit mc method
Every visit mc method
gamma lowers over time
Every visit has bias