CS 188 Midterm 1 Notes

Probability Review

Random Variables - P(X)

Policy Evaluation

MDP Equations

Q-Learning

Exploration vs Exploitation

Minimizing Error