In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning.
This course is part of the Reinforcement Learning Specialization
Offered By
About this Course
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode
Skills you will gain
- Artificial Intelligence (AI)
- Machine Learning
- Reinforcement Learning
- Function Approximation
- Intelligent Systems
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode
Syllabus - What you will learn from this course
Welcome to the Course!
Monte Carlo Methods for Prediction & Control
Temporal Difference Learning Methods for Prediction
Temporal Difference Learning Methods for Control
Planning, Learning & Acting
Reviews
- 5 stars81.99%
- 4 stars13.59%
- 3 stars2.85%
- 2 stars0.60%
- 1 star0.95%
TOP REVIEWS FROM SAMPLE-BASED LEARNING METHODS
Well done. Follows Reinforcement Learning (Sutton/Barto) closely and explains topics well. Graded notebooks are invaluable in understanding the material well.
Good balance of theory and programming assignments. I really like the weekly bonus videos with professors and developers. Recommend to everyone.
Pretty clear explanations! Nice starting point if you want to deep dive into RL. It gives clear picture over some confusing terms in RL.
Everything is great overall but It would be more better if DynaQ & DynaQ+ were explained more detail in the lecture instead of assignment.
About the Reinforcement Learning Specialization

Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
More questions? Visit the Learner Help Center.