About this Course

8,082 recent views
Flexible deadlines
Reset deadlines in accordance to your schedule.
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Intermediate Level

Introductory computer science and data structures. Familiarity with the Python. Familiarity with basic probability and optimization.

Approx. 46 hours to complete
English

What you will learn

  • Map between qualitative preferences and appropriate quantitative utilities.

  • Model non-associative and associative sequential decision problems with multi-armed bandit problems and Markov decision processes respectively

  • Implement dynamic programming algorithms to find optimal policies

  • Implement basic reinforcement learning algorithms using Monte Carlo and temporal difference methods

Skills you will gain

  • Deep Learning
  • Markov Decision Process
  • Machine Learning
  • Reinforcement Learning
  • Monte Carlo Method
Flexible deadlines
Reset deadlines in accordance to your schedule.
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Intermediate Level

Introductory computer science and data structures. Familiarity with the Python. Familiarity with basic probability and optimization.

Approx. 46 hours to complete
English

Offered by

Placeholder

Columbia University

Syllabus - What you will learn from this course

Week1
Week 1
6 hours to complete

Decision Making and Utility Theory

6 hours to complete
6 videos (Total 40 min), 6 readings, 2 quizzes
Week2
Week 2
5 hours to complete

Bandit Problems

5 hours to complete
3 videos (Total 36 min), 1 reading, 2 quizzes
Week3
Week 3
5 hours to complete

Markov Decision Processes

5 hours to complete
6 videos (Total 36 min), 1 reading, 2 quizzes
Week4
Week 4
8 hours to complete

Dynamic Programming

8 hours to complete
6 videos (Total 42 min), 1 reading, 3 quizzes

Frequently Asked Questions

More questions? Visit the Learner Help Center.