This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.

Decision Making and Reinforcement Learning

Decision Making and Reinforcement Learning

Instructor: Tony Dear
Access provided by Mojatu Foundation
4,415 already enrolled
22 reviews
Recommended experience
What you'll learn
Map between qualitative preferences and appropriate quantitative utilities.
Model non-associative and associative sequential decision problems with multi-armed bandit problems and Markov decision processes respectively
Implement dynamic programming algorithms to find optimal policies
Implement basic reinforcement learning algorithms using Monte Carlo and temporal difference methods
Details to know

Add to your LinkedIn profile
8 assignments
See how employees at top companies are mastering in-demand skills

There are 8 modules in this course
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Learner reviews
- 5 stars
63.63%
- 4 stars
22.72%
- 3 stars
0%
- 2 stars
9.09%
- 1 star
4.54%
Showing 3 of 22
Reviewed on Jan 20, 2024
Very good introductory and basic to Reinforcement Learning. But programming assignments need more careful compilation and more attention to detail!
Reviewed on Jul 9, 2023
Well-structured course that provides a great introduction to methodologies used in reinforcement learning. I am now eager to experiment more in my own time, to consolidate what I have learned.
Explore more from Computer Science

University of Alberta

New York University

Simplilearn

University of Alberta

