Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

Spécialisation "Foundations of Reinforcement Learning"

Obtenez l'une de nos meilleures offres avec Coursera Plus pour 199 $ (habituellement 399 $). Économisez maintenant.

Ce spécialisation n'est pas disponible en Français (France)

Nous sommes actuellement en train de le traduire dans plus de langues.

Spécialisation "Foundations of Reinforcement Learning"

Master Reinforcement Learning.

Build foundations in classical RL, deep RL, and reward design.

Instructeur : Ashutosh Trivedi

Inclus avec

Série de 3 cours

Approfondissez votre connaissance d’un sujet

niveau Intermédiaire

Expérience recommandée

4 semaines à compléter

à 10 heures par semaine

Planning flexible

Apprenez à votre propre rythme

Série de 3 cours

Approfondissez votre connaissance d’un sujet

niveau Intermédiaire

Expérience recommandée

4 semaines à compléter

à 10 heures par semaine

Planning flexible

Apprenez à votre propre rythme

Ce que vous apprendrez

Explain the mathematical foundations of reinforcement learning.
Analyze and compare tabular, approximate, and deep reinforcement learning algorithms .
Explain how function approximation and neural networks extend reinforcement learning beyond finite tabular settings
Design, infer, and assess reward structures and specification-based objectives that align learned behavior with intended task goals.

Compétences que vous acquerrez

Catégorie : Model Optimization
Catégorie : Agentic systems
Catégorie : Artificial Intelligence and Machine Learning (AI/ML)
Catégorie : Artificial Intelligence
Catégorie : Reinforcement Learning
Catégorie : Decision Intelligence
Catégorie : Machine Learning Algorithms
Catégorie : Theoretical Computer Science
Catégorie : Machine Learning
Catégorie : Statistical Machine Learning
Catégorie : Model Evaluation
Catégorie : Computational Logic
Catégorie : Responsible AI
Catégorie : Markov Model
Catégorie : Applied Mathematics
Catégorie : Deep Learning
Catégorie : Machine Learning Methods
Catégorie : Algorithms
Catégorie : Artificial Neural Networks

Outils que vous découvrirez

Catégorie : AI Workflows

Détails à connaître

Certificat partageable

Ajouter à votre profil LinkedIn

Enseigné en Anglais

Récemment mis à jour !

juillet 2026

Découvrez comment les employés des entreprises prestigieuses maîtrisent des compétences recherchées

En savoir plus sur Coursera pour les affaires

logos de Petrobras, TATA, Danone, Capgemini, P&G et L'Oreal

Améliorez votre expertise en la matière

Acquérez des compétences recherchées auprès d’universités et d’experts du secteur
Maîtrisez un sujet ou un outil avec des projets pratiques
Développez une compréhension approfondie de concepts clés
Obtenez un certificat professionnel auprès de University of Colorado Boulder

Spécialisation - série de 3 cours

Reinforcement learning studies how agents learn to make better decisions through interaction with an environment. Agents act, observe consequences, receive feedback, and adapt future behavior. This specialization develops reinforcement learning as a framework for sequential decision-making under uncertainty, progressing from classical foundations to scalable deep learning methods and reward design.

The first course, Classical Reinforcement Learning, introduces finite-state decision problems, Markov chains, Markov decision processes, discounted rewards, Bellman equations, planning with known models, and learning from sampled experience. Learners study value iteration, policy iteration, Monte Carlo methods, temporal-difference learning, SARSA, and Q-learning.

The second course, Deep Reinforcement Learning, shows how reinforcement learning scales beyond tabular settings using neural-network-based function approximation. Learners study Deep Q-Networks, replay buffers, target networks, policy-gradient methods, actor–critic algorithms, and modern methods such as PPO, DDPG, and SAC, with attention to stability, diagnosis, evaluation, and reproducibility.

The third course, Reward Programming, addresses how to design, infer, monitor, and revise objectives so agents learn intended behavior. Learners study temporal logic, automata, reward machines, reward shaping, inverse reinforcement learning, preference feedback, safety, shielding, auditing, and stress testing.

Projet d'apprentissage appliqué

Learners complete conceptual quizzes throughout the specialization to check their understanding of reinforcement learning foundations, deep RL methods, and reward-design principles. These assessments emphasize interpreting algorithms, diagnosing learning behavior, and reasoning about how agents make decisions under uncertainty.

Mastering Classic Reinforcement Learning Algorithms

COURS 1, 14 heures

Ce que vous apprendrez

Formulate sequential decision-making problems as deterministic decision processes, Markov chains, and finite Markov decision processes.
Explain and apply core reinforcement-learning concepts, including discounting, value functions, policies, Bellman equations, and optimality.
Implement planning algorithms for finite Markov decision processes, including value iteration, policy iteration, and linear programming formulations.
Compare tabular reinforcement-learning algorithms, including bandits, Monte Carlo methods, temporal-difference learning, SARSA, and Q-learning.

Compétences que vous acquerrez

Catégorie : Reinforcement Learning

Catégorie : Markov Model

Catégorie : Model Optimization

Catégorie : Probability Distribution

Catégorie : Machine Learning

Catégorie : Algorithms

Catégorie : Machine Learning Algorithms

Catégorie : Statistical Machine Learning

Catégorie : Probability & Statistics

Catégorie : Artificial Intelligence and Machine Learning (AI/ML)

Catégorie : Sampling (Statistics)

Catégorie : Decision Intelligence

Catégorie : Applied Mathematics

Deep Reinforcement Learning: From Theory to Practice

COURS 2, 14 heures

Ce que vous apprendrez

Explain how neural-network-based function approximation extends reinforcement learning beyond finite tabular settings.
Implement and evaluate value-based deep reinforcement learning algorithms, including Deep Q-Networks and stabilizing techniques.
Derive and implement policy-gradient methods, including REINFORCE, baselines, and advantage-based updates.
Explain and analyze actor–critic methods that combine policy optimization with value estimation.

Compétences que vous acquerrez

Catégorie : Reinforcement Learning

Catégorie : Deep Learning

Catégorie : System Design and Implementation

Catégorie : Machine Learning

Catégorie : Artificial Intelligence

Catégorie : Applied Machine Learning

Catégorie : Machine Learning Algorithms

Catégorie : Model Training

Catégorie : Model Evaluation

Catégorie : Artificial Neural Networks

Catégorie : Algorithms

Catégorie : Model Optimization

Catégorie : Machine Learning Methods

Catégorie : Agentic systems

Reward Programming: Optimizing RL Efficiency and Safety

COURS 3, 13 heures

Ce que vous apprendrez

Identify limitations of standard scalar reward formulations, including reward hacking, specification gaming, and brittle proxies.
Express structured learning objectives using formal tools such as temporal logic, automata, and reward machines.
Construct and analyze reward mechanisms based on temporal logic, automata, product MDPs, reward machines, and reward shaping.
Model reward-programming problems under hidden state, memory, hierarchy, multiagent interaction, and continuous-time dynamics

Compétences que vous acquerrez

Catégorie : Reinforcement Learning

Catégorie : Safety and Security

Catégorie : Machine Learning

Catégorie : Theoretical Computer Science

Catégorie : Model Evaluation

Catégorie : Machine Learning Methods

Catégorie : Agentic systems

Catégorie : Computational Logic

Catégorie : Functional Specification

Catégorie : Verification And Validation

Catégorie : Continuous Monitoring

Catégorie : Responsible AI

Catégorie : AI Workflows

Catégorie : Markov Model

Catégorie : Model Optimization

Obtenez un certificat professionnel

Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.

Instructeur

Ashutosh Trivedi

University of Colorado Boulder

3 Cours60 apprenants

Offert par

University of Colorado Boulder

Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?

Felipe M.

Étudiant(e) depuis 2018

’Pouvoir suivre des cours à mon rythme à été une expérience extraordinaire. Je peux apprendre chaque fois que mon emploi du temps me le permet et en fonction de mon humeur.’

Jennifer J.

Étudiant(e) depuis 2020

’J'ai directement appliqué les concepts et les compétences que j'ai appris de mes cours à un nouveau projet passionnant au travail.’

Larry W.

Étudiant(e) depuis 2021

’Lorsque j'ai besoin de cours sur des sujets que mon université ne propose pas, Coursera est l'un des meilleurs endroits où se rendre.’

Chaitanya A.

’Apprendre, ce n'est pas seulement s'améliorer dans son travail : c'est bien plus que cela. Coursera me permet d'apprendre sans limites.’

Foire Aux Questions

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Plus de questions

Visitez le Centre d'Aide pour les Étudiants

Aide financière disponible,