Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

Spezialisierung „Foundations of Reinforcement Learning“

Sichern Sie sich eines unserer besten Angebote mit Coursera Plus für 199 $ (normalerweise 399 $). Jetzt sparen.

spezialisierung ist nicht verfügbar in Deutsch (Deutschland)

Wir übersetzen es in weitere Sprachen.

Spezialisierung „Foundations of Reinforcement Learning“

Master Reinforcement Learning.

Build foundations in classical RL, deep RL, and reward design.

Dozent: Ashutosh Trivedi

Bei enthalten

Mehr erfahren

Fragen Sie Coursera

3-teilige Kursreihe

Befassen Sie sich eingehend mit einem Thema

Stufe Mittel

Empfohlene Erfahrung

4 Wochen zu vervollständigen

unter 10 Stunden pro Woche

Flexibler Zeitplan

In Ihrem eigenen Lerntempo lernen

3-teilige Kursreihe

Befassen Sie sich eingehend mit einem Thema

Stufe Mittel

Empfohlene Erfahrung

4 Wochen zu vervollständigen

unter 10 Stunden pro Woche

Flexibler Zeitplan

In Ihrem eigenen Lerntempo lernen

Was Sie lernen werden

Explain the mathematical foundations of reinforcement learning.
Analyze and compare tabular, approximate, and deep reinforcement learning algorithms .
Explain how function approximation and neural networks extend reinforcement learning beyond finite tabular settings
Design, infer, and assess reward structures and specification-based objectives that align learned behavior with intended task goals.

Kompetenzen, die Sie erwerben

Kategorie: Artificial Neural Networks
Kategorie: Machine Learning
Kategorie: Theoretical Computer Science
Kategorie: Applied Mathematics
Kategorie: Machine Learning Methods
Kategorie: Markov Model
Kategorie: Deep Learning
Kategorie: Algorithms
Kategorie: Computational Logic
Kategorie: Artificial Intelligence and Machine Learning (AI/ML)
Kategorie: Machine Learning Algorithms
Kategorie: Model Evaluation
Kategorie: Artificial Intelligence
Kategorie: Responsible AI
Kategorie: Agentic systems
Kategorie: Reinforcement Learning
Kategorie: Decision Intelligence
Kategorie: Model Optimization
Kategorie: Statistical Machine Learning

Werkzeuge, die Sie lernen werden

Kategorie: AI Workflows

Wichtige Details

Zertifikat zur Vorlage

Zu Ihrem LinkedIn-Profil hinzufügen

Unterrichtet in Englisch

Kürzlich aktualisiert!

Juli 2026

Erfahren Sie, wie Mitarbeiter führender Unternehmen gefragte Kompetenzen erwerben.

Weitere Informationen zu Coursera für Unternehmen

Logos von Petrobras, TATA, Danone, Capgemini, P&G und L'Oreal

Erweitern Sie Ihre Fachkenntnisse.

Erlernen Sie gefragte Kompetenzen von Universitäten und Branchenexperten.
Erlernen Sie ein Thema oder ein Tool mit echten Projekten.
Entwickeln Sie ein fundiertes Verständnisse der Kernkonzepte.
Erwerben Sie ein Karrierezertifikat von University of Colorado Boulder.

Spezialisierung - 3 Kursreihen

Reinforcement learning studies how agents learn to make better decisions through interaction with an environment. Agents act, observe consequences, receive feedback, and adapt future behavior. This specialization develops reinforcement learning as a framework for sequential decision-making under uncertainty, progressing from classical foundations to scalable deep learning methods and reward design.

The first course, Classical Reinforcement Learning, introduces finite-state decision problems, Markov chains, Markov decision processes, discounted rewards, Bellman equations, planning with known models, and learning from sampled experience. Learners study value iteration, policy iteration, Monte Carlo methods, temporal-difference learning, SARSA, and Q-learning.

The second course, Deep Reinforcement Learning, shows how reinforcement learning scales beyond tabular settings using neural-network-based function approximation. Learners study Deep Q-Networks, replay buffers, target networks, policy-gradient methods, actor–critic algorithms, and modern methods such as PPO, DDPG, and SAC, with attention to stability, diagnosis, evaluation, and reproducibility.

The third course, Reward Programming, addresses how to design, infer, monitor, and revise objectives so agents learn intended behavior. Learners study temporal logic, automata, reward machines, reward shaping, inverse reinforcement learning, preference feedback, safety, shielding, auditing, and stress testing.

Übungsprojekt

Learners complete conceptual quizzes throughout the specialization to check their understanding of reinforcement learning foundations, deep RL methods, and reward-design principles. These assessments emphasize interpreting algorithms, diagnosing learning behavior, and reasoning about how agents make decisions under uncertainty.

Mastering Classic Reinforcement Learning Algorithms

KURS 1, 14 Stunden

Was Sie lernen werden

Formulate sequential decision-making problems as deterministic decision processes, Markov chains, and finite Markov decision processes.
Explain and apply core reinforcement-learning concepts, including discounting, value functions, policies, Bellman equations, and optimality.
Implement planning algorithms for finite Markov decision processes, including value iteration, policy iteration, and linear programming formulations.
Compare tabular reinforcement-learning algorithms, including bandits, Monte Carlo methods, temporal-difference learning, SARSA, and Q-learning.

Kompetenzen, die Sie erwerben

Kategorie: Reinforcement Learning

Kategorie: Probability Distribution

Kategorie: Model Optimization

Kategorie: Markov Model

Kategorie: Probability & Statistics

Kategorie: Statistical Machine Learning

Kategorie: Machine Learning

Kategorie: Decision Intelligence

Kategorie: Algorithms

Kategorie: Sampling (Statistics)

Kategorie: Artificial Intelligence and Machine Learning (AI/ML)

Kategorie: Machine Learning Algorithms

Kategorie: Applied Mathematics

Deep Reinforcement Learning: From Theory to Practice

KURS 2, 14 Stunden

Was Sie lernen werden

Explain how neural-network-based function approximation extends reinforcement learning beyond finite tabular settings.
Implement and evaluate value-based deep reinforcement learning algorithms, including Deep Q-Networks and stabilizing techniques.
Derive and implement policy-gradient methods, including REINFORCE, baselines, and advantage-based updates.
Explain and analyze actor–critic methods that combine policy optimization with value estimation.

Kompetenzen, die Sie erwerben

Kategorie: Reinforcement Learning

Kategorie: Deep Learning

Kategorie: Machine Learning Methods

Kategorie: Agentic systems

Kategorie: Model Optimization

Kategorie: Machine Learning Algorithms

Kategorie: Artificial Intelligence

Kategorie: Model Training

Kategorie: System Design and Implementation

Kategorie: Applied Machine Learning

Kategorie: Machine Learning

Kategorie: Artificial Neural Networks

Kategorie: Algorithms

Kategorie: Model Evaluation

Reward Programming: Optimizing RL Efficiency and Safety

KURS 3, 13 Stunden

Was Sie lernen werden

Identify limitations of standard scalar reward formulations, including reward hacking, specification gaming, and brittle proxies.
Express structured learning objectives using formal tools such as temporal logic, automata, and reward machines.
Construct and analyze reward mechanisms based on temporal logic, automata, product MDPs, reward machines, and reward shaping.
Model reward-programming problems under hidden state, memory, hierarchy, multiagent interaction, and continuous-time dynamics

Kompetenzen, die Sie erwerben

Kategorie: Agentic systems

Kategorie: Machine Learning Methods

Kategorie: Safety and Security

Kategorie: Machine Learning

Kategorie: Model Evaluation

Kategorie: Theoretical Computer Science

Kategorie: Functional Specification

Kategorie: Verification And Validation

Kategorie: Computational Logic

Kategorie: Reinforcement Learning

Kategorie: Continuous Monitoring

Kategorie: Model Optimization

Kategorie: Markov Model

Kategorie: Responsible AI

Kategorie: AI Workflows

Erwerben Sie ein Karrierezertifikat.

Fügen Sie dieses Zeugnis Ihrem LinkedIn-Profil, Lebenslauf oder CV hinzu. Teilen Sie sie in Social Media und in Ihrer Leistungsbeurteilung.

Dozent

Ashutosh Trivedi

University of Colorado Boulder

2 Kurse47 Lernende

von

University of Colorado Boulder

Warum entscheiden sich Menschen für Coursera für ihre Karriere?

Felipe M.

Lernender seit 2018

„Es ist eine großartige Erfahrung, in meinem eigenen Tempo zu lernen. Ich kann lernen, wenn ich Zeit und Nerven dazu habe.“

Jennifer J.

Lernender seit 2020

„Bei einem spannenden neuen Projekt konnte ich die neuen Kenntnisse und Kompetenzen aus den Kursen direkt bei der Arbeit anwenden.“

Larry W.

Lernender seit 2021

„Wenn mir Kurse zu Themen fehlen, die meine Universität nicht anbietet, ist Coursera mit die beste Alternative.“

Chaitanya A.

„Man lernt nicht nur, um bei der Arbeit besser zu werden. Es geht noch um viel mehr. Bei Coursera kann ich ohne Grenzen lernen.“

Häufig gestellte Fragen

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Weitere Fragen

Besuchen Sie die das Hilfe-Center für Kursteilnehmer.

Finanzielle Unterstützung verfügbar,