Large language models (LLMs) are trained on human-generated text, but additional methods are needed to align an LLM with human values and preferences.


Reinforcement Learning from Human Feedback

Instructor: Nikita Namjoshi
2,729 already enrolled
(32 reviews)
What you'll learn
- Get a conceptual understanding of Reinforcement Learning from Human Feedback (RLHF) and of the datasets this technique requires.
- Fine-tune the Llama 2 model using RLHF with the open-source Google Cloud Pipeline Components Library.
- Evaluate the tuned model's performance against the base model.
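To make the dataset idea concrete, here is a minimal sketch of what a human-preference record might look like and how a reward model is commonly scored against it. The field names (`input_text`, `candidate_0`, `candidate_1`, `choice`), the example strings, and the function `pairwise_preference_loss` are illustrative assumptions, not taken from the course materials or from the Google Cloud Pipeline Components Library. The loss shown is the standard pairwise (Bradley-Terry style) objective often used for reward models, which may differ from what the project uses.

```python
import math

# Hypothetical preference record: one prompt, two candidate completions,
# and a human label saying which candidate was preferred (0 or 1).
# All field names and strings here are assumptions for illustration.
preference_example = {
    "input_text": "Summarize: The meeting was moved from Monday to Friday.",
    "candidate_0": "The meeting is now on Friday.",
    "candidate_1": "There was a meeting.",
    "choice": 0,
}


def pairwise_preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Return -log(sigmoid(reward_chosen - reward_rejected)).

    The loss is small when the reward model scores the human-preferred
    completion higher than the rejected one, and large otherwise.
    """
    diff = reward_chosen - reward_rejected
    # Numerically stable form of log(1 + exp(-diff)).
    if diff >= 0:
        return math.log1p(math.exp(-diff))
    return -diff + math.log1p(math.exp(diff))


# Made-up reward scores for the two candidates above.
rewards = {"candidate_0": 1.5, "candidate_1": -0.5}
chosen_key = f"candidate_{preference_example['choice']}"
rejected_key = f"candidate_{1 - preference_example['choice']}"
loss = pairwise_preference_loss(rewards[chosen_key], rewards[rejected_key])
```

In this sketch, a reward model that ranks the two candidates the same way the human labeler did gets a lower loss than one that ranks them the other way around.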
Details to know
Only available on desktop

Learn, practice, and apply job-ready skills in less than 2 hours
- Receive training from industry experts
- Gain hands-on experience solving real-world job tasks

How you'll learn
Hands-on, project-based learning
Practice new skills by completing job-related tasks with step-by-step instructions.
No downloads or installation required
Access the tools and resources you need in a cloud environment.
Available only on desktop
This project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
Learner reviews
32 reviews
- 5 stars: 71.87%
- 4 stars: 25%
- 3 stars: 3.12%
- 2 stars: 0%
- 1 star: 0%
Reviewed on Jun 18, 2025
better to be expanded a bit, but overall, it is super course
Reviewed on Jan 11, 2025
Overall worth a shot. Not in depth but good overview