This course addresses evaluating Large Language Models (LLMs), starting with foundational evaluation methods, exploring advanced techniques with Vertex AI's tools like Automatic Metrics and AutoSxS, and forecasting the evolution of generative AI evaluation.



Evaluating Large Language Model Outputs: A Practical Guide
This course is part of Harnessing LLMs: Strategy, Fine-Tuning & Evaluation Specialization


Instructors: Reza Moradinezhad
Access provided by Duke University
Recommended experience
What you'll learn
Identify the fundamentals of Large Language Models, including current evaluation methods and access to Vertex AI's evaluation models.
Apply hands-on knowledge of using Vertex AI's Automatic Metrics and AutoSxS for LLM evaluation.
Evaluate upcoming trends in generative AI evaluation, encompassing text, image, and audio models, and the importance of human evaluation.
Skills you'll gain
Details to know

Add to your LinkedIn profile
1 assignment
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There is 1 module in this course
This course addresses evaluating Large Language Models (LLMs), starting with foundational evaluation methods, exploring advanced techniques with Vertex AI's tools like Automatic Metrics and AutoSxS, and forecasting the evolution of generative AI evaluation.
What's included
13 videos4 readings1 assignment3 plugins
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Offered by
Why people choose Coursera for their career




Explore more from Data Science
H2O.ai
Google Cloud
Coursera Instructor Network