Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

Spécialisation "LLM Optimization & Evaluation"

Optimize & Deploy Production-Ready LLM Systems.

Build expertise in LLM evaluation, optimization, and deployment through hands-on MLOps projects.

Instructeurs : John Whitworth

Inclus avec En savoir plus

Demander à Coursera

Série de 13 cours

Approfondissez votre connaissance d’un sujet

niveau Intermédiaire

Expérience recommandée

4 semaines à compléter

à 10 heures par semaine

Planning flexible

Apprenez à votre propre rythme

Série de 13 cours

Approfondissez votre connaissance d’un sujet

niveau Intermédiaire

Expérience recommandée

4 semaines à compléter

à 10 heures par semaine

Planning flexible

Apprenez à votre propre rythme

Ce que vous apprendrez

Evaluate and optimize LLM performance using statistical testing, MLOps tools, and production monitoring systems.
Build automated pipelines for feature engineering, experiment tracking, and data processing with industry-standard tools.
Diagnose LLM errors, implement safety frameworks, and reduce operational costs through systematic analysis.

Compétences que vous acquerrez

Catégorie : MLOps (Machine Learning Operations)
Catégorie : Large Language Modeling
Catégorie : Model Optimization
Catégorie : Extract, Transform, Load
Catégorie : Technical Communication
Catégorie : Version Control
Catégorie : SQL
Catégorie : Prompt Patterns
Catégorie : Technical Documentation
Catégorie : LLM Application
Catégorie : Statistical Analysis
Catégorie : Performance Tuning
Catégorie : Root Cause Analysis
Catégorie : Fine-tuning
Catégorie : Data Pipelines
Catégorie : Data Presentation
Catégorie : User Acceptance Testing (UAT)
Catégorie : Scripting
Catégorie : AI Security

Outils que vous découvrirez

Catégorie : Python Programming
Catégorie : Apache Airflow

Détails à connaître

Certificat partageable

Ajouter à votre profil LinkedIn

Enseigné en Anglais

Découvrez comment les employés des entreprises prestigieuses maîtrisent des compétences recherchées

En savoir plus sur Coursera pour les affaires

logos de Petrobras, TATA, Danone, Capgemini, P&G et L'Oreal

Améliorez votre expertise en la matière

Acquérez des compétences recherchées auprès d’universités et d’experts du secteur
Maîtrisez un sujet ou un outil avec des projets pratiques
Développez une compréhension approfondie de concepts clés
Obtenez un certificat professionnel auprès de Coursera

Spécialisation - série de 13 cours

Learn the complete lifecycle of LLM optimization and evaluation through hands-on experience with production-ready techniques. This comprehensive specialization equips you with essential skills to evaluate, optimize, and deploy large language models effectively. You'll learn to engineer features for ML models, implement rigorous statistical testing for LLM performance, diagnose and fix hallucinations through log analysis, optimize both computational costs and database performance, and build robust safety testing frameworks. The program progresses from foundational ML concepts through advanced MLOps practices, covering experiment tracking with tools like DVC and W&B, automated cloud workflows, data pipeline management with Apache Airflow, and product development workflows including requirements documentation and user acceptance testing. Through practical projects, you'll analyze LLM spend reports to reduce operational costs, implement value-stream mapping to streamline ML pipelines, create comprehensive testing suites with mutation testing, and develop operational runbooks for production systems. Whether you're optimizing SQL queries for vector search, conducting A/B tests for model improvements, or building automated monitoring systems, this specialization provides the technical depth and practical experience needed to excel in LLM engineering roles.

Projet d'apprentissage appliqué

Apply your skills through industry-relevant projects including building feature engineering pipelines with MLOps tools, creating statistical testing frameworks to evaluate LLM performance, diagnosing and resolving hallucination issues through data analysis, optimizing vector search and SQL queries for production systems, and developing comprehensive safety testing suites. You'll also track ML experiments using version control systems, automate cloud workflows with Python scripts, build data pipelines with Apache Airflow, and create complete product requirements and testing documentation for LLM features.

Engineer Features and Evaluate Models for Production

COURS 1, 3 heures

Ce que vous apprendrez

Build feature engineering pipelines and evaluate ML experiments using MLOps tools to select and deploy production-ready models.

Compétences que vous acquerrez

Catégorie : Technical Writing

Catégorie : Analysis

Catégorie : Model Evaluation

Catégorie : Feature Engineering

Catégorie : Performance Analysis

Catégorie : Machine Learning Methods

Catégorie : Data Pipelines

Catégorie : Model Training

Catégorie : Model Deployment

Catégorie : Data Preprocessing

Catégorie : Model Optimization

Catégorie : Data Transformation

Catégorie : MLOps (Machine Learning Operations)

Optimize Deep Learning: Tune PyTorch Models

COURS 2, 4 heures

Ce que vous apprendrez

Use PyTorch Lightning to implement callbacks, diagnose instabilities, and optimize model performance.

Compétences que vous acquerrez

Catégorie : Debugging

Catégorie : Fine-tuning

Catégorie : Model Training

Catégorie : MLOps (Machine Learning Operations)

Catégorie : Transfer Learning

Catégorie : Model Deployment

Catégorie : Deep Learning

Catégorie : Performance Tuning

Catégorie : Scalability

Catégorie : Artificial Neural Networks

Catégorie : Model Optimization

Catégorie : PyTorch (Machine Learning Library)

Evaluate & Optimize LLM Performance

COURS 3, 4 heures

Ce que vous apprendrez

Evaluate LLMs using metrics like BLEU & ROUGE run A/B tests for statistical significance, and optimize model performance with data-driven strategies.

Compétences que vous acquerrez

Catégorie : Statistical Inference

Catégorie : Statistical Methods

Catégorie : Model Evaluation

Catégorie : Test Script Development

Catégorie : Statistical Analysis

Catégorie : Scripting

Catégorie : Performance Metric

Catégorie : Probability & Statistics

Catégorie : Large Language Modeling

Catégorie : Model Optimization

Catégorie : Statistical Hypothesis Testing

Catégorie : LLM Application

Catégorie : Prompt Engineering

Catégorie : Data-Driven Decision-Making

Catégorie : Embeddings

Catégorie : Natural Language Processing

Analyze Logs: Fix LLM Hallucinations

COURS 4, 4 heures

Ce que vous apprendrez

Use data analysis to diagnose LLM hallucinations by correlating user behavior and system errors, and document findings to guide engineering fixes.

Compétences que vous acquerrez

Catégorie : Analysis

Catégorie : Large Language Modeling

Catégorie : Correlation Analysis

Catégorie : Business Metrics

Catégorie : Root Cause Analysis

Catégorie : Data Manipulation

Catégorie : Artificial Intelligence

Catégorie : Technical Communication

Catégorie : LLM Application

Catégorie : Generative AI

Catégorie : Pandas (Python Package)

Catégorie : Retrieval-Augmented Generation

Catégorie : Data Analysis

Catégorie : Debugging

Evaluate LLMs: Test and Prove Significance

COURS 5, 3 heures

Ce que vous apprendrez

Rigorously evaluate LLM performance using statistical tests and confidence intervals to make data-driven deployment decisions.

Compétences que vous acquerrez

Catégorie : Model Evaluation

Catégorie : Statistical Hypothesis Testing

Catégorie : Statistical Software

Catégorie : Matplotlib

Catégorie : Model Deployment

Catégorie : Statistical Programming

Catégorie : Data Presentation

Catégorie : Data-Driven Decision-Making

Catégorie : Large Language Modeling

Catégorie : Scientific Visualization

Catégorie : Data Storytelling

Catégorie : Performance Metric

Catégorie : Statistical Inference

Catégorie : Statistical Methods

Catégorie : Experimentation

Catégorie : Statistical Analysis

Catégorie : Statistical Visualization

Catégorie : Statistics

Optimize SQL: Build Fast Data Pipelines

COURS 6, 3 heures

Ce que vous apprendrez

Parameterized SQL with CTEs and window functions builds scalable, maintainable pipelines that adapt as business needs change.
Query optimization is systematic: analyze execution plans, find costly steps, then resolve them with indexing or rewrites.
Materialized summary tables and well-timed processing, like morning refreshes, support reliable analytics infrastructure.
Understanding execution internals helps analysts build self-sufficient workflows without recurring engineering delays.

Compétences que vous acquerrez

Catégorie : Performance Tuning

Catégorie : SQL

Catégorie : Extract, Transform, Load

Catégorie : Data Transformation

Catégorie : Scripting

Catégorie : Database Management

Catégorie : Data Manipulation

Catégorie : Data Pipelines

Safeguard LLM Outputs: Test and Evaluate

COURS 7, 3 heures

Ce que vous apprendrez

Build and validate a robust safety testing framework for LLMs. Create behavioral test suites and use mutation testing to ensure their effectiveness.

Compétences que vous acquerrez

Catégorie : Security Testing

Catégorie : Verification And Validation

Catégorie : Large Language Modeling

Catégorie : Software Technical Review

Catégorie : LLM Application

Catégorie : Test Tools

Catégorie : Code Coverage

Catégorie : Prompt Patterns

Catégorie : Prompt Engineering

Catégorie : Test Case

Catégorie : Quality Assessment

Catégorie : Responsible AI

Catégorie : AI Security

Catégorie : Maintainability

Catégorie : Model Evaluation

Catégorie : Testability

Catégorie : Unit Testing

Catégorie : Threat Modeling

Catégorie : Software Testing

Catégorie : Test Script Development

Track and Evaluate ML Model Experiments

COURS 8, 3 heures

Ce que vous apprendrez

Track, version, and evaluate ML experiments using DVC and W&B to reliably select and prepare models for production deployment.

Compétences que vous acquerrez

Catégorie : MLOps (Machine Learning Operations)

Catégorie : Model Evaluation

Catégorie : Version Control

Catégorie : Predictive Modeling

Catégorie : Dashboard

Catégorie : Large Language Modeling

Catégorie : Interactive Data Visualization

Catégorie : Model Training

Catégorie : Data Management

Catégorie : Performance Analysis

Catégorie : Record Keeping

Catégorie : Model Deployment

Catégorie : Machine Learning

Automate Cloud Workflows with Python Scripting

COURS 9, 1 heure

Ce que vous apprendrez

Create automated Python scripts to manage multi-step cloud workflows, from provisioning resources to persisting data.

Compétences que vous acquerrez

Catégorie : Scripting

Catégorie : Infrastructure as Code (IaC)

Catégorie : AI Workflows

Catégorie : Data Persistence

Catégorie : Python Programming

Catégorie : Virtual Machines

Catégorie : Data Pipelines

Catégorie : Command-Line Interface

Automate Data Pipelines: Schema Evolution

COURS 10, 2 heures

Ce que vous apprendrez

Build automated data pipelines with Apache Airflow, manage schema evolution to prevent failures, and implement monitoring for data integrity.

Compétences que vous acquerrez

Catégorie : Data Pipelines

Catégorie : Apache Airflow

Catégorie : Data Integrity

Catégorie : Data Transformation

Catégorie : System Monitoring

Catégorie : Extract, Transform, Load

Catégorie : Data Validation

Catégorie : Continuous Monitoring

Catégorie : Data Modeling

Catégorie : Data Quality

Develop and Evaluate LLM Features Effectively

COURS 11, 3 heures

Ce que vous apprendrez

Translate an LLM product concept into a detailed PRD and create a UAT plan to validate that the delivered feature meets user requirements.

Compétences que vous acquerrez

Catégorie : User Acceptance Testing (UAT)

Catégorie : AI Product Strategy

Catégorie : Product Requirements

Catégorie : Business Requirements

Catégorie : Verification And Validation

Catégorie : Prioritization

Catégorie : Large Language Modeling

Catégorie : Requirements Analysis

Catégorie : Test Planning

Catégorie : Acceptance Testing

Catégorie : Functional Requirement

Catégorie : User Requirements Documents

Catégorie : Key Performance Indicators (KPIs)

Catégorie : User Story

Catégorie : LLM Application

Catégorie : Functional Testing

Document and Evaluate LLM Prompting Success

COURS 12, 2 heures

Ce que vous apprendrez

Create operational run-books for LLM systems and evaluate prompt patterns to improve performance and reduce operational costs.

Compétences que vous acquerrez

Catégorie : Prompt Patterns

Catégorie : Prompt Engineering

Catégorie : Technical Documentation

Catégorie : Model Optimization

Catégorie : LLM Application

Catégorie : Data Maintenance

Catégorie : Requirements Analysis

Catégorie : Performance Tuning

Catégorie : Technical Writing

Catégorie : Configuration Management

Catégorie : Performance Testing

Catégorie : Large Language Modeling

Catégorie : Token Optimization

Catégorie : Benchmarking

Catégorie : Retrieval-Augmented Generation

Catégorie : MLOps (Machine Learning Operations)

Optimize LLM Costs & Streamline Processes

COURS 13, 2 heures

Ce que vous apprendrez

Optimize LLM costs by analyzing spend reports and streamline ML pipelines using value-stream mapping to boost efficiency and reduce cycle times.

Compétences que vous acquerrez

Catégorie : Model Optimization

Catégorie : Process Improvement and Optimization

Catégorie : Proposal Development

Catégorie : Collaborative Software

Catégorie : Productivity Software

Catégorie : AI Workflows

Catégorie : Data-Driven Decision-Making

Catégorie : Operating Cost

Catégorie : Business Workflow Analysis

Catégorie : Lean Manufacturing

Catégorie : Process Analysis

Catégorie : Process Optimization

Catégorie : Miro AI

Catégorie : Process Modeling

Catégorie : Cost Management

Catégorie : Waste Minimization

Catégorie : LLM Application

Obtenez un certificat professionnel

Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.

Instructeurs

John Whitworth

29 Cours3 356 apprenants

LearningMate

276 Cours37 669 apprenants

Offert par

Coursera

Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?

Felipe M.

Étudiant(e) depuis 2018

’Pouvoir suivre des cours à mon rythme à été une expérience extraordinaire. Je peux apprendre chaque fois que mon emploi du temps me le permet et en fonction de mon humeur.’

Jennifer J.

Étudiant(e) depuis 2020

’J'ai directement appliqué les concepts et les compétences que j'ai appris de mes cours à un nouveau projet passionnant au travail.’

Larry W.

Étudiant(e) depuis 2021

’Lorsque j'ai besoin de cours sur des sujets que mon université ne propose pas, Coursera est l'un des meilleurs endroits où se rendre.’

Chaitanya A.

’Apprendre, ce n'est pas seulement s'améliorer dans son travail : c'est bien plus que cela. Coursera me permet d'apprendre sans limites.’

Débloquez l'accès à plus de 10 000 cours grâce à un abonnement
Faites progresser votre carrière avec un diplôme en ligne
Obtenez un diplôme auprès d’universités de renommée mondiale - 100 % en ligne
Rejoignez les 4 700 entreprises internationales qui ont choisi Coursera for Business.

Foire Aux Questions

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Plus de questions

Visitez le Centre d'Aide pour les Étudiants

Aide financière disponible,

Spécialisation "LLM Optimization & Evaluation"

Spécialisation "LLM Optimization & Evaluation"

Ce que vous apprendrez

Compétences que vous acquerrez

Outils que vous découvrirez

Détails à connaître

Découvrez comment les employés des entreprises prestigieuses maîtrisent des compétences recherchées

Améliorez votre expertise en la matière

Spécialisation - série de 13 cours

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Ce que vous apprendrez

Compétences que vous acquerrez

Obtenez un certificat professionnel

Instructeurs

Offert par

Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.

Foire Aux Questions

Is this course really 100% online? Do I need to attend any classes in person?

Can I just enroll in a single course?

Is financial aid available?

Plus de questions