Who is this program for?

Experienced ML engineers, software engineers, data scientists and AI practitioners who want hands‑on, production‑focused expertise in LLMs and AI systems.

What background knowledge is necessary?

Strong programming and ML literacy are required; you should be comfortable with Python, statistics and fundamental ML workflows.

Is this course really 100% online? Do I need to attend any classes in person?

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Can I just enroll in a single course?

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Certificate, you’re automatically subscribed to the full Certificate. Visit your learner dashboard to track your progress.

LLM Engineering That Works: Prompting, Tuning, and Retrieval Professional Certificate

Engineer Production-Ready LLM Systems.

Learn prompting, tuning, retrieval, and scalable architectures for reliable AI applications.

Instructor: Professionals from the Industry

Included with

Learn more

6 course series

Earn a career credential that demonstrates your expertise

Intermediate level

Recommended experience

2 months to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

6 course series

Earn a career credential that demonstrates your expertise

Intermediate level

Recommended experience

2 months to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Design and deploy production-grade LLM systems combining prompting, tuning, and retrieval
Build reliable, scalable AI pipelines with evaluation, monitoring, and governance
Apply responsible AI practices, ethics, and safety throughout the lifecycle of LLMs

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your career with in-demand skills

Receive professional-level training from Coursera
Demonstrate your technical proficiency
Earn an employer-recognized certificate from Coursera

Professional Certificate - 6 course series

LLM Engineering That Works is an advanced, multi-course professional certificate designed to prepare you for production-grade AI systems. The program combines five long courses, 19 short courses (including career development), and four add-on modules to cover the end-to-end lifecycle of large language models—from prompt design and model tuning to retrieval, evaluation, and deployment. You’ll learn to build robust, ethical, and cost-efficient LLM solutions through hands-on projects that mirror real-world workflows. By completing this program, you’ll gain the skills needed to design, implement, monitor, and improve scalable LLM-enabled applications across industries.

Who this is for: Experienced ML engineers, software engineers, data scientists, and AI practitioners who seek hands-on, production-focused expertise in LLMs and AI systems. A strong programming background and familiarity with ML concepts are recommended.

Applied Learning Project

This program guides you through a portfolio of production-focused projects. You will work on an End-to-End LLM Performance Audit, diagnose flawed LLM architectures, add safety guardrails to LLM services, design multi-region, scalable LLM architectures, and optimize costs and compute. A capstone-style set of add-on projects rounds out your portfolio, demonstrating end-to-end capabilities from data pipelines to deployment and monitoring in real-world settings.

Production AI Model Development and Ethics

Course 1, 10 hours

What you'll learn

Apply custom training loops with callbacks (early-stopping, checkpointing) and diagnose gradient issues using norm and activation analysis.
Implement feature engineering pipelines for structured and text data, then evaluate ML experiments to select production-ready models.
Create comprehensive model cards for LLM features that detail intended use, technical limitations, and specific fairness metrics.
Evaluate AI systems against established ethical guidelines to identify biases and propose actionable mitigation strategies.

Skills you'll gain

Category: Model Evaluation

Category: Technical Documentation

Category: Responsible AI

Category: Feature Engineering

Category: Model Deployment

Category: Model Training

Category: Scikit Learn (Machine Learning Library)

Category: PyTorch (Machine Learning Library)

Category: MLOps (Machine Learning Operations)

Category: Data Preprocessing

Category: Deep Learning

Category: Model Optimization

Category: Software Documentation

Category: Data Ethics

Category: Data Pipelines

Building Reliable LLM Systems

Course 2, 18 hours

What you'll learn

Build scripts with lexical/semantic metrics to evaluate LLMs, diagnose hallucinations, and balance vector-search recall against latency.
Apply hypothesis testing, confidence intervals, and significance metrics to evaluate model accuracy and validate results from A/B experiments.
Utilize parameterized SQL and data manipulation to segment user logs, calculate retention, and securely retrieve large-scale datasets.
Analyze LLM performance gaps to prioritize technical fixes and implement remediation measures for production-level reliability.

Skills you'll gain

Category: Model Evaluation

Category: Performance Tuning

Category: Performance Testing

Category: Statistical Methods

Category: SQL

Category: Data-Driven Decision-Making

Category: Statistical Analysis

Category: Artificial Intelligence and Machine Learning (AI/ML)

Category: LLM Application

Category: Statistical Hypothesis Testing

Category: MLOps (Machine Learning Operations)

Category: Large Language Modeling

Category: Debugging

Category: Retrieval-Augmented Generation

Category: Query Languages

Category: Vector Databases

Category: Python Programming

Testing and Refining LLM Applications

Course 3, 13 hours

What you'll learn

Apply TDD to microservice endpoints and refactor modules based on code reviews to improve readability and reduce complexity.
Develop behavior and safety tests to ensure LLM outputs comply with policies and block unsafe changes to the model.
Apply data versioning to track artifacts and evaluate ML experiment runs to select production-ready models.
Create scripts using Python's argparse to automate multi-step computational workflows in cloud environments.

Skills you'll gain

Category: Software Testing

Category: Test Driven Development (TDD)

Category: AI Security

Category: Continuous Integration

Category: Security Testing

Category: MLOps (Machine Learning Operations)

Category: Unit Testing

Category: Test Script Development

Category: Responsible AI

Category: SQL

Category: Python Programming

Category: AI Workflows

Category: LLM Application

Category: Model Deployment

Category: Statistical Analysis

Category: Test Case

Category: Large Language Modeling

Category: Testability

Category: Test Automation

Category: CI/CD

Designing Production LLM Architectures

Course 4, 11 hours

What you'll learn

Compare synchronous and asynchronous architectures and apply 12-factor principles and container orchestration to deploy scalable microservices.
Analyze multi-region deployments, pinpoint latency bottlenecks, and design resilient architecture improvements via fault analysis.
Create Airflow DAGs to automate data workflows and analyze the impact of schema evolution on downstream processes and tests.
Analyze trade-offs between self-hosting models vs. managed APIs and evaluate proposed infrastructure for fault tolerance and cost.

Skills you'll gain

Category: Application Deployment

Category: Apache Airflow

Category: Scalability

Category: Microservices

Category: Cloud-Native Computing

Category: Diagram Design

Category: Software Architecture

Category: Data Pipelines

Category: Kubernetes

Category: Containerization

Category: Large Language Modeling

Category: Managed Services

Category: LLM Application

Category: AWS CloudFormation

Category: Systems Architecture

Category: Software Design

Category: Infrastructure Architecture

Category: Azure DevOps

Category: Open Source Technology

Category: Model Deployment

Evaluating LLM Performance and Efficiency

Course 5, 9 hours

What you'll learn

Create PRDs with requirements and success metrics, and evaluate features against user-story acceptance criteria to identify gaps.
Evaluate prompt patterns and compute-spend reports to implement model-optimization techniques that reduce operational costs.
Analyze pipelines using value-stream mapping to eliminate inefficiencies and prioritize chatbot KPI optimizations.
Create technical documentation for vector index updates and evaluate system effectiveness against business requirements.

Skills you'll gain

Category: Prompt Engineering

Category: Product Requirements

Category: Model Optimization

Category: Process Optimization

Category: Token Optimization

Category: Prompt Patterns

Category: Operational Efficiency

Category: Large Language Modeling

Category: Process Mapping

Category: User Requirements Documents

Category: MLOps (Machine Learning Operations)

Category: Business Process Automation

Category: Product Lifecycle Management

Category: Key Performance Indicators (KPIs)

Category: LLM Application

Category: Process Driven Development

Category: Artificial Intelligence and Machine Learning (AI/ML)

Category: Product Management

Category: Process Design

Category: Cost Containment

Advancing Your Career in Production AI

Course 6, 1 hour

What you'll learn

Position yourself for senior AI roles by creating a strategic portfolio and mastering advanced system design and ethics-focused technical interviews.

Skills you'll gain

Category: Responsible AI

Category: Data Ethics

Category: Model Training

Category: AI Security

Category: MLOps (Machine Learning Operations)

Category: System Design and Implementation

Category: AWS CloudFormation

Category: Model Optimization

Category: Python Programming

Category: Technical Communication

Category: Artificial Intelligence and Machine Learning (AI/ML)

Category: Model Deployment

Category: Technical Design

Category: Communication

Category: AI Workflows

Category: CI/CD

Category: Apache Airflow

Category: LLM Application

Category: Prompt Engineering

Category: SQL

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Professionals from the Industry

475 Courses90,550 learners

Offered by

Coursera

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Unlock access to 10,000+ courses with a subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 4,700 global companies that choose Coursera for Business

Frequently asked questions

This is an advanced program that assumes prior ML and software development experience. Some courses may require comfort with core ML concepts and Python programming.

You’ll gain experience with PyTorch, scikit-learn, vector-search techniques (e.g., HNSW), SQL, and workflow tools such as Airflow, among others.

The program targets advanced ML/AI engineering roles, including MLOps Engineer, ML Engineer, AI Architect, and related production-focused positions.