Coursera

LLM Engineering That Works: Prompting, Tuning, and Retrieval Professional Certificate

Coursera

LLM Engineering That Works: Prompting, Tuning, and Retrieval Professional Certificate

Engineer Production-Ready LLM Systems.

Learn prompting, tuning, retrieval, and scalable architectures for reliable AI applications.

Included with Coursera Plus

Earn a career credential that demonstrates your expertise
Intermediate level

Recommended experience

2 months to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Earn a career credential that demonstrates your expertise
Intermediate level

Recommended experience

2 months to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Design and deploy production-grade LLM systems combining prompting, tuning, and retrieval

  • Build reliable, scalable AI pipelines with evaluation, monitoring, and governance

  • Apply responsible AI practices, ethics, and safety throughout the lifecycle of LLMs

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English
Recently updated!

March 2026

91%

of learners achieved a positive career outcome

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your career with in-demand skills

  • Receive professional-level training from Coursera
  • Demonstrate your technical proficiency
  • Earn an employer-recognized certificate from Coursera

Professional Certificate - 6 course series

Production AI Model Development and Ethics

Production AI Model Development and Ethics

Course 1, 10 hours

What you'll learn

  • Apply custom training loops with callbacks (early-stopping, checkpointing) and diagnose gradient issues using norm and activation analysis.

  • Implement feature engineering pipelines for structured and text data, then evaluate ML experiments to select production-ready models.

  • Create comprehensive model cards for LLM features that detail intended use, technical limitations, and specific fairness metrics.

  • Evaluate AI systems against established ethical guidelines to identify biases and propose actionable mitigation strategies.

Skills you'll gain

Category: Responsible AI
Category: Model Deployment
Category: Model Evaluation
Category: Feature Engineering
Category: Model Training
Category: Technical Documentation
Category: MLOps (Machine Learning Operations)
Category: Scikit Learn (Machine Learning Library)
Category: PyTorch (Machine Learning Library)
Category: Deep Learning
Category: Model Optimization
Category: Data Ethics
Category: Data Pipelines
Category: Software Documentation
Category: Data Preprocessing
Building Reliable LLM Systems

Building Reliable LLM Systems

Course 2, 18 hours

What you'll learn

  • Build scripts with lexical/semantic metrics to evaluate LLMs, diagnose hallucinations, and balance vector-search recall against latency.

  • Apply hypothesis testing, confidence intervals, and significance metrics to evaluate model accuracy and validate results from A/B experiments.

  • Utilize parameterized SQL and data manipulation to segment user logs, calculate retention, and securely retrieve large-scale datasets.

  • Analyze LLM performance gaps to prioritize technical fixes and implement remediation measures for production-level reliability.

Skills you'll gain

Category: Model Evaluation
Category: SQL
Category: Performance Tuning
Category: Performance Testing
Category: Statistical Analysis
Category: Data-Driven Decision-Making
Category: Statistical Methods
Category: LLM Application
Category: MLOps (Machine Learning Operations)
Category: Retrieval-Augmented Generation
Category: Debugging
Category: Vector Databases
Category: Large Language Modeling
Category: Python Programming
Category: Artificial Intelligence and Machine Learning (AI/ML)
Category: Statistical Hypothesis Testing
Category: Query Languages
Testing and Refining LLM Applications

Testing and Refining LLM Applications

Course 3, 13 hours

What you'll learn

  • Apply TDD to microservice endpoints and refactor modules based on code reviews to improve readability and reduce complexity.

  • Develop behavior and safety tests to ensure LLM outputs comply with policies and block unsafe changes to the model.

  • Apply data versioning to track artifacts and evaluate ML experiment runs to select production-ready models.

  • Create scripts using Python's argparse to automate multi-step computational workflows in cloud environments.

Skills you'll gain

Category: Test Driven Development (TDD)
Category: Security Testing
Category: AI Security
Category: MLOps (Machine Learning Operations)
Category: Unit Testing
Category: Continuous Integration
Category: Software Testing
Category: Test Automation
Category: Python Programming
Category: CI/CD
Category: Responsible AI
Category: Statistical Analysis
Category: Testability
Category: LLM Application
Category: Large Language Modeling
Category: Test Case
Category: AI Workflows
Category: Test Script Development
Category: SQL
Category: Model Deployment
Designing Production LLM Architectures

Designing Production LLM Architectures

Course 4, 11 hours

What you'll learn

  • Compare synchronous and asynchronous architectures and apply 12-factor principles and container orchestration to deploy scalable microservices.

  • Analyze multi-region deployments, pinpoint latency bottlenecks, and design resilient architecture improvements via fault analysis.

  • Create Airflow DAGs to automate data workflows and analyze the impact of schema evolution on downstream processes and tests.

  • Analyze trade-offs between self-hosting models vs. managed APIs and evaluate proposed infrastructure for fault tolerance and cost.

Skills you'll gain

Category: Application Deployment
Category: Data Pipelines
Category: Microservices
Category: Apache Airflow
Category: Software Architecture
Category: Kubernetes
Category: Diagram Design
Category: Cloud-Native Computing
Category: Scalability
Category: AWS CloudFormation
Category: Containerization
Category: Systems Architecture
Category: Model Deployment
Category: Infrastructure Architecture
Category: LLM Application
Category: Open Source Technology
Category: Azure DevOps
Category: Software Design
Category: Managed Services
Category: Large Language Modeling
Evaluating LLM Performance and Efficiency

Evaluating LLM Performance and Efficiency

Course 5, 9 hours

What you'll learn

  • Create PRDs with requirements and success metrics, and evaluate features against user-story acceptance criteria to identify gaps.

  • Evaluate prompt patterns and compute-spend reports to implement model-optimization techniques that reduce operational costs.

  • Analyze pipelines using value-stream mapping to eliminate inefficiencies and prioritize chatbot KPI optimizations.

  • Create technical documentation for vector index updates and evaluate system effectiveness against business requirements.

Skills you'll gain

Category: Prompt Engineering
Category: Product Requirements
Category: Process Optimization
Category: Model Optimization
Category: Cost Reduction
Category: Procedure Development
Category: Process Design
Category: Token Optimization
Category: MLOps (Machine Learning Operations)
Category: Key Performance Indicators (KPIs)
Category: Artificial Intelligence and Machine Learning (AI/ML)
Category: Workflow Management
Category: Process Mapping
Category: Prompt Patterns
Category: User Requirements Documents
Category: Software Documentation
Category: Large Language Modeling
Category: Business Process Automation
Category: AI Workflows
Category: LLM Application
Advancing Your Career in Production AI

Advancing Your Career in Production AI

Course 6, 1 hour

What you'll learn

  • Position yourself for senior AI roles by creating a strategic portfolio and mastering advanced system design and ethics-focused technical interviews.

Skills you'll gain

Category: Responsible AI
Category: Data Ethics
Category: Technical Communication
Category: Model Training
Category: Model Deployment
Category: SQL
Category: Technical Design
Category: Model Optimization
Category: MLOps (Machine Learning Operations)
Category: Prompt Engineering
Category: Python Programming
Category: Artificial Intelligence and Machine Learning (AI/ML)
Category: Communication
Category: AI Security
Category: CI/CD
Category: AI Workflows
Category: Apache Airflow
Category: AWS CloudFormation
Category: LLM Application
Category: System Design and Implementation

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Professionals from the Industry
475 Courses89,775 learners

Offered by

Coursera

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

¹Based on Coursera learner outcome survey responses, United States, 2021.