Board Infinity

Model Serving Systems: Containers, APIs & Scalability

Board Infinity

Model Serving Systems: Containers, APIs & Scalability

Board Infinity

Instructor: Board Infinity

Access provided by Amgen

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Build optimized Docker images and multi-container ML apps using Docker Compose and multi-stage builds

  • Design scalable REST APIs with FastAPI, Pydantic validation, versioning, and error handling

  • Scale ML serving with async queues, load balancing, and latency profiling for production workloads

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

17 assignments

Taught in English
Recently updated!

May 2026

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Machine Learning Operations (MLOps) Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 4 modules in this course

This module introduces containerization fundamentals and shows learners how to build efficient Docker images for ML workloads, ensuring portability and reproducibility across environments.

What's included

12 videos4 readings5 assignments

Learners develop and refine REST APIs for ML model inference, focusing on reliability, scalability, and real-world best practices.

What's included

9 videos3 readings4 assignments

This module emphasizes scalability, concurrency, and optimization for production-grade model serving systems.

What's included

9 videos3 readings4 assignments

The final module demonstrates how to save, deploy, and safely roll back production models while maintaining uptime and integrity.

What's included

9 videos3 readings4 assignments

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Board Infinity
Board Infinity
258 Courses412,518 learners

Offered by

Board Infinity

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Explore more from Data Science