The API Development and Model Serving course is designed for developers, engineers, and technical product builders who are new to Generative AI but already have intermediate machine learning knowledge, basic Python proficiency, and familiarity with development environments such as VS Code, and who want to engineer, customize, and deploy open generative AI solutions while avoiding vendor lock-in.

API Development and Model Serving

API Development and Model Serving
This course is part of Open Generative AI: Build with Open Models and Tools Professional Certificate

Instructor: Professionals from the Industry
Access provided by L&T Corp - ATLNext
Recommended experience
Details to know

Add to your LinkedIn profile
3 assignments
See how employees at top companies are mastering in-demand skills

Build your Software Development expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Coursera

There are 3 modules in this course
Learn how to build practical REpresentational State Transfer Application Programming Interface (REST APIs) that turn your models into usable services. You’ll create inference endpoints, design request/response schemas, and implement authentication and error handling to keep your APIs secure and reliable. By the end, you’ll have hands-on experience developing a FastAPI service that teammates and applications can call seamlessly, a core skill for production ML engineers.
What's included
1 video2 readings1 assignment1 ungraded lab
Explore how Model Context Protocol (MCP) enables models to connect directly with tools and systems. You’ll compare MCP with traditional APIs, implement function calling, and practice integrating MCP into FastAPI endpoints. These skills show you how to extend models beyond simple outputs, giving them the ability to take real actions—a capability increasingly expected in applied AI systems.
What's included
1 video1 reading1 assignment1 ungraded lab
Learn how to prepare APIs for production by making them scalable and resilient. You’ll use Docker to containerize services, apply asynchronous request handling, and configure load balancing to support real workloads. You’ll also monitor performance and optimize bottlenecks, gaining the practical skills to ensure your model APIs stay reliable when demand grows.
What's included
1 video2 readings1 assignment
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Explore more from Computer Science

Coursera

Coursera



