AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers interested in quickly deploying AI inference services on Cloud Run. It is useful for those familiar with cloud-based serverless application deployment solutions, but who may not have experience with running AI inference using Google Cloud serverless products. The course includes examples that deploys a model for AI inference with GPUs and integrates gen AI apps with data storage services.

Deploy and Scale AI Models with Cloud Run

Deploy and Scale AI Models with Cloud Run
This course is part of Build and Modernize Applications With Generative AI Specialization

Instructor: Google Cloud Training
Access provided by SGCSRC
Gain insight into a topic and learn the fundamentals.
Beginner level
No prior experience required
1 hour to complete
Flexible schedule
Learn at your own pace
What you'll learn
Use Cloud Run GPUs for AI inference.
Deploy lightweight language models on Cloud Run for AI inference.
Optimize model deployments on Cloud Run for performance and cost efficiency.
Integrate Cloud Run AI inference services with database services on Google Cloud.
Skills you'll gain
Tools you'll learn
Details to know

Shareable certificate
Add to your LinkedIn profile
Assessments
2 assignments
Taught in English
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
This course is part of the Build and Modernize Applications With Generative AI Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 2 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."





