Deploy and Scale AI Models with Cloud Run

Deploy and Scale AI Models with Cloud Run

This course is part of Build and Modernize Applications With Generative AI Specialization

Instructor: Google Cloud Training

Access provided by Martin Luther Christian University

2 modules

Gain insight into a topic and learn the fundamentals.

Beginner level

No prior experience required

1 hour to complete

Flexible schedule

Learn at your own pace

2 modules

Gain insight into a topic and learn the fundamentals.

Beginner level

No prior experience required

1 hour to complete

Flexible schedule

Learn at your own pace

What you'll learn

Use Cloud Run GPUs for AI inference.
Deploy lightweight language models on Cloud Run for AI inference.
Optimize model deployments on Cloud Run for performance and cost efficiency.
Integrate Cloud Run AI inference services with database services on Google Cloud.

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

2 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Build and Modernize Applications With Generative AI Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 2 modules in this course

AI inference is the process of using a trained machine learning model to make predictions on new, unseen data by applying learned patterns. This course is designed for developers, data scientists, and ML engineers interested in quickly deploying AI inference services on Cloud Run. It is useful for those familiar with cloud-based serverless application deployment solutions, but who may not have experience with running AI inference using Google Cloud serverless products. The course includes examples that deploys a model for AI inference with GPUs and integrates gen AI apps with data storage services.