This course provides a comprehensive guide to deploying, managing, and optimizing AI and high-performance computing (HPC) workloads on Google Cloud. Through a series of lessons and practical demonstrations, you’ll explore diverse deployment strategies, ranging from highly customizable environments using Google Compute Engine (GCE) to managed solutions like Google Kubernetes Engine (GKE). Specifically, you’ll learn how to create clusters and deploy GKE for inference.

AI Infrastructure: Deployment Types

AI Infrastructure: Deployment Types

Instructor: Google Cloud Training
Access provided by US Postal Service
Gain insight into a topic and learn the fundamentals.
Intermediate level
Some related experience required
5 hours to complete
Flexible schedule
Learn at your own pace
What you'll learn
Describe the process of creating a GPU-accelerated cluster.
Identify how to provision a GPU-accelerated cluster on GCE.
Identify how to provision a GPU-accelerated cluster on GKE.
Identify how to deploy AI inference workloads on GKE.
Skills you'll gain
Details to know

Shareable certificate
Add to your LinkedIn profile
Assessments
4 assignments
Taught in English
Recently updated!
December 2025
See how employees at top companies are mastering in-demand skills

There are 6 modules in this course
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Explore more from Computer Science

Google Cloud

Google Cloud

Google Cloud


