Learn to orchestrate AI systems across local and cloud environments through hands-on infrastructure setup, model deployment, and workflow integration. You will build a prompt engineering pyramid from basic prompts to chain-of-thought reasoning implemented in Rust, then evaluate six decision factors for choosing between local and cloud models including latency, throughput, cost, and privacy. The course covers local AI infrastructure in depth: running Ollama with custom Modelfiles for task-specific assistants, deploying llamafile for zero-dependency portable inference, compiling Rust Candle with CUDA for GPU-accelerated local inference, and optimizing local RAG with caching strategies. You will configure a complete AI workstation with tmux for session management, nvidia-smi and Zenith for GPU monitoring, and NVIDIA GPU optimization. The final module covers cloud workflows including AWS Spot instances for cost-effective GPU compute, Hugging Face model discovery and download, and GitHub AI models integration. By completing this course, you will be able to set up local AI infrastructure, deploy models across local and cloud environments, and design orchestration workflows that balance cost, privacy, and performance.

AI Orchestration: From local models to cloud

AI Orchestration: From local models to cloud
This course is part of AI Tooling Specialization


Instructors: Alfredo Deza
Access provided by Allegiant Giving Corporation
Gain insight into a topic and learn the fundamentals.
Beginner level
Recommended experience
5 hours to complete
Flexible schedule
Learn at your own pace
What you'll learn
Build a prompt engineering pyramid from basic prompts to chain-of-thought reasoning in Rust, and evaluate decision factors for local vs cloud
Set up local AI infrastructure with Ollama, llamafile, aprender and Rust Candle GPU compilation, plus caching and RAG optimization strategies
Configure a production AI workstation with tmux, nvidia-smi, and Zenith, and integrate cloud workflows with AWS Spot, Hugging Face, and GitHub AI
Details to know

Shareable certificate
Add to your LinkedIn profile
Assessments
4 assignments
Taught in English
Recently updated!
April 2026
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
This course is part of the AI Tooling Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 4 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Offered by
Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."





