When will I have access to the lectures and assignments?

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

What will I get if I subscribe to this Specialization?

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Production ML with Hugging Face

Ce cours n'est pas disponible en Français (France)

Nous sommes actuellement en train de le traduire dans plus de langues.

Production ML with Hugging Face

Ce cours fait partie de Spécialisation "Next-Gen AI Development with Hugging Face"

Instructeur : Noah Gift

Inclus avec

4 modules

Obtenez un aperçu d'un sujet et apprenez les principes fondamentaux.

niveau Intermédiaire

Expérience recommandée

4 heures à compléter

Planning flexible

Apprenez à votre propre rythme

4 modules

Obtenez un aperçu d'un sujet et apprenez les principes fondamentaux.

niveau Intermédiaire

Expérience recommandée

4 heures à compléter

Planning flexible

Apprenez à votre propre rythme

Ce que vous apprendrez

Convert and deploy ML models across GGUF, SafeTensors, and APR formats for GPU, CPU, and browser targets

Compétences que vous acquerrez

Catégorie : Model Optimization
Catégorie : Cryptographic Protocols
Catégorie : LLM Application
Catégorie : Performance Tuning
Catégorie : Performance Testing
Catégorie : CI/CD
Catégorie : Large Language Modeling
Catégorie : Application Deployment
Catégorie : AI Security
Catégorie : MLOps (Machine Learning Operations)

Outils que vous découvrirez

Catégorie : Model Deployment
Catégorie : Hugging Face
Catégorie : Python Programming
Catégorie : Rust (Programming Language)

Détails à connaître

Certificat partageable

Ajouter à votre profil LinkedIn

Récemment mis à jour !

février 2026

Évaluations

4 devoirs

Enseigné en Anglais

91%

of learners achieved a positive career outcome

Découvrez comment les employés des entreprises prestigieuses maîtrisent des compétences recherchées

En savoir plus sur Coursera pour les affaires

logos de Petrobras, TATA, Danone, Capgemini, P&G et L'Oreal

Élaborez votre expertise du sujet

Ce cours fait partie de la Spécialisation "Next-Gen AI Development with Hugging Face"

Lorsque vous vous inscrivez à ce cours, vous êtes également inscrit(e) à cette Spécialisation.

Apprenez de nouveaux concepts auprès d'experts du secteur
Acquérez une compréhension de base d'un sujet ou d'un outil
Développez des compétences professionnelles avec des projets pratiques
Obtenez un certificat professionnel partageable

Il y a 4 modules dans ce cours

Learn to deploy ML models to production using the Sovereign Rust Stack—a pure Rust implementation with zero Python runtime dependencies. This hands-on course teaches you to work with three critical model formats (GGUF, SafeTensors, APR), implement MLOps pipelines with CI/CD and observability, and deploy models across GPU, CPU, WebAssembly, and edge targets.

Through real-world projects including a Python-to-Rust transpiler (Depyler), browser-based speech recognition (Whisper.apr), and LLM inference benchmarking (Qwen), you'll master format conversion, cryptographic model signing, and performance optimization. The course culminates in a capstone project deploying Qwen2.5-Coder across all three formats with benchmarking. What makes this course unique: instead of relying on Python frameworks, you'll build with production-grade Rust tooling that compiles to native binaries and WebAssembly. Learn to run sub-millisecond inference in browsers, bundle models into executables, and achieve 2x performance gains over standard tools. Ideal for ML engineers and software developers ready to move beyond notebooks into production deployment.

Détails du module

Understanding ML model formats and the Sovereign AI Stack. Learn GGUF, SafeTensors, and APR formats for different deployment targets.

Inclus

6 vidéos8 lectures1 devoir

6 vidéosTotal 21 minutes

Course Introduction3 minutes
Hugging Face Model Publishing4 minutes
Model Types on Hugging Face3 minutes
APR Format Deep Dive4 minutes
Model Format Comparison3 minutes
Why Trace Models 4 minutes

8 lecturesTotal 8 minutes

Introduction to Course and Course Resources1 minute
Meet your instructors1 minute
Key Concepts1 minute
Reflection1 minute
Key Terms1 minute
Reflection1 minute
Key Terms1 minute
Reflection1 minute

1 devoirTotal 5 minutes

Quiz: Model Format5 minutes

Production infrastructure for ML systems. This module covers the essential MLOps practices needed to deploy and maintain ML models in production environments. Learn how to implement CI/CD pipelines specifically designed for ML workflows, set up comprehensive observability with logs, metrics, and traces, apply cryptographic model signing for supply chain security, and choose optimal deployment patterns based on your infrastructure requirements.

Inclus

8 vidéos6 lectures1 devoir

8 vidéosTotal 24 minutes

Model Registry Architecture3 minutes
CI/CD Pipeline for ML4 minutes
Model Observability Stack3 minutes
Model Signing & Security3 minutes
Binary Deployment Patterns3 minutes
Inference Server Architecture3 minutes
Corpus Management & DataOps3 minutes
Cost-Performance Decision Matrix3 minutes

6 lecturesTotal 60 minutes

Key Concepts10 minutes
Reflection10 minutes
Key Terms10 minutes
Reflection10 minutes
Key Terms10 minutes
Reflection10 minutes

1 devoirTotal 5 minutes

Quiz: MLOps Foundations5 minutes

Real-world projects built with the Sovereign AI Stack. This module demonstrates practical applications through three production projects: Depyler (a Python-to-Rust transpiler with self-improving ML), Whisper.apr (speech-to-text in browser and CLI), and the APR ecosystem tools. Learn how to build self-improving systems using compiler-in-the-loop training, deploy speech recognition to resource-constrained environments, and leverage the full APR toolchain for model conversion and inference.

Inclus

11 vidéos6 lectures1 devoir

11 vidéosTotal 43 minutes

Four Projects, One Stack5 minutes
Depyler Deep Dive5 minutes
Depyler Oracle Training3 minutes
Depyler Single-Shot Compile3 minutes
Whisper.apr Overview5 minutes
Whisper Code Walkthrough4 minutes
Whisper Demo3 minutes
APR Format Rosetta Stone3 minutes
APR Hub & Spoke Architecture3 minutes
APR Chat Demo3 minutes
Course Conclusion3 minutes

6 lecturesTotal 60 minutes

Key Terms10 minutes
Reflection10 minutes
Key Concepts10 minutes
Reflection10 minutes
Key Concepts10 minutes
Reflection10 minutes

1 devoirTotal 5 minutes

Quiz: Project Showcase5 minutes

Final project deploying Qwen2.5-Coder-0.5B across all three model formats. Students demonstrate mastery of format conversion, CLI deployment, server deployment, and performance benchmarking.

Inclus

3 lectures1 devoir

Obtenez un certificat professionnel

Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.