Building and Deploying Generative AI Models

This course is part of the "Generative AI Fundamentals" Specialization

Instructor: Amreen Anbar

3 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Construct and evaluate Transformer-based LLMs from scratch using PyTorch and industry metrics like ROUGE and BLEU.
Engineer Retrieval Augmented Generation (RAG) pipelines using LangChain to integrate current, domain-specific knowledge into models.
Deploy autonomous AI Agents to production environments on Google Cloud Platform (Vertex AI) using professional workflows.

Skills you'll gain

Model Evaluation
Fine-tuning
Generative Model Architectures
Agentic systems
Deep Learning
Large Language Modeling
Artificial Intelligence and Machine Learning (AI/ML)
Generative AI Agents
System Monitoring
Model Optimization
LLM Application
Development Environment
Model Training
Google Cloud Platform

Tools you'll learn

PyTorch (Machine Learning Library)
Agentic Workflows
AI Workflows
LangChain
Model Deployment
Generative AI

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

December 2025

Assessments

3 assignments

Taught in English

Build your subject-matter expertise

This course is part of the "Generative AI Fundamentals" Specialization

When you enroll in this course, you are also enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 3 modules in this course

Transition from theoretical concepts to production-ready engineering in this hands-on course, the final part of the "Generative AI Fundamentals" Specialization. Designed for learners ready to move beyond theory, this course focuses entirely on construction: you won't just learn about Large Language Models (LLMs); you will build, refine, and deploy them.

We start at the foundational level, coding different types of Transformer architectures from scratch using PyTorch. Through high-performance training with Automatic Mixed Precision (AMP) and ROUGE/BLEU evaluation, you will learn the techniques to scale custom components into optimized systems. By using pre-trained models and weighing performance trade-offs, you will gain the insight needed to select the most efficient path for large-scale deployment.

Moving to applied architecture, you will master Retrieval Augmented Generation (RAG) using LangChain, learning to evaluate pipelines and apply advanced techniques such as different chunking strategies, reranking and compression, and query transformation. You will also navigate model selection and the critical trade-offs between RAG and fine-tuning.

Finally, you will step into the future of AI by developing autonomous Agents. You will bridge the gap between development and production by setting up a professional workflow with Poetry and deploying a Summarizer AI Agent directly to Google Cloud Platform (Vertex AI). By the end of this course, you will have a tangible portfolio of code and a live deployment that demonstrate your ability to engineer robust Generative AI solutions.

In this module, we dive deep into the Transformer architecture, its core mechanics, and different transformer architecture types (encoder-only, decoder-only, encoder-decoder). We gain hands-on experience by building and training a complete suite of PyTorch-based models from scratch. The module concludes with strategic deployment skills, teaching when to build custom models versus leveraging pre-trained models for efficiency and state-of-the-art results.
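As a rough illustration of the attention mechanism at the core of the Transformer components this module covers, the sketch below computes scaled dot-product attention in plain Python. It is not taken from the course notebooks (which use PyTorch); the names `softmax` and `scaled_dot_product_attention` and the `causal` flag are assumptions made for this example. The causal mask mimics how a decoder-only model keeps a position from attending to later positions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values, causal=False):
    """Illustrative attention: softmax(Q K^T / sqrt(d_k)) V on nested lists.

    queries, keys, values: lists of equal-length float vectors.
    causal=True masks out positions after the current one (decoder-style).
    """
    d_k = len(keys[0])
    outputs = []
    for i, q in enumerate(queries):
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qc * kc for qc, kc in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        if causal:
            # Future positions get -inf so their softmax weight is 0.
            scores = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        out = [
            sum(w * v[c] for w, v in zip(weights, values))
            for c in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


# Toy self-attention over three 2-dimensional token vectors.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
causal_out = scaled_dot_product_attention(x, x, values, causal=True)
print(causal_out[0])  # [1.0, 0.0]: position 0 can only attend to itself
```

In an encoder-decoder model the same function would be called with queries from the decoder and keys/values from the encoder (cross-attention); real implementations add learned projections, multiple heads, and batched tensor operations.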

Included

18 videos, 11 readings, 1 assignment

18 videos (Total 113 minutes)

Course Introduction (4 minutes)
Meet your instructor: Amreen Anbar (1 minute)
Meet your instructor: Anahita Doosti (1 minute)
Meet your instructor: Soroush Razavi (1 minute)
Transformer: Evolution Unveiled (8 minutes)
Transformer: Types (8 minutes)
Transformer: The Components (7 minutes)
Setting The Stage: Environment, Libraries and Data (8 minutes)
Looking beyond theory: Let’s Build a Transformer! (9 minutes)
Looking beyond theory: Training and Text Generation (8 minutes)
Building the Complete Encoder-Decoder Summarizer: Encoder, Decoder, and the Cross-Attention Mechanism (7 minutes)
Building the Complete Encoder-Decoder Summarizer: Teacher Forcing, Loss, and Inference (7 minutes)
Scaling the Architecture: From Character Tokens to BPE and Massive Data (8 minutes)
Scaling the Architecture: High-Performance Optimization (AMP) and ROUGE Evaluation (9 minutes)
Synthesis: Implementation of the Translator Transformer (9 minutes)
Bypass the Training Wall: Powerful LLM Applications Without Massive Compute (5 minutes)
A Resource-Efficient Approach: Using Pre-trained Models for Summarization (6 minutes)
A Resource-Efficient Approach: Using Pre-trained Models for Translation (8 minutes)

11 readings (Total 290 minutes)

The original paper, "Attention Is All You Need" (20 minutes)
Interactive Transformer Explainer (30 minutes)
Notebook 1 (40 minutes)
Notebook 2 (40 minutes)
Notebook 3 (40 minutes)
Dataset (cnn_dailymail) (10 minutes)
Notebook 4 (40 minutes)
Dataset (wmt14) (10 minutes)
ROUGE and BLEU Score for NLP Evaluation (20 minutes)
Notebook 5 (20 minutes)
Notebook 6 (20 minutes)

1 assignment (Total 30 minutes)

Section 1 Quiz (30 minutes)
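
Module 1 evaluates summaries with ROUGE. As a hedged illustration of what that metric measures, the sketch below computes ROUGE-1 (unigram overlap) with whitespace tokenization. The function name `rouge_1` and the tokenization are assumptions made for this example; it is not the course's evaluation code, and production work would normally rely on an established scoring library.

```python
from collections import Counter


def rouge_1(candidate, reference):
    """Illustrative ROUGE-1: clipped unigram overlap between two texts.

    Tokenization here is a simple lowercase whitespace split (an assumption);
    real scorers typically add stemming and more careful tokenization.
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Counter intersection keeps the minimum count per token (clipping).
    overlap = sum((cand & ref).values())
    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    if overlap == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


scores = rouge_1("the cat sat on the mat", "the cat lay on the mat")
print(scores)  # 5 of 6 unigrams overlap, so precision and recall are both 5/6
```

ROUGE is recall-oriented and is commonly used for summarization, while BLEU is precision-oriented (with a brevity penalty) and is commonly used for translation, which matches how the two datasets in this module are used.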

Module 2 addresses the limitations of static knowledge and hallucinations in Large Language Models (LLMs) by introducing Retrieval Augmented Generation (RAG). Learners will progress from building fundamental pipelines with Ollama and LangChain to implementing production-ready systems by adding rigorous RAG evaluation and utilizing advanced techniques such as custom chunking strategies, vector stores, reranking, and query transformations to optimize context retrieval and response generation. The module concludes with an overview of another adaptation technique called finetuning and a comparison of RAG vs. finetuning.
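
The retrieval step described above can be sketched in a few lines of standard-library Python. This is a minimal sketch under stated assumptions, not the course's Ollama or LangChain pipeline: `chunk_text`, `embed`, `cosine`, `retrieve`, and `build_prompt` are hypothetical helper names, and a bag-of-words count vector stands in for a real embedding model and vector store.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=8, overlap=2):
    """Split text into word windows of chunk_size that overlap by `overlap` words."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must be larger than overlap")
    words = text.split()
    if not words:
        return []
    step = chunk_size - overlap
    stop = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + chunk_size]) for i in range(0, stop, step)]


def embed(text):
    """Stand-in 'embedding': token counts (a real system would use a model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, chunks, k=2):
    """Return the k chunks most similar to the query."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]


def build_prompt(query, context_chunks):
    """Assemble the augmented prompt that would be sent to the LLM."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


docs = [
    "Poetry manages Python dependencies",
    "Vertex AI hosts deployed agents",
    "ROUGE scores summaries",
]
top = retrieve("Where are agents deployed?", docs, k=1)
print(build_prompt("Where are agents deployed?", top))
```

The advanced techniques the module lists slot into this skeleton: chunking strategies change `chunk_text`, a vector store replaces the linear scan in `retrieve`, reranking reorders its output, and query transformation rewrites `query` before it is embedded.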

Included

13 videos, 2 readings, 1 assignment

13 videos (Total 85 minutes)

What is RAG? (6 minutes)
Building a Minimal RAG from Scratch with Ollama (Part 1) (7 minutes)
Building a Minimal RAG from Scratch with Ollama (Part 2) (5 minutes)
An Improved RAG Pipeline with LangChain (7 minutes)
RAG Evaluation and Metrics (7 minutes)
Implementing RAG Evaluation (7 minutes)
Document Loaders and Chunking Strategies (6 minutes)
Vector Stores and Indexing (6 minutes)
Reranking and Contextual Compression (7 minutes)
Query Transformation (7 minutes)
Pick the Right Models for your RAG (7 minutes)
What is Finetuning? (5 minutes)
RAG vs. Finetuning: Which one to choose? (7 minutes)

2 readings (Total 140 minutes)

Coding Notebooks (20 minutes)
Final RAG Results (120 minutes)

1 assignment (Total 30 minutes)

Section 2 Quiz (30 minutes)

Module 3 marks a pivotal transition from passive information retrieval to the dynamic realm of autonomous AI Agents, anchored by the "Understand, Think, Take Action" conceptual framework. Students will critically evaluate development ecosystems before applying these concepts to build a functional Summarizer Agent. The module emphasizes professional engineering standards, guiding learners through a complete lifecycle that includes environment management with Poetry, deployment to the Vertex AI Engine, and the implementation of robust performance monitoring using Google Cloud Platform’s logging and tracing tools.

Included

15 videos, 1 reading, 1 assignment

15 videos (Total 76 minutes)

What is an Agent? (7 minutes)
Different Approaches to Building Agents (6 minutes)
Our Approach in This Course (5 minutes)
ADK Features and Tools (5 minutes)
Setting Up the Cloud Environment (5 minutes)
Setting Up the Local Environment (4 minutes)
From Basic to Advanced Agents (6 minutes)
Deployment Pathways for ADK Agents (6 minutes)
Project Installation: Dependency and Environment Management (5 minutes)
Agent Structure and Workflow (6 minutes)
Running The Agent Part 1: Initiating (5 minutes)
Running The Agent Part 2: Analyzing (4 minutes)
Deploying Agent to The Cloud (5 minutes)
Monitoring The Deployment on GCP (3 minutes)
Wrap Up (4 minutes)

1 reading (Total 30 minutes)

Project Link and Description (30 minutes)

1 assignment (Total 30 minutes)

Section 3 Quiz (30 minutes)

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Amreen Anbar

Alberta Machine Intelligence Institute

2 courses, 1,221 learners

Offered by

Alberta Machine Intelligence Institute

Frequently asked questions

To access the course materials and assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. It also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.