Lorsque vous vous inscrivez à ce cours, vous êtes également inscrit(e) à ce Certificat Professionnel.
Apprenez de nouveaux concepts auprès d'experts du secteur
Acquérez une compréhension de base d'un sujet ou d'un outil
Développez des compétences professionnelles avec des projets pratiques
Obtenez un certificat professionnel partageable auprès de IBM
Il y a 5 modules dans ce cours
Demonstrate you have the job-ready skills to design and implement a complete AI system from data to deployment, with this portfolio-worthy RAG and Agentic AI Capstone Project from IBM.
You’ll design and build a production-style multimodal RAG system that combines structured data, embeddings, retrieval logic, evaluation strategies, and intelligent workflows into one cohesive, scalable solution.
You’ll create and manage structured JSON datasets, generate text and image embeddings, and construct a vector database to power accurate similarity search and metadata-filtered retrieval. As you progress, you’ll implement robust RAG pipelines, apply re-ranking and evaluation techniques, and strengthen response quality using multimodal inputs and systematic validation approaches.
You’ll also design a multi-agent recommendation system, integrate tools using the Model Context Protocol (MCP), orchestrate workflow testing, and launch an interactive Gradio chatbot interface.
By the end, you’ll have developed an end-to-end generative AI application that demonstrates practical AI engineering expertise, architectural thinking, and production-ready implementation skills.
In this module, you will use LLMs to transform unstructured restaurant descriptions into structured JSON files by designing prompts and extracting predefined attributes. You will apply multimodal LLMs to generate captions from review images and integrate those captions into structured user review data.
Finally, you will build a command-line Python interface to browse, add, edit, and delete restaurant records, integrate LLM-powered structuring functions for new entries, and implement file backup mechanisms before saving updates.
Afficher les informations sur le contenu du module
2 vidéos•Total 5 minutes
Course Introduction•3 minutes
Project Overview•2 minutes
1 lecture•Total 2 minutes
Course Overview•2 minutes
4 devoirs•Total 63 minutes
Checklist: Structure Text Data with LLMs•16 minutes
Checklist: Process Multimodal Customer Data with LLMs•16 minutes
Checklist: Build a Simple Interactive User Interface•10 minutes
Graded Quiz: Build a Structured Generative AI Application•21 minutes
3 éléments d'application•Total 135 minutes
Lab: Structure Unstructured Restaurant Data with an LLM•45 minutes
Lab: Process Multimodal Data with LLMs•45 minutes
Lab: Build a Command-Line Data Management UI for Restaurant Data•45 minutes
5 plugins•Total 23 minutes
Reading: Helpful Tips for Course Completion•5 minutes
Reading: Assignment Overview: Structure Unstructured Restaurant Data with an LLM•5 minutes
Reading: Assignment Overview: Process Multimodal Data with LLMs•5 minutes
Reading: Assignment Overview: Build a Command-Line Data Management UI for Restaurant Data•5 minutes
Podcast: Recap: Build a Structured Generative AI Application•3 minutes
Module 2: Design a Multimodal RAG System
Module 2•3 heures à terminer
Détails du module
In this module, you will design and implement the retrieval layer of a multimodal RAG system using structured restaurant text data and food images. You will construct multimodal vector indexes, generate text and image embeddings, and build retrieval workflows that combine similarity search with metadata filtering.
You will also implement late-fusion techniques to combine and rerank results across modalities, improving the relevance of retrieved outputs. The module follows a step-by-step retrieval pipeline, from index construction to hybrid retrieval and multimodal ranking, with a focus on practical design rather than tool-specific features.
Inclus
4 devoirs3 éléments d'application4 plugins
Afficher les informations sur le contenu du module
4 devoirs•Total 51 minutes
Checklist: Multimodal Vector Index Construction•10 minutes
Checklist: Similarity Retrieval with Metadata Filtering •10 minutes
Checklist: Multimodal Similarity Fusion and Ranking •10 minutes
Graded Quiz: Design a Multimodal RAG System•21 minutes
3 éléments d'application•Total 135 minutes
Lab: Construct a Multimodal Vector Index•45 minutes
Lab: Similarity Retrieval with Metadata Filtering•45 minutes
Lab: Multimodal Similarity Fusion and Retrieval Ranking•45 minutes
4 plugins•Total 13 minutes
Reading: Assignment Overview: Construct a Multimodal Vector Index•5 minutes
Reading: Assignment Overview: Similarity Retrieval with Metadata Filtering•0 minutes
Reading: Assignment Overview: Multimodal Similarity Fusion and Retrieval Ranking•5 minutes
Podcast: Recap: Design a Multimodal RAG System •3 minutes
Module 3: Combine Agents into a Multi-Agent System
Module 3•3 heures à terminer
Détails du module
In this module, you will design and implement a multi-agent recommendation system. You will define specialized agents with clear roles, goals, backstories, and tasks, and integrate them into a coordinated multi-agent workflow. You will then test how multiple agents collaborate to generate restaurant and recipe recommendations from a single user input.
Finally, you will build an interactive chatbot interface using Gradio to expose the system. The chatbot will process user queries, display coordinated agent outputs, and support basic database editing functionality within the interface.
Inclus
4 devoirs3 éléments d'application4 plugins
Afficher les informations sur le contenu du module
4 devoirs•Total 51 minutes
Checklist: Define Agents and Their Roles•10 minutes
Checklist: Integrate Agents into a Multi-Agent System •10 minutes
Checklist: Build a Chatbot Interface for the Recommendation System•10 minutes
Graded Quiz: Combine Agents into a Multi-Agent System•21 minutes
3 éléments d'application•Total 135 minutes
Lab: Design Specialized Agents for a Recommendation System•45 minutes
Lab: Implement and Test a Multi-Agent Recommendation System•45 minutes
Lab: Build a Chatbot Interface for the Recommendation System•45 minutes
4 plugins•Total 18 minutes
Reading: Assignment Overview: Design Specialized Agents for a Recommendation System•5 minutes
Reading: Assignment Overview: Implement and Test a Multi-Agent Recommendation System•5 minutes
Reading: Assignment Overview: Build a Chatbot Interface for the Recommendation System •5 minutes
Podcast: Recap: Combine Agents into a Multi-Agent System•3 minutes
Module 4: Integrate Agents, RAG, and Tools with MCP
Module 4•3 heures à terminer
Détails du module
In this module, you will organize agent tools, databases, and documents within an MCP server. You will then build an MCP client and an LLM-based MCP host that communicate with the server and validate the system through testing.
You will also design and implement an LLM-powered MCP host with a GUI, enabling the LLM to access server-exposed tools and documents. This module brings together components built earlier into a unified MCP-based system and validates end-to-end tool execution through a GUI-based application.
Inclus
4 devoirs3 éléments d'application4 plugins
Afficher les informations sur le contenu du module
4 devoirs•Total 51 minutes
Checklist: Organize Tools and Data in an MCP Server•10 minutes
Checklist: Implement an MCP Client for Server Communication•10 minutes
Checklist: Design an LLM-based MCP Host•10 minutes
Graded Quiz: Integrate Agents, RAG, and Tools with MCP•21 minutes
3 éléments d'application•Total 90 minutes
Lab: Build an MCP Server•30 minutes
Lab: Build an MCP Client•30 minutes
Lab: Build a Full MCP Application•30 minutes
4 plugins•Total 18 minutes
Reading: Assignment Overview: Build an MCP Server•5 minutes
Reading: Assignment Overview: Build an MCP Client •5 minutes
Reading: Assignment Overview: Build a Full MCP Application•5 minutes
Podcast: Summary: Integrate Agents, RAG, and Tools with MCP•3 minutes
Module 5: Final Project and Course Wrap-Up
Module 5•1 heure à terminer
Détails du module
In this module, you will complete your AI capstone project by submitting screenshots of tasks performed in previous labs. You’ll organize and present these artifacts to clearly demonstrate how you designed, built, and integrated structured data, multimodal RAG systems, and multi-agent workflows using LangChain, LangGraph, and MCP. This submission will serve as a final evaluation through an AI-based grading system and provide a portfolio-ready showcase of your end-to-end generative AI solution.
Inclus
1 vidéo2 lectures1 élément d'application1 plugin
Afficher les informations sur le contenu du module
1 vidéo•Total 3 minutes
Course Wrap-Up•3 minutes
2 lectures•Total 2 minutes
Congratulations and Next Steps •1 minute
Thanks from the Course Team•1 minute
1 élément d'application•Total 60 minutes
Project: Final Project Submission and Evaluation•60 minutes
1 plugin•Total 5 minutes
Reading: Prepare to Submit Your Project•5 minutes
Obtenez un certificat professionnel
Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.
Instructeurs
Évaluations de l’enseignant
Évaluations de l’enseignant
Nous avons demandé à tous les étudiants de fournir des commentaires sur nos enseignants au sujet de la qualité de leur pédagogie.
At IBM, we know how rapidly tech evolves and recognize the crucial need for businesses and professionals to build job-ready, hands-on skills quickly. As a market-leading tech innovator, we’re committed to helping you thrive in this dynamic landscape. Through IBM Skills Network, our expertly designed training programs in AI, software development, cybersecurity, data science, business management, and more, provide the essential skills you need to secure your first job, advance your career, or drive business success. Whether you’re upskilling yourself or your team, our courses, Specializations, and Professional Certificates build the technical expertise that ensures you, and your organization, excel in a competitive world.
OK
En savoir plus sur Software Development
RecommandéCertificats ProfessionnelsSpécialisationsEn rapport
Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?
Felipe M.
Étudiant(e) depuis 2018
’Pouvoir suivre des cours à mon rythme à été une expérience extraordinaire. Je peux apprendre chaque fois que mon emploi du temps me le permet et en fonction de mon humeur.’
Jennifer J.
Étudiant(e) depuis 2020
’J'ai directement appliqué les concepts et les compétences que j'ai appris de mes cours à un nouveau projet passionnant au travail.’
Larry W.
Étudiant(e) depuis 2021
’Lorsque j'ai besoin de cours sur des sujets que mon université ne propose pas, Coursera est l'un des meilleurs endroits où se rendre.’
Chaitanya A.
’Apprendre, ce n'est pas seulement s'améliorer dans son travail : c'est bien plus que cela. Coursera me permet d'apprendre sans limites.’
This capstone enables you to demonstrate your job-ready skills for advanced roles such as AI Agent Engineer, RAG Systems Developer, MCP Developer, Generative AI Application Engineer, Multi-Agent Systems Developer, and AI Workflow Engineer.
It is ideal for software developers, Python programmers, and AI practitioners who want hands-on experience building and deploying multimodal RAG systems, integrating tools using Model Context Protocol (MCP), and orchestrating multi-agent workflows.
The course is also well suited for professionals reskilling into generative AI engineering roles that require end-to-end system design, tool integration, and production-ready AI application development.
What prior knowledge is essential for this course?
You should be comfortable with Python programming and have a foundational understanding of large language models (LLMs), embeddings, and retrieval-based workflows. Familiarity with RAG concepts and basic agent architectures will help you progress more smoothly.
Completing the earlier courses in one of the related IBM Professional Certificates and specializations is strongly recommended, as this capstone builds on newly learnt concepts and applies them in an end-to-end implementation.
What tools and technologies will I learn in this course?
You’ll work with large language models (LLMs) and multimodal LLMs to structure data, generate embeddings, and build retrieval pipelines. You’ll design multimodal vector indexes, implement similarity-based retrieval with metadata filtering, and apply late-fusion ranking techniques. You’ll also develop multi-agent systems using LangChain and LangGraph, build an interactive Gradio chatbot interface, and integrate agents, retrieval systems, and tools using Model Context Protocol (MCP) by configuring servers, clients, and an LLM-based host.
What practical skills will I gain from this course?
You’ll learn how to structure unstructured text and multimodal data using LLMs, build multimodal vector indexes, and implement similarity-based retrieval with metadata filtering and ranking techniques. You’ll also design and test a multi-agent recommendation system, build an interactive Gradio chatbot interface, and integrate agents, retrieval systems, and tools using Model Context Protocol (MCP) to create an end-to-end generative AI application.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.