What job roles is this course designed for?

<text variant="body1">This capstone enables you to demonstrate your job-ready skills for advanced roles such as AI Agent Engineer, RAG Systems Developer, MCP Developer, Generative AI Application Engineer, Multi-Agent Systems Developer, and AI Workflow Engineer. <text variant="body1">It is ideal for software developers, Python programmers, and AI practitioners who want hands-on experience building and deploying multimodal RAG systems, integrating tools using Model Context Protocol (MCP), and orchestrating multi-agent workflows. <text variant="body1">The course is also well suited for professionals reskilling into generative AI engineering roles that require end-to-end system design, tool integration, and production-ready AI application development.

What prior knowledge is essential for this course?

You should be comfortable with Python programming and have a foundational understanding of large language models (LLMs), embeddings, and retrieval-based workflows. Familiarity with RAG concepts and basic agent architectures will help you progress more smoothly. Completing the earlier courses in one of the related IBM Professional Certificates and specializations is strongly recommended, as this capstone builds on newly learnt concepts and applies them in an end-to-end implementation.

What tools and technologies will I learn in this course?

You’ll work with large language models (LLMs) and multimodal LLMs to structure data, generate embeddings, and build retrieval pipelines. You’ll design multimodal vector indexes, implement similarity-based retrieval with metadata filtering, and apply late-fusion ranking techniques. You’ll also develop multi-agent systems using LangChain and LangGraph, build an interactive Gradio chatbot interface, and integrate agents, retrieval systems, and tools using Model Context Protocol (MCP) by configuring servers, clients, and an LLM-based host.

What practical skills will I gain from this course?

You’ll learn how to structure unstructured text and multimodal data using LLMs, build multimodal vector indexes, and implement similarity-based retrieval with metadata filtering and ranking techniques. You’ll also design and test a multi-agent recommendation system, build an interactive Gradio chatbot interface, and integrate agents, retrieval systems, and tools using Model Context Protocol (MCP) to create an end-to-end generative AI application.

When will I have access to the lectures and assignments?

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

What will I get if I subscribe to this Certificate?

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

RAG and Agentic AI Capstone Project

Ce cours n'est pas disponible en Français (France)

Nous sommes actuellement en train de le traduire dans plus de langues.

RAG and Agentic AI Capstone Project

Ce cours fait partie de Certificat Professionnel IBM RAG and Agentic AI

Instructeurs : Abdul Fatir

2 112 déjà inscrits

Inclus avec

5 modules

Obtenez un aperçu d'un sujet et apprenez les principes fondamentaux.

15 avis

niveau Avancées

Expérience recommandée

1 semaine à compléter

à 10 heures par semaine

Planning flexible

Apprenez à votre propre rythme

5 modules

Obtenez un aperçu d'un sujet et apprenez les principes fondamentaux.

15 avis

niveau Avancées

Expérience recommandée

1 semaine à compléter

à 10 heures par semaine

Planning flexible

Apprenez à votre propre rythme

Ce que vous apprendrez

Demonstrate you have the job-ready skills to design and implement a complete AI system from data to deployment.
Transform unstructured text and multimodal data into structured JSON formats using LLMs to drive data-driven decision-making.
Architect multimodal vector databases and multi-agent systems to coordinate specialized agents for high-accuracy recommendations.
Integrate complex AI ecosystems using MCP, configuring servers and clients to build, validate, and scale tool-augmented agents.

Compétences que vous acquerrez

Catégorie : Tool Calling
Catégorie : Large Language Modeling
Catégorie : Unstructured Data
Catégorie : System Testing
Catégorie : Generative AI Agents
Catégorie : Multimodal Prompts
Catégorie : Embeddings
Catégorie : Retrieval-Augmented Generation
Catégorie : Agentic systems
Catégorie : LLM Application

Outils que vous découvrirez

Catégorie : Agentic Workflows
Catégorie : JSON
Catégorie : Vector Databases
Catégorie : Generative AI
Catégorie : AI Orchestration
Catégorie : Model Context Protocol
Catégorie : AI Workflows

Détails à connaître

Certificat partageable

Ajouter à votre profil LinkedIn

Récemment mis à jour !

mars 2026

Évaluations

16 devoirs

Enseigné en Anglais

Découvrez comment les employés des entreprises prestigieuses maîtrisent des compétences recherchées

En savoir plus sur Coursera pour les affaires

logos de Petrobras, TATA, Danone, Capgemini, P&G et L'Oreal

Élaborez votre expertise en Software Development

Ce cours fait partie de la Certificat Professionnel IBM RAG and Agentic AI

Lorsque vous vous inscrivez à ce cours, vous êtes également inscrit(e) à ce Certificat Professionnel.

Apprenez de nouveaux concepts auprès d'experts du secteur
Acquérez une compréhension de base d'un sujet ou d'un outil
Développez des compétences professionnelles avec des projets pratiques
Obtenez un certificat professionnel partageable auprès de IBM

Il y a 5 modules dans ce cours

Demonstrate you have the job-ready skills to design and implement a complete AI system from data to deployment, with this portfolio-worthy RAG and Agentic AI Capstone Project from IBM.

You’ll design and build a production-style multimodal RAG system that combines structured data, embeddings, retrieval logic, evaluation strategies, and intelligent workflows into one cohesive, scalable solution. You’ll create and manage structured JSON datasets, generate text and image embeddings, and construct a vector database to power accurate similarity search and metadata-filtered retrieval. As you progress, you’ll implement robust RAG pipelines, apply re-ranking and evaluation techniques, and strengthen response quality using multimodal inputs and systematic validation approaches. You’ll also design a multi-agent recommendation system, integrate tools using the Model Context Protocol (MCP), orchestrate workflow testing, and launch an interactive Gradio chatbot interface. By the end, you’ll have developed an end-to-end generative AI application that demonstrates practical AI engineering expertise, architectural thinking, and production-ready implementation skills.

Détails du module

In this module, you will use LLMs to transform unstructured restaurant descriptions into structured JSON files by designing prompts and extracting predefined attributes. You will apply multimodal LLMs to generate captions from review images and integrate those captions into structured user review data. Finally, you will build a command-line Python interface to browse, add, edit, and delete restaurant records, integrate LLM-powered structuring functions for new entries, and implement file backup mechanisms before saving updates.

Inclus

2 vidéos1 lecture4 devoirs3 éléments d'application5 plugins

2 vidéosTotal 5 minutes

Course Introduction3 minutes
Project Overview2 minutes

1 lectureTotal 2 minutes

Course Overview2 minutes

4 devoirsTotal 63 minutes

Checklist: Structure Text Data with LLMs16 minutes
Checklist: Process Multimodal Customer Data with LLMs16 minutes
Checklist: Build a Simple Interactive User Interface10 minutes
Graded Quiz: Build a Structured Generative AI Application21 minutes

3 éléments d'applicationTotal 135 minutes

Lab: Structure Unstructured Restaurant Data with an LLM45 minutes
Lab: Process Multimodal Data with LLMs45 minutes
Lab: Build a Command-Line Data Management UI for Restaurant Data45 minutes

5 pluginsTotal 23 minutes

Reading: Helpful Tips for Course Completion5 minutes
Reading: Assignment Overview: Structure Unstructured Restaurant Data with an LLM5 minutes
Reading: Assignment Overview: Process Multimodal Data with LLMs5 minutes
Reading: Assignment Overview: Build a Command-Line Data Management UI for Restaurant Data5 minutes
Podcast: Recap: Build a Structured Generative AI Application3 minutes

In this module, you will design and implement the retrieval layer of a multimodal RAG system using structured restaurant text data and food images. You will construct multimodal vector indexes, generate text and image embeddings, and build retrieval workflows that combine similarity search with metadata filtering. You will also implement late-fusion techniques to combine and rerank results across modalities, improving the relevance of retrieved outputs. The module follows a step-by-step retrieval pipeline, from index construction to hybrid retrieval and multimodal ranking, with a focus on practical design rather than tool-specific features.

Inclus

4 devoirs3 éléments d'application4 plugins

4 devoirsTotal 51 minutes

Checklist: Multimodal Vector Index Construction10 minutes
Checklist: Similarity Retrieval with Metadata Filtering 10 minutes
Checklist: Multimodal Similarity Fusion and Ranking 10 minutes
Graded Quiz: Design a Multimodal RAG System21 minutes

3 éléments d'applicationTotal 135 minutes

Lab: Construct a Multimodal Vector Index45 minutes
Lab: Similarity Retrieval with Metadata Filtering45 minutes
Lab: Multimodal Similarity Fusion and Retrieval Ranking45 minutes

4 pluginsTotal 13 minutes

Reading: Assignment Overview: Construct a Multimodal Vector Index5 minutes
Reading: Assignment Overview: Similarity Retrieval with Metadata Filtering0 minutes
Reading: Assignment Overview: Multimodal Similarity Fusion and Retrieval Ranking5 minutes
Podcast: Recap: Design a Multimodal RAG System 3 minutes

In this module, you will design and implement a multi-agent recommendation system. You will define specialized agents with clear roles, goals, backstories, and tasks, and integrate them into a coordinated multi-agent workflow. You will then test how multiple agents collaborate to generate restaurant and recipe recommendations from a single user input. Finally, you will build an interactive chatbot interface using Gradio to expose the system. The chatbot will process user queries, display coordinated agent outputs, and support basic database editing functionality within the interface.

Inclus

4 devoirs3 éléments d'application4 plugins

4 devoirsTotal 51 minutes

Checklist: Define Agents and Their Roles10 minutes
Checklist: Integrate Agents into a Multi-Agent System 10 minutes
Checklist: Build a Chatbot Interface for the Recommendation System10 minutes
Graded Quiz: Combine Agents into a Multi-Agent System21 minutes

3 éléments d'applicationTotal 135 minutes

Lab: Design Specialized Agents for a Recommendation System45 minutes
Lab: Implement and Test a Multi-Agent Recommendation System45 minutes
Lab: Build a Chatbot Interface for the Recommendation System45 minutes

4 pluginsTotal 18 minutes

Reading: Assignment Overview: Design Specialized Agents for a Recommendation System5 minutes
Reading: Assignment Overview: Implement and Test a Multi-Agent Recommendation System5 minutes
Reading: Assignment Overview: Build a Chatbot Interface for the Recommendation System 5 minutes
Podcast: Recap: Combine Agents into a Multi-Agent System3 minutes

In this module, you will organize agent tools, databases, and documents within an MCP server. You will then build an MCP client and an LLM-based MCP host that communicate with the server and validate the system through testing. You will also design and implement an LLM-powered MCP host with a GUI, enabling the LLM to access server-exposed tools and documents. This module brings together components built earlier into a unified MCP-based system and validates end-to-end tool execution through a GUI-based application.

Inclus

4 devoirs3 éléments d'application4 plugins

4 devoirsTotal 51 minutes

Checklist: Organize Tools and Data in an MCP Server10 minutes
Checklist: Implement an MCP Client for Server Communication10 minutes
Checklist: Design an LLM-based MCP Host10 minutes
Graded Quiz: Integrate Agents, RAG, and Tools with MCP21 minutes

3 éléments d'applicationTotal 90 minutes

Lab: Build an MCP Server30 minutes
Lab: Build an MCP Client30 minutes
Lab: Build a Full MCP Application30 minutes

4 pluginsTotal 18 minutes

Reading: Assignment Overview: Build an MCP Server5 minutes
Reading: Assignment Overview: Build an MCP Client 5 minutes
Reading: Assignment Overview: Build a Full MCP Application5 minutes
Podcast: Summary: Integrate Agents, RAG, and Tools with MCP3 minutes

In this module, you will complete your AI capstone project by submitting screenshots of tasks performed in previous labs. You’ll organize and present these artifacts to clearly demonstrate how you designed, built, and integrated structured data, multimodal RAG systems, and multi-agent workflows using LangChain, LangGraph, and MCP. This submission will serve as a final evaluation through an AI-based grading system and provide a portfolio-ready showcase of your end-to-end generative AI solution.

Inclus

1 vidéo2 lectures1 élément d'application1 plugin

Obtenez un certificat professionnel

Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.