Building RAG Systems with Open Models

This course is part of Open Generative AI: Build with Open Models and Tools Professional Certificate

Instructor: Professionals from the Industry

Access provided by Veterans Transition Support

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

4 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

4 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your Machine Learning expertise

This course is part of the Open Generative AI: Build with Open Models and Tools Professional Certificate

When you enroll in this course, you'll also be enrolled in this Professional Certificate.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate from Coursera

There are 4 modules in this course

The Building RAG Systems with Open Models course is designed for developers, engineers, and technical product builders who are new to Generative AI but already have intermediate machine learning knowledge, basic Python proficiency, and familiarity with development environments such as VS Code, and who want to engineer, customize, and deploy open generative AI solutions while avoiding vendor lock-in.

The course provides learners with the skills to design and implement retrieval-augmented generation (RAG) applications for real-world use cases. Learners start by exploring the fundamentals of RAG architecture, breaking down key components such as retrievers, rankers, generators, and orchestration layers, while learning design patterns for tasks like question answering, summarization, and knowledge synthesis. They then dive into embeddings and vector databases, comparing FAISS, ChromaDB, Milvus, and Pinecone, and applying indexing and chunking strategies to improve retrieval efficiency and semantic relevance. The final module brings all elements together to build production-ready RAG pipelines using LangChain and open LLMs, incorporating advanced retrieval methods, hallucination mitigation, and evaluation frameworks for accuracy and reliability. By the end, learners will have built a functional RAG application with configurable components, optimized for performance and equipped with robust evaluation metrics.

Learn the fundamentals of Retrieval-Augmented Generation (RAG) and why it’s critical for reducing hallucinations and improving accuracy. You’ll break down RAG’s core components, retrievers, rankers, generators, and orchestration layers, and apply design patterns for use cases like Q&A, summarization, and knowledge synthesis. You’ll also explore advanced variations such as hierarchical retrieval and hybrid search, giving you practical strategies to match RAG designs to real-world needs.

What's included

1 video1 reading1 assignment2 ungraded labs

Evaluate embedding models and vector databases to understand how they impact retrieval quality and system performance. You’ll compare embedding options by dimensionality and domain fit, and explore database choices such as Facebook AI Similarity Search (FAISS), ChromaDB, Milvus, and Pinecone. You’ll also analyze indexing strategies, chunking methods, and update workflows—skills that help you make informed decisions when building retrieval systems for different environments.

What's included

2 videos1 reading1 assignment1 ungraded lab

2 videosTotal 14 minutes

Podcast: Why Choosing the Right Embeddings Makes or Breaks Your System4 minutes
How Database & Embedding Choices Affect RAG9 minutes

1 readingTotal 15 minutes

The Building Blocks: Embeddings and Databases Explained15 minutes

1 assignmentTotal 30 minutes

Which Setup Would You Choose?30 minutes

1 ungraded labTotal 60 minutes

Compare Embeddings and Databases in Action60 minutes

You’ll put theory into practice by integrating embeddings and vector databases into working RAG pipelines. You’ll test indexing strategies, experiment with chunking, and observe how different configurations affect retrieval accuracy and efficiency. You’ll also practice maintaining and updating vector indices, building the skills to manage RAG systems that remain reliable as datasets grow and change.

What's included

1 video1 reading1 assignment2 ungraded labs

Assemble full RAG pipelines using frameworks like LangChain and open Large Language Models (LLMs). You’ll implement advanced retrieval strategies such as hybrid search, re-ranking, and query expansion, and evaluate pipelines with metrics that track accuracy, latency, and reliability. You’ll also practice handling real-world challenges, such as hallucination mitigation and citation tracking, ensuring your systems are not just demos, but production-ready solutions.