The Building RAG Systems with Open Models course is designed for developers, engineers, and technical product builders who are new to Generative AI but already have intermediate machine learning knowledge, basic Python proficiency, and familiarity with development environments such as VS Code, and who want to engineer, customize, and deploy open generative AI solutions while avoiding vendor lock-in.

Building RAG Systems with Open Models

Building RAG Systems with Open Models
This course is part of Open Generative AI: Build with Open Models and Tools Professional Certificate

Instructor: Professionals from the Industry
Access provided by Veterans Transition Support
Recommended experience
Details to know

Add to your LinkedIn profile
4 assignments
February 2026
See how employees at top companies are mastering in-demand skills

Build your Machine Learning expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Coursera

There are 4 modules in this course
Learn the fundamentals of Retrieval-Augmented Generation (RAG) and why it’s critical for reducing hallucinations and improving accuracy. You’ll break down RAG’s core components, retrievers, rankers, generators, and orchestration layers, and apply design patterns for use cases like Q&A, summarization, and knowledge synthesis. You’ll also explore advanced variations such as hierarchical retrieval and hybrid search, giving you practical strategies to match RAG designs to real-world needs.
What's included
1 video1 reading1 assignment2 ungraded labs
Evaluate embedding models and vector databases to understand how they impact retrieval quality and system performance. You’ll compare embedding options by dimensionality and domain fit, and explore database choices such as Facebook AI Similarity Search (FAISS), ChromaDB, Milvus, and Pinecone. You’ll also analyze indexing strategies, chunking methods, and update workflows—skills that help you make informed decisions when building retrieval systems for different environments.
What's included
2 videos1 reading1 assignment1 ungraded lab
You’ll put theory into practice by integrating embeddings and vector databases into working RAG pipelines. You’ll test indexing strategies, experiment with chunking, and observe how different configurations affect retrieval accuracy and efficiency. You’ll also practice maintaining and updating vector indices, building the skills to manage RAG systems that remain reliable as datasets grow and change.
What's included
1 video1 reading1 assignment2 ungraded labs
Assemble full RAG pipelines using frameworks like LangChain and open Large Language Models (LLMs). You’ll implement advanced retrieval strategies such as hybrid search, re-ranking, and query expansion, and evaluate pipelines with metrics that track accuracy, latency, and reliability. You’ll also practice handling real-world challenges, such as hallucination mitigation and citation tracking, ensuring your systems are not just demos, but production-ready solutions.
What's included
4 videos1 reading1 assignment2 ungraded labs
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.






