Master Retrieval-Augmented Generation (RAG), the most popular generative AI tool, to unlock the full potential of your data. This course enables you to develop highly sought-after skills as corporate investment in generative AI soars.

Unlocking Data with Generative AI and RAG

Unlocking Data with Generative AI and RAG

Instructor: Packt - Course Instructors
Access provided by Indonesia Cyber Education Institute
Recommended experience
Recommended experience
What you'll learn
Understand the principles and significance of Retrieval-Augmented Generation (RAG) in AI
Integrate large language models with internal data for improved AI performance
Master vectorization, vector databases, and techniques for efficient data retrieval
Details to know

Add to your LinkedIn profile
14 assignments
June 2026
See how employees at top companies are mastering in-demand skills

There are 14 modules in this course
This module introduces the concept of retrieval-augmented generation (RAG) in generative AI, exploring its architecture, key terminology, and practical implementation. Learners will examine the challenges associated with RAG, compare it to model fine-tuning approaches, and understand how RAG enhances AI applications in real-world contexts.
What's included
1 video7 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
7 readings•Total 34 minutes
- Introduction•6 minutes
- Challenges of RAG•5 minutes
- RAG vocabulary•5 minutes
- Fine-tuning – full-model fine-tuning (FMFT) and parameter-efficient fine-tuning (PEFT)•5 minutes
- Implementing RAG in AI applications•4 minutes
- Comparing RAG with model fine-tuning•5 minutes
- The architecture of RAG systems•4 minutes
1 assignment•Total 16 minutes
- Exploring Retrieval-Augmented Generation•16 minutes
In this module, you will build a complete retrieval-augmented generation (RAG) pipeline from scratch, learning how to preprocess data, perform vector indexing, and integrate retrieval and generation using LangChain and Chroma DB. You'll gain hands-on experience with essential libraries, understand the flow of data through the pipeline, and execute queries to see RAG in action.
What's included
1 video8 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
8 readings•Total 43 minutes
- Introduction•4 minutes
- No interface!•6 minutes
- Imports•4 minutes
- Indexing•6 minutes
- Embedding and indexing the chunks•4 minutes
- Retrieval and generation•5 minutes
- Setting up a LangChain chain using LCEL•5 minutes
- Submitting a question for RAG•9 minutes
1 assignment•Total 16 minutes
- RAG Pipeline Fundamentals•16 minutes
This module explores real-world implementations of retrieval-augmented generation (RAG) in areas such as automated reporting, e-commerce, knowledge management, and innovation scouting. Learners will discover how RAG enhances data analysis, personalizes content, and improves the utility of knowledge bases. Practical exercises will guide you in integrating sources into RAG pipelines for robust, transparent AI solutions.
What's included
1 video6 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
6 readings•Total 30 minutes
- Introduction•6 minutes
- RAG for automated reporting•5 minutes
- E-commerce support•5 minutes
- Utilizing knowledge bases with RAG•4 minutes
- Innovation scouting and trend analysis•5 minutes
- Code lab 3.1 – Adding sources to your RAG•5 minutes
1 assignment•Total 16 minutes
- Practical Applications of RAG•16 minutes
This module explores the essential building blocks of retrieval-augmented generation (RAG) systems, including indexing, retrieval, prompt engineering, LLM integration, and user interface design. Learners will gain practical insights into how these components interact to create effective RAG applications. By the end, you'll understand both the technical and user-facing aspects necessary for building robust RAG solutions.
What's included
1 video5 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
5 readings•Total 28 minutes
- Introduction•8 minutes
- Retrieval and generation•5 minutes
- Prompting•4 minutes
- Defining your LLM•4 minutes
- UI•7 minutes
1 assignment•Total 16 minutes
- Exploring RAG System Architecture•16 minutes
This module explores the unique security risks associated with retrieval-augmented generation (RAG) applications, including challenges posed by large language models and external data sources. Learners will investigate common vulnerabilities, such as hallucinations and sensitive information disclosure, and gain hands-on experience with red teaming and defensive strategies. Practical coding labs provide opportunities to secure API keys and implement protective measures against attacks.
What's included
1 video7 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
7 readings•Total 41 minutes
- Introduction•4 minutes
- RAG security challenges•5 minutes
- Hallucinations•5 minutes
- Common areas to target with red teaming•6 minutes
- Code lab 5.1 – Securing your keys•6 minutes
- Code lab 5.2 – Red team attack!•7 minutes
- Code lab 5.3 – Blue team defend!•8 minutes
1 assignment•Total 16 minutes
- Securing RAG Applications•16 minutes
This module introduces the fundamentals of building applications with retrieval-augmented generation (RAG) and demonstrates how to leverage Gradio for creating user-friendly interfaces. Learners will explore the advantages of Gradio, its integration with popular machine learning frameworks, and practical steps for interfacing with RAG models.
What's included
1 video2 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
2 readings•Total 16 minutes
- Introduction•4 minutes
- Benefits of using Gradio•12 minutes
1 assignment•Total 16 minutes
- RAG and Gradio Fundamentals•16 minutes
This module explores how vectors and vector stores underpin retrieval-augmented generation (RAG) systems, delving into vector representations, embedding models, and the practical considerations for choosing and using vector stores. Learners will gain insights into the impact of vector dimensions, semantic algorithms, and performance factors in real-world RAG applications.
What's included
1 video12 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
12 readings•Total 67 minutes
- Introduction•5 minutes
- Vector dimensions and size•7 minutes
- Where vectors lurk in your code•6 minutes
- The amount of text you vectorize matters!•5 minutes
- Not all semantics are created equal!•9 minutes
- Word2Vec, Sentence2Vec, and Doc2Vec•5 minutes
- Bidirectional encoder representations from transformers•4 minutes
- OpenAI and other similar large-scale embedding services•6 minutes
- Speed•4 minutes
- Data sources (other than vector)•5 minutes
- Common vector store options•6 minutes
- Choosing a vector store•5 minutes
1 assignment•Total 16 minutes
- Vectors and Vector Stores in Retrieval-Augmented Generation•16 minutes
This module explores the principles and techniques behind similarity searching using vector representations. Learners will examine semantic versus keyword search, distance metrics like Euclidean distance, and various search paradigms including dense, sparse, and hybrid approaches. Practical labs and real-world tools such as Pinecone and LangChain will help solidify understanding of indexing, search algorithms, and vector search services.
What's included
1 video10 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
10 readings•Total 67 minutes
- Introduction•5 minutes
- Semantic versus keyword search•6 minutes
- Euclidean distance (L2)•6 minutes
- Different search paradigms – sparse, dense, and hybrid•4 minutes
- Code lab 8.2 – Hybrid search with a custom function•18 minutes
- Code lab 8.3 – Hybrid search with LangChain's EnsembleRetriever to replace our custom function•5 minutes
- Semantic search algorithms•6 minutes
- Enhancing search with indexing techniques•7 minutes
- Vector search options•6 minutes
- Pinecone•4 minutes
1 assignment•Total 16 minutes
- Vector Search Fundamentals•16 minutes
This module guides learners through the quantitative evaluation of retrieval-augmented generation (RAG) systems using standardized frameworks and visualization tools. You will implement the ragas library to generate synthetic ground truth data, assess retrieval and generation metrics, and explore additional evaluation techniques to optimize RAG pipelines.
What's included
1 video10 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
10 readings•Total 54 minutes
- Introduction•5 minutes
- Evaluation helps you get better•6 minutes
- Final thoughts on standardized evaluation frameworks•5 minutes
- Code lab 9.1 – ragas•5 minutes
- Setting up LLMs/embedding models•5 minutes
- Generating the synthetic ground truth•7 minutes
- Analyzing the ragas results•5 minutes
- Retrieval evaluation•5 minutes
- End-to-end evaluation•6 minutes
- Additional evaluation techniques•5 minutes
1 assignment•Total 16 minutes
- Quantitative Evaluation and Visualization in RAG•16 minutes
This module explores the essential building blocks of retrieval-augmented generation (RAG) systems within LangChain, focusing on vector stores, retrievers, and large language models (LLMs). Learners will gain hands-on experience with popular retriever options and understand how these components interact to enable effective information retrieval and generation.
What's included
1 video3 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
3 readings•Total 41 minutes
- Introduction•14 minutes
- Code lab 10.2 – LangChain Retrievers•14 minutes
- Code lab 10.3 – LangChain LLMs•13 minutes
1 assignment•Total 16 minutes
- LangChain RAG Components and Vector Search•16 minutes
This module explores advanced techniques for enhancing retrieval-augmented generation (RAG) workflows using LangChain. Learners will dive into practical tools such as text splitters and output parsers, gaining hands-on experience with LangChain Expression Language (LCEL) to optimize document processing and result formatting.
What's included
1 video4 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
4 readings•Total 31 minutes
- Introduction•9 minutes
- Code lab 11.2 – Text splitters•8 minutes
- Recursive character text splitter•5 minutes
- Code lab 11.3 – Output parsers•9 minutes
1 assignment•Total 16 minutes
- LangChain and RAG Techniques•16 minutes
This module explores how to enhance retrieval-augmented generation (RAG) pipelines by integrating AI agents using LangGraph. Learners will discover how graph theory concepts, agent state management, and decision-making nodes can be leveraged to build more dynamic and intelligent workflows. Practical coding exercises guide you through implementing and customizing agentic RAG systems.
What's included
1 video6 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
6 readings•Total 38 minutes
- Introduction•6 minutes
- Graphs, AI agents, and LangGraph•3 minutes
- Code lab 12.1 – adding a LangGraph agent to RAG•8 minutes
- Agent state•4 minutes
- Nodes and edges in our agent•8 minutes
- Cyclical graph setup•9 minutes
1 assignment•Total 16 minutes
- AI Agents and LangGraph Fundamentals•16 minutes
This module explores effective prompt engineering techniques to enhance retrieval-augmented generation (RAG) systems. Learners will discover strategies for designing, adapting, and optimizing prompts for various large language models, and practice applying these concepts to tasks such as summarization, data extraction, transformation, and expansion.
What's included
1 video10 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
10 readings•Total 50 minutes
- Introduction•5 minutes
- Top-p•4 minutes
- Take your shot•7 minutes
- Fundamentals of prompt design•4 minutes
- Adapting prompts for different LLMs•7 minutes
- Code lab 13.2 – Prompting options•6 minutes
- Summarizing•5 minutes
- Extracting key data•3 minutes
- Transformation•4 minutes
- Expansion•5 minutes
1 assignment•Total 16 minutes
- Enhancing RAG Performance Through Prompt Design•16 minutes
This module delves into advanced strategies for enhancing retrieval-augmented generation (RAG) systems, including re-ranking, query decomposition, and multi-modal RAG techniques. Learners will gain hands-on experience with code labs and explore methods for integrating text and image data to improve GenAI applications.
What's included
1 video6 readings1 assignment
1 video•Total 1 minute
- Overview•1 minute
6 readings•Total 55 minutes
- Introduction•4 minutes
- Re-ranking in hybrid RAG•8 minutes
- Code lab 14.2 – Query decomposition•9 minutes
- Code lab 14.3 – MM-RAG•5 minutes
- Introducing MM-RAG in code•22 minutes
- Other advanced RAG techniques to explore•7 minutes
1 assignment•Total 16 minutes
- Enhancing Retrieval with Advanced Techniques•16 minutes
Instructor

Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Explore more from Data Science
PPackt
Course

Specialization
DDuke University
Guided Project

