Unlocking Data with Generative AI and RAG

Instructor: Packt - Course Instructors

14 modules: gain insight into a topic and learn the fundamentals.

Intermediate level

1 week to complete at 10 hours a week

Flexible schedule: learn at your own pace

What you'll learn

Understand the principles and significance of Retrieval-Augmented Generation (RAG) in AI
Integrate large language models with internal data for improved AI performance
Master vectorization, vector databases, and techniques for efficient data retrieval

Details to know

Shareable certificate: add to your LinkedIn profile

Assessments: 14 assignments

Taught in English

There are 14 modules in this course

Master retrieval-augmented generation (RAG), a widely used generative AI technique, to unlock the full potential of your data. This course helps you develop sought-after skills as corporate investment in generative AI grows.

This course equips learners with the knowledge and skills to use RAG for more intelligent AI applications. It connects theoretical concepts with practical implementation, focusing on real-world use cases and advanced techniques, and is designed for professionals who want to improve AI systems.

It is aimed at AI researchers, data scientists, software developers, and business analysts with a foundational understanding of AI. A basic knowledge of Python and Jupyter Notebooks is required to follow the coding examples.

For non-technical readers, much of the text explains why RAG matters and how it can best be used. For technical readers, the course provides a full RAG pipeline as a coding use case. Each new topic shows how it affects that code, so you can see how coding choices change the capabilities of RAG-based applications.

This module introduces the concept of retrieval-augmented generation (RAG) in generative AI, exploring its architecture, key terminology, and practical implementation. Learners will examine the challenges associated with RAG, compare it to model fine-tuning approaches, and understand how RAG enhances AI applications in real-world contexts.

What's included

1 video, 7 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

7 readings (total 34 minutes)

Introduction (6 minutes)
Challenges of RAG (5 minutes)
RAG vocabulary (5 minutes)
Fine-tuning – full-model fine-tuning (FMFT) and parameter-efficient fine-tuning (PEFT) (5 minutes)
Implementing RAG in AI applications (4 minutes)
Comparing RAG with model fine-tuning (5 minutes)
The architecture of RAG systems (4 minutes)

1 assignment (total 16 minutes)

Exploring Retrieval-Augmented Generation (16 minutes)

In this module, you will build a complete retrieval-augmented generation (RAG) pipeline from scratch, learning how to preprocess data, perform vector indexing, and integrate retrieval and generation using LangChain and Chroma DB. You'll gain hands-on experience with essential libraries, understand the flow of data through the pipeline, and execute queries to see RAG in action.

What's included

1 video, 8 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

8 readings (total 43 minutes)

Introduction (4 minutes)
No interface! (6 minutes)
Imports (4 minutes)
Indexing (6 minutes)
Embedding and indexing the chunks (4 minutes)
Retrieval and generation (5 minutes)
Setting up a LangChain chain using LCEL (5 minutes)
Submitting a question for RAG (9 minutes)

1 assignment (total 16 minutes)

RAG Pipeline Fundamentals (16 minutes)
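
The flow of data through a RAG pipeline (chunk, embed, index, retrieve, then assemble a prompt for generation) can be sketched in plain Python. This is only a toy illustration, not the course's code: it does not use LangChain or Chroma DB, the bag-of-words `embed` function stands in for a real embedding model, and every function name here is hypothetical. The generation step is left as a prompt string that would be sent to an LLM.

```python
import math
import re
from collections import Counter


def chunk_text(text):
    """Split text into sentence-sized chunks (a simplistic chunking strategy)."""
    return [s for s in re.split(r"(?<=\.)\s+", text.strip()) if s]


def embed(text):
    """Toy embedding: word counts. A real pipeline would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_index(chunks):
    """Index step: store each chunk next to its vector."""
    return [(chunk, embed(chunk)) for chunk in chunks]


def retrieve(index, question, k=1):
    """Retrieval step: return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine_similarity(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, context_chunks):
    """Augmentation step: place retrieved context ahead of the question."""
    context = "\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


document = (
    "Chroma stores embeddings for retrieval. "
    "LangChain connects retrievers and language models. "
    "Gradio builds simple web interfaces."
)
index = build_index(chunk_text(document))
question = "Which tool builds web interfaces?"
top_chunks = retrieve(index, question, k=1)
prompt = build_prompt(question, top_chunks)
```

In a real pipeline, `prompt` would be passed to a language model, and the index would live in a vector store rather than a Python list.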

This module explores real-world implementations of retrieval-augmented generation (RAG) in areas such as automated reporting, e-commerce, knowledge management, and innovation scouting. Learners will discover how RAG enhances data analysis, personalizes content, and improves the utility of knowledge bases. Practical exercises will guide you in integrating sources into RAG pipelines for robust, transparent AI solutions.

What's included

1 video, 6 readings, 1 assignment

This module explores the essential building blocks of retrieval-augmented generation (RAG) systems, including indexing, retrieval, prompt engineering, LLM integration, and user interface design. Learners will gain practical insights into how these components interact to create effective RAG applications. By the end, you'll understand both the technical and user-facing aspects necessary for building robust RAG solutions.

What's included

1 video, 5 readings, 1 assignment

This module explores the unique security risks associated with retrieval-augmented generation (RAG) applications, including challenges posed by large language models and external data sources. Learners will investigate common vulnerabilities, such as hallucinations and sensitive information disclosure, and gain hands-on experience with red teaming and defensive strategies. Practical coding labs provide opportunities to secure API keys and implement protective measures against attacks.

What's included

1 video, 7 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

7 readings (total 41 minutes)

Introduction (4 minutes)
RAG security challenges (5 minutes)
Hallucinations (5 minutes)
Common areas to target with red teaming (6 minutes)
Code lab 5.1 – Securing your keys (6 minutes)
Code lab 5.2 – Red team attack! (7 minutes)
Code lab 5.3 – Blue team defend! (8 minutes)

1 assignment (total 16 minutes)

Securing RAG Applications (16 minutes)
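
One common way to keep API keys out of source code is to read them from environment variables and avoid printing them in full. The sketch below shows that general idea only; it is not taken from the course's code lab, and the variable name `EXAMPLE_LLM_API_KEY` and both helper functions are hypothetical.

```python
import os


def load_api_key(var_name, env=None):
    """Read a secret from the environment instead of hard-coding it in a notebook."""
    env = os.environ if env is None else env
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"Set the {var_name} environment variable before running.")
    return key


def mask_key(key, visible=4):
    """Hide all but the last few characters so the key is safe to show in logs."""
    if len(key) <= visible:
        return "*" * len(key)
    return "*" * (len(key) - visible) + key[-visible:]


# Demonstration with a fake environment mapping; real code would rely on os.environ.
fake_env = {"EXAMPLE_LLM_API_KEY": "sk-test-1234"}
api_key = load_api_key("EXAMPLE_LLM_API_KEY", env=fake_env)
masked = mask_key(api_key)
```

Passing `env` explicitly makes the function easy to test without touching the real environment.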

This module introduces the fundamentals of building applications with retrieval-augmented generation (RAG) and demonstrates how to leverage Gradio for creating user-friendly interfaces. Learners will explore the advantages of Gradio, its integration with popular machine learning frameworks, and practical steps for interfacing with RAG models.

What's included

1 video, 2 readings, 1 assignment

This module explores how vectors and vector stores underpin retrieval-augmented generation (RAG) systems, delving into vector representations, embedding models, and the practical considerations for choosing and using vector stores. Learners will gain insights into the impact of vector dimensions, semantic algorithms, and performance factors in real-world RAG applications.

What's included

1 video, 12 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

12 readings (total 67 minutes)

Introduction (5 minutes)
Vector dimensions and size (7 minutes)
Where vectors lurk in your code (6 minutes)
The amount of text you vectorize matters! (5 minutes)
Not all semantics are created equal! (9 minutes)
Word2Vec, Sentence2Vec, and Doc2Vec (5 minutes)
Bidirectional encoder representations from transformers (4 minutes)
OpenAI and other similar large-scale embedding services (6 minutes)
Speed (4 minutes)
Data sources (other than vector) (5 minutes)
Common vector store options (6 minutes)
Choosing a vector store (5 minutes)

1 assignment (total 16 minutes)

Vectors and Vector Stores in Retrieval-Augmented Generation (16 minutes)

This module explores the principles and techniques behind similarity searching using vector representations. Learners will examine semantic versus keyword search, distance metrics like Euclidean distance, and various search paradigms including dense, sparse, and hybrid approaches. Practical labs and real-world tools such as Pinecone and LangChain will help solidify understanding of indexing, search algorithms, and vector search services.

What's included

1 video, 10 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

10 readings (total 67 minutes)

Introduction (5 minutes)
Semantic versus keyword search (6 minutes)
Euclidean distance (L2) (6 minutes)
Different search paradigms – sparse, dense, and hybrid (4 minutes)
Code lab 8.2 – Hybrid search with a custom function (18 minutes)
Code lab 8.3 – Hybrid search with LangChain's EnsembleRetriever to replace our custom function (5 minutes)
Semantic search algorithms (6 minutes)
Enhancing search with indexing techniques (7 minutes)
Vector search options (6 minutes)
Pinecone (4 minutes)

1 assignment (total 16 minutes)

Vector Search Fundamentals (16 minutes)
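
To make the Euclidean distance (L2) and hybrid search topics concrete, here is a minimal sketch in plain Python. It ranks documents by L2 distance (dense), ranks them by shared query words (sparse), and merges the two rankings with reciprocal rank fusion. This is an assumption about one reasonable way to combine rankings, not the custom function from the course's code lab; the sample documents, vectors, and function names are all made up for illustration.

```python
import math


def euclidean_distance(a, b):
    """L2 distance: square root of the summed squared coordinate differences."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of dimensions")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dense_ranking(query_vec, doc_vecs):
    """Document ids ordered from nearest to farthest by L2 distance."""
    return sorted(doc_vecs, key=lambda d: euclidean_distance(query_vec, doc_vecs[d]))


def sparse_ranking(query, docs):
    """Document ids ordered by how many query words they share (keyword search)."""
    q = set(query.lower().split())
    return sorted(docs, key=lambda d: len(q & set(docs[d].lower().split())), reverse=True)


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse rankings: each list contributes 1 / (k + rank) to a document's score."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical documents and hand-written 2-D "embeddings".
docs = {
    "a": "vector search with embeddings",
    "b": "keyword search with bm25",
    "c": "baking sourdough bread",
}
doc_vecs = {"a": [0.9, 0.1], "b": [0.6, 0.4], "c": [0.0, 1.0]}

dense = dense_ranking([1.0, 0.0], doc_vecs)
sparse = sparse_ranking("vector search", docs)
hybrid = reciprocal_rank_fusion([dense, sparse])
```

Fusing by rank rather than by raw score sidesteps the problem that distances and keyword counts live on different scales.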

This module guides learners through the quantitative evaluation of retrieval-augmented generation (RAG) systems using standardized frameworks and visualization tools. You will implement the ragas library to generate synthetic ground truth data, assess retrieval and generation metrics, and explore additional evaluation techniques to optimize RAG pipelines.

What's included

1 video, 10 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

10 readings (total 54 minutes)

Introduction (5 minutes)
Evaluation helps you get better (6 minutes)
Final thoughts on standardized evaluation frameworks (5 minutes)
Code lab 9.1 – ragas (5 minutes)
Setting up LLMs/embedding models (5 minutes)
Generating the synthetic ground truth (7 minutes)
Analyzing the ragas results (5 minutes)
Retrieval evaluation (5 minutes)
End-to-end evaluation (6 minutes)
Additional evaluation techniques (5 minutes)

1 assignment (total 16 minutes)

Quantitative Evaluation and Visualization in RAG (16 minutes)

This module explores the essential building blocks of retrieval-augmented generation (RAG) systems within LangChain, focusing on vector stores, retrievers, and large language models (LLMs). Learners will gain hands-on experience with popular retriever options and understand how these components interact to enable effective information retrieval and generation.

What's included

1 video, 3 readings, 1 assignment

This module explores advanced techniques for enhancing retrieval-augmented generation (RAG) workflows using LangChain. Learners will dive into practical tools such as text splitters and output parsers, gaining hands-on experience with LangChain Expression Language (LCEL) to optimize document processing and result formatting.

What's included

1 video, 4 readings, 1 assignment

This module explores how to enhance retrieval-augmented generation (RAG) pipelines by integrating AI agents using LangGraph. Learners will discover how graph theory concepts, agent state management, and decision-making nodes can be leveraged to build more dynamic and intelligent workflows. Practical coding exercises guide you through implementing and customizing agentic RAG systems.

What's included

1 video, 6 readings, 1 assignment

This module explores effective prompt engineering techniques to enhance retrieval-augmented generation (RAG) systems. Learners will discover strategies for designing, adapting, and optimizing prompts for various large language models, and practice applying these concepts to tasks such as summarization, data extraction, transformation, and expansion.

What's included

1 video, 10 readings, 1 assignment

1 video (total 1 minute)

Overview (1 minute)

10 readings (total 50 minutes)

Introduction (5 minutes)
Top-p (4 minutes)
Take your shot (7 minutes)
Fundamentals of prompt design (4 minutes)
Adapting prompts for different LLMs (7 minutes)
Code lab 13.2 – Prompting options (6 minutes)
Summarizing (5 minutes)
Extracting key data (3 minutes)
Transformation (4 minutes)
Expansion (5 minutes)

1 assignment (total 16 minutes)

Enhancing RAG Performance Through Prompt Design (16 minutes)
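
The top-p (nucleus sampling) setting listed among the readings can be illustrated with a short sketch. Top-p keeps the smallest set of highest-probability tokens whose cumulative probability reaches `p`, renormalizes them, and samples only from that set. The token probabilities below are invented for illustration, and the function names are hypothetical; real LLM APIs apply this internally when you pass a top-p parameter.

```python
import random


def top_p_filter(token_probs, p):
    """Keep the most likely tokens until their cumulative probability reaches p."""
    if not 0 < p <= 1:
        raise ValueError("p must be in the interval (0, 1]")
    kept = {}
    cumulative = 0.0
    for token, prob in sorted(token_probs.items(), key=lambda kv: kv[1], reverse=True):
        kept[token] = prob
        cumulative += prob
        if cumulative >= p:
            break
    # Renormalize so the surviving probabilities sum to 1.
    total = sum(kept.values())
    return {token: prob / total for token, prob in kept.items()}


def sample_top_p(token_probs, p, rng=None):
    """Draw one token from the top-p filtered distribution."""
    rng = rng or random.Random()
    filtered = top_p_filter(token_probs, p)
    tokens = list(filtered)
    return rng.choices(tokens, weights=[filtered[t] for t in tokens], k=1)[0]


# Hypothetical next-token distribution.
next_token_probs = {"the": 0.5, "a": 0.3, "cat": 0.15, "zebra": 0.05}
nucleus = top_p_filter(next_token_probs, p=0.75)
choice = sample_top_p(next_token_probs, p=0.75, rng=random.Random(0))
```

With `p=0.75`, only "the" and "a" survive, so unlikely tokens such as "zebra" can never be sampled; a lower `p` makes output more focused, and a higher `p` allows more variety.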

This module delves into advanced strategies for enhancing retrieval-augmented generation (RAG) systems, including re-ranking, query decomposition, and multi-modal RAG techniques. Learners will gain hands-on experience with code labs and explore methods for integrating text and image data to improve GenAI applications.