Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
Completed by Aswin Antony
October 24, 2024
1 hours (approximately)
Aswin Antony's account is verified. Coursera certifies their successful completion of Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
What you will learn
Extract and store metadata of documents containing both text and images, and generate embeddings the documents.
Search the metadata with text queries to find similar text or images.
Search the metadata with image queries to find similar images.Using a text query as input, search for contextual answers using both text and images.
Skills you will gain
- Category: Embeddings
- Category: Multimodal Prompts
- Category: Google Gemini
- Category: Prompt Engineering
- Category: Metadata Management
- Category: Data Store
- Category: Retrieval-Augmented Generation
- Category: Gemini
- Category: Large Language Modeling
- Category: Artificial Intelligence
- Category: Cloud Computing
- Category: Image Analysis

