Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
Completed by Ravi Kumar
October 2, 2024
1 hours (approximately)
Ravi Kumar's account is verified. Coursera certifies their successful completion of Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API
What you will learn
Extract and store metadata of documents containing both text and images, and generate embeddings the documents.
Search the metadata with text queries to find similar text or images.
Search the metadata with image queries to find similar images.Using a text query as input, search for contextual answers using both text and images.
Skills you will gain
- Category: Large Language Modeling
- Category: Cloud Computing
- Category: Image Analysis
- Category: Artificial Intelligence
- Category: Data Store
- Category: Prompt Engineering
- Category: Metadata Management
- Category: Gemini
- Category: Embeddings
- Category: Retrieval-Augmented Generation
- Category: Multimodal Prompts
- Category: Google Gemini

