Results for "multimodal"


  • Status: Free Trial

    IBM

    Build Multimodal Generative AI Applications

    Skills you'll gain: Multimodal Prompts, LLM Application, Generative Model Architectures, OpenAI API, Application Development, Prompt Engineering, Web Applications, Flask (Web Framework), Web Development, Software Development

    Rating: 4.8 out of 5 stars · 45 reviews

    Intermediate · Course · 1 - 4 Weeks

  • Status: New · Free Trial

    University of Colorado Boulder

    Modern AI Models for Vision and Multimodal Understanding

    Skills you'll gain: Vision Transformer (ViT), Recurrent Neural Networks (RNNs), Multimodal Prompts, Artificial Intelligence and Machine Learning (AI/ML), Embeddings, Digital Signal Processing, Transfer Learning

    Build toward a degree

    Rating: 4.3 out of 5 stars · 6 reviews

    Advanced · Course · 1 - 4 Weeks

  • Status: Free

    DeepLearning.AI

    Building Multimodal Search and RAG

    Skills you'll gain: Retrieval-Augmented Generation, Multimodal Prompts, Embeddings, Large Language Modeling, Generative AI, Vector Databases, Image Analysis, Applied Machine Learning

    Rating: 4.5 out of 5 stars · 40 reviews

    Intermediate · Project · Less Than 2 Hours

  • Status: Free Trial

    IBM

    IBM RAG and Agentic AI

    Skills you'll gain: Prompt Engineering, AI Orchestration, AI Workflows, Model Context Protocol, LangChain, Retrieval-Augmented Generation, Agentic Workflows, Tool Calling, LangGraph, LLM Application, Agentic Systems, Multimodal Prompts, Generative AI, Generative AI Agents, Vector Databases, Generative Model Architectures, OpenAI API, Embeddings, Responsible AI, Software Development

    Rating: 4.6 out of 5 stars · 672 reviews

    Advanced · Professional Certificate · 3 - 6 Months

  • Status: Free Trial

    Codio

    Multimodal Generative AI: Vision, Speech, and Assistants

    Skills you'll gain: OpenAI API, OpenAI, Image Analysis, Generative AI, ChatGPT, LLM Application, Multimodal Prompts, Application Programming Interface (API), Large Language Modeling, Artificial Intelligence, Natural Language Processing, Computer Vision

    Beginner · Course · 1 - 4 Weeks

  • Status: Free

    DeepLearning.AI

    Introducing Multimodal Llama 3.2

    Skills you'll gain: Tool Calling, LLM Application, Multimodal Prompts, Prompt Patterns, Prompt Engineering, Large Language Modeling

    Rating: 4.6 out of 5 stars · 12 reviews

    Beginner · Project · Less Than 2 Hours

  • Status: New · Free Trial

    Pearson

    Programming Generative AI

    Skills you'll gain: Generative AI, Large Language Modeling, PyTorch (Machine Learning Library), Generative Model Architectures, Multimodal Prompts, Image Analysis, Model Evaluation, Autoencoders, Hugging Face, Computer Vision, Convolutional Neural Networks, Artificial Neural Networks, LLM Application, Natural Language Processing, Deep Learning, Embeddings, TensorFlow, Transfer Learning, Performance Tuning

    Intermediate · Specialization · 1 - 4 Weeks

  • Status: New · Free Trial

    Packt

    Retrieval Augmented Generation

    Skills you'll gain: Retrieval-Augmented Generation, Large Language Modeling, LLM Application, Development Environment, Multimodal Prompts, Embeddings, Vector Databases, User Interface (UI), Generative AI, AI Workflows, AI Personalization, Prompt Engineering, Agentic Systems, Data Visualization, Image Analysis, Application Development, Augmented Reality, Text Mining, Graph Theory, Query Languages

    Rating: 4.5 out of 5 stars · 14 reviews

    Intermediate · Specialization · 3 - 6 Months

  • Status: Free Trial

    Packt

    Multimodal RAG with GPT – Build Smarter Search & AI Systems

    Skills you'll gain: Retrieval-Augmented Generation, Development Environment, Multimodal Prompts, Embeddings, User Interface (UI), Generative AI, LLM Application, AI Personalization, Image Analysis, Vector Databases

    Intermediate · Course · 1 - 3 Months

  • Status: Preview

    University of Illinois Urbana-Champaign

    Multimodal Literacies: Communication and Learning in the Era of Digital Media

    Skills you'll gain: Differentiated Instruction, Teaching, Instructional Strategies, Digital Pedagogy, Literacy, Oral Expression, Digital Literacy, Learning Styles, Media and Communications, Multimedia, Higher Education, Cultural Diversity, Writing, Non-Verbal Communication

    Rating: 4.7 out of 5 stars · 176 reviews

    Mixed · Course · 1 - 4 Weeks

  • Status: Free Trial

    Scrimba

    Generative AI for Web Development

    Skills you'll gain: Prompt Engineering, Anthropic Claude, Vibe Coding, Model Context Protocol, OpenAI API, LLM Application, Context Management, Debugging, ChatGPT, Generative AI, Large Language Modeling, Multimodal Prompts, Pseudocode, AI Workflows, Artificial Intelligence, Responsible AI, AI Enablement, Software Installation, Web Development Tools, Software Development

    Rating: 4.4 out of 5 stars · 169 reviews

    Intermediate · Specialization · 1 - 3 Months

  • Google Cloud

    Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API

    Skills you'll gain: Gemini, Google Gemini, Multimodal Prompts, Retrieval-Augmented Generation, Query Languages, Data Manipulation, Data Store, Embeddings, Metadata Management, Document Management, Text Mining, Data Capture, Cloud API, Image Analysis, Cloud Computing, Artificial Intelligence

    Rating: 4.1 out of 5 stars · 8 reviews

    Intermediate · Project · Less Than 2 Hours

In summary, here are 10 of our most popular multimodal courses

  • Build Multimodal Generative AI Applications: IBM
  • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
  • Building Multimodal Search and RAG: DeepLearning.AI
  • IBM RAG and Agentic AI: IBM
  • Multimodal Generative AI: Vision, Speech, and Assistants: Codio
  • Introducing Multimodal Llama 3.2: DeepLearning.AI
  • Programming Generative AI: Pearson
  • Retrieval Augmented Generation: Packt
  • Multimodal RAG with GPT – Build Smarter Search & AI Systems: Packt
  • Multimodal Literacies: Communication and Learning in the Era of Digital Media: University of Illinois Urbana-Champaign

© 2026 Coursera Inc. All rights reserved.