• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
  • Online Degrees
  • Careers
  • Log In
  • Join for Free
    Coursera
    • Browse
    • Multimodal
    Skip to search results

    Filter by

    Subject
    Required
     *

    Language
    Required
     *

    The language used throughout the course, in both instruction and assessments.

    Learning Product
    Required
     *

    Build job-relevant skills in under 2 hours with hands-on tutorials.
    Learn from top instructors with graded assignments, videos, and discussion forums.
    Learn a new tool or skill in an interactive, hands-on environment.
    Get in-depth knowledge of a subject by completing a series of courses and projects.
    Earn career credentials from industry leaders that demonstrate your expertise.

    Level
    Required
     *

    Duration
    Required
     *

    Skills
    Required
     *

    Subtitles
    Required
     *

    Educator
    Required
     *

    Results for "multimodal"

    • Status: New
      New
      Status: Free Trial
      Free Trial
      U

      University of Colorado Boulder

      Modern AI Models for Vision and Multimodal Understanding

      Skills you'll gain: Linear Algebra

      Build toward a degree

      Advanced · Course · 1 - 4 Weeks

    • Status: Free
      Free
      D

      DeepLearning.AI

      Building Multimodal Search and RAG

      Skills you'll gain: Large Language Modeling, Generative AI, Prompt Engineering, Artificial Intelligence and Machine Learning (AI/ML), Image Analysis, Applied Machine Learning, Unsupervised Learning, Unstructured Data

      4.5
      Rating, 4.5 out of 5 stars
      ·
      33 reviews

      Intermediate · Project · Less Than 2 Hours

    • Status: New
      New
      Status: Free Trial
      Free Trial
      U

      University of Colorado Boulder

      Computer Vision

      Skills you'll gain: Image Analysis, Computer Vision, Computer Graphics, Unsupervised Learning, Deep Learning, Visualization (Computer Graphics), Artificial Intelligence and Machine Learning (AI/ML), Applied Machine Learning, Data Ethics, Artificial Intelligence, Computational Thinking, Data Processing, Linear Algebra, Computational Logic, Probability Distribution

      Build toward a degree

      4.3
      Rating, 4.3 out of 5 stars
      ·
      12 reviews

      Intermediate · Specialization · 1 - 3 Months

    • Status: New
      New
      Status: Free
      Free
      D

      DeepLearning.AI

      Large Multimodal Model Prompting with Gemini

      Skills you'll gain: Prompt Engineering, Generative AI, Application Programming Interface (API), Image Analysis, Text Mining, Application Development, Real Time Data

      Beginner · Project · Less Than 2 Hours

    • Status: New
      New
      Status: Free Trial
      Free Trial
      I

      IBM

      Build Multimodal Generative AI Applications

      Skills you'll gain: OpenAI, Application Development, Prompt Engineering, Web Applications, Flask (Web Framework), Web Development, Software Development

      4.8
      Rating, 4.8 out of 5 stars
      ·
      15 reviews

      Intermediate · Course · 1 - 4 Weeks

    • Status: Free
      Free
      D

      DeepLearning.AI

      Introducing Multimodal Llama 3.2

      Skills you'll gain: Prompt Engineering, Large Language Modeling, Generative AI, API Design, Application Programming Interface (API), Open Source Technology, Computer Vision

      4.5
      Rating, 4.5 out of 5 stars
      ·
      11 reviews

      Beginner · Project · Less Than 2 Hours

    What brings you to Coursera today?

    • Status: Free Trial
      Free Trial
      C

      Codio

      Multimodal Generative AI: Vision, Speech, and Assistants

      Skills you'll gain: OpenAI, Image Analysis, Generative AI, Application Programming Interface (API), Large Language Modeling, Prompt Engineering, Artificial Intelligence, Natural Language Processing, Computer Vision

      Beginner · Course · 1 - 4 Weeks

    • G

      Google Cloud

      Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API

      Skills you'll gain: Query Languages, Data Manipulation, Metadata Management, Text Mining, Cloud API, Generative AI, Google Cloud Platform, Image Analysis, Cloud Computing, Artificial Intelligence

      4
      Rating, 4 out of 5 stars
      ·
      7 reviews

      Intermediate · Project · Less Than 2 Hours

    • G

      Google Cloud

      Multimodal Use Cases with Gemini 1.5

      Skills you'll gain: Image Analysis, Large Language Modeling, Text Mining, Google Cloud Platform, Computer Vision, Prompt Engineering, Generative AI, Data Processing, AI Personalization, Document Management

      Beginner · Project · Less Than 2 Hours

    • Status: Free
      Free
      D

      DeepLearning.AI

      Open Source Models with Hugging Face

      Skills you'll gain: Generative AI, Cloud Applications, Natural Language Processing, Application Deployment, Open Source Technology, Applied Machine Learning, User Interface (UI), API Design, Computer Vision

      4.7
      Rating, 4.7 out of 5 stars
      ·
      62 reviews

      Beginner · Project · Less Than 2 Hours

    • G

      Google Cloud

      Introduction to Vertex AI Studio

      Skills you'll gain: Prompt Engineering, Generative AI, Product Lifecycle Management, Project Design, Application Lifecycle Management, Performance Tuning, Application Development, Application Deployment

      4.7
      Rating, 4.7 out of 5 stars
      ·
      230 reviews

      Beginner · Course · 1 - 4 Weeks

    • Status: Free
      Free
      D

      DeepLearning.AI

      Multi AI Agent Systems with crewAI

      Skills you'll gain: Generative AI Agents, Artificial Intelligence, Agentic systems, Business Process Automation, Automation, AI Personalization, Debugging, Prompt Engineering, Coordinating, Prioritization

      4.8
      Rating, 4.8 out of 5 stars
      ·
      229 reviews

      Beginner · Project · Less Than 2 Hours

    Searches related to multimodal

    multimodal generative ai: vision, speech, and assistants
    multimodal retrieval augmented generation (rag) using the vertex ai gemini api
    multimodal rag with gpt – build smarter search & ai systems
    multimodality with gemini
    multimodal use cases with gemini 1.5
    multimodal literacies: communication and learning in the era of digital media
    build multimodal generative ai applications
    large multimodal model prompting with gemini
    1234…12

    In summary, here are 10 of our most popular multimodal courses

    • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
    • Building Multimodal Search and RAG: DeepLearning.AI
    • Computer Vision: University of Colorado Boulder
    • Large Multimodal Model Prompting with Gemini: DeepLearning.AI
    • Build Multimodal Generative AI Applications: IBM
    • Introducing Multimodal Llama 3.2: DeepLearning.AI
    • Multimodal Generative AI: Vision, Speech, and Assistants: Codio
    • Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API: Google Cloud
    • Multimodal Use Cases with Gemini 1.5: Google Cloud
    • Open Source Models with Hugging Face: DeepLearning.AI

    Other topics to explore

    Arts and Humanities
    338 courses
    Business
    1095 courses
    Computer Science
    668 courses
    Data Science
    425 courses
    Information Technology
    145 courses
    Health
    471 courses
    Math and Logic
    70 courses
    Personal Development
    137 courses
    Physical Science and Engineering
    413 courses
    Social Sciences
    401 courses
    Language Learning
    150 courses

    Coursera Footer

    Technical Skills

    • ChatGPT
    • Coding
    • Computer Science
    • Cybersecurity
    • DevOps
    • Ethical Hacking
    • Generative AI
    • Java Programming
    • Python
    • Web Development

    Analytical Skills

    • Artificial Intelligence
    • Big Data
    • Business Analysis
    • Data Analytics
    • Data Science
    • Financial Modeling
    • Machine Learning
    • Microsoft Excel
    • Microsoft Power BI
    • SQL

    Business Skills

    • Accounting
    • Digital Marketing
    • E-commerce
    • Finance
    • Google
    • Graphic Design
    • IBM
    • Marketing
    • Project Management
    • Social Media Marketing

    Career Resources

    • Essential IT Certifications
    • High-Income Skills to Learn
    • How to Get a PMP Certification
    • How to Learn Artificial Intelligence
    • Popular Cybersecurity Certifications
    • Popular Data Analytics Certifications
    • What Does a Data Analyst Do?
    • Career Development Resources
    • Career Aptitude Test
    • Share your Coursera Learning Story

    Coursera

    • About
    • What We Offer
    • Leadership
    • Careers
    • Catalog
    • Coursera Plus
    • Professional Certificates
    • MasterTrack® Certificates
    • Degrees
    • For Enterprise
    • For Government
    • For Campus
    • Become a Partner
    • Social Impact
    • Free Courses
    • ECTS Credit Recommendations

    Community

    • Learners
    • Partners
    • Beta Testers
    • Blog
    • The Coursera Podcast
    • Tech Blog

    More

    • Press
    • Investors
    • Terms
    • Privacy
    • Help
    • Accessibility
    • Contact
    • Articles
    • Directory
    • Affiliates
    • Modern Slavery Statement
    • Do Not Sell/Share
    Learn Anywhere
    Download on the App Store
    Get it on Google Play
    Logo of Certified B Corporation
    © 2025 Coursera Inc. All rights reserved.
    • Coursera Facebook
    • Coursera Linkedin
    • Coursera Twitter
    • Coursera YouTube
    • Coursera Instagram
    • Coursera TikTok