• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Degrees
​
Log In
Join for Free
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how models process and combine different inputs such as text, images, audio, or video. You can build skills in feature representation, alignment techniques, evaluation methods, and designing workflows that use multiple data types. Many courses introduce tools like Python libraries, model APIs, and frameworks that support building and testing multimodal AI systems.


Popular Multimodal AI Courses and Certifications


  • Status: Free Trial
    Free Trial
    V

    Vanderbilt University

    ChatGPT & Zapier: Agentic AI for Everyone

    Skills you'll gain: Generative AI Agents, Agentic systems, Agentic Workflows, Generative AI, Email Automation, Prompt Engineering, Make.com, AI Workflows, ChatGPT, Expense Management, Expense Reports, Workflow Management, Artificial Intelligence, Google Sheets, Travel Arrangements, Natural Language Processing

    4.7
    Rating, 4.7 out of 5 stars
    ·
    125 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    U

    University of California, Davis

    AI Agents: From Prompts to Multi-Agent Systems

    Skills you'll gain: Prompt Engineering, AI Workflows, Agentic systems, Generative AI, AI Orchestration, Artificial Intelligence, Responsible AI, Innovation, Algorithms

    4.6
    Rating, 4.6 out of 5 stars
    ·
    112 reviews

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    S

    Starweaver

    ChatGPT (and other AI) for Product Management & Innovation

    Skills you'll gain: AI Product Strategy, Google Gemini, ChatGPT, Product Management, Generative AI, AI Enablement, Agile Product Development, Microsoft Copilot, Competitive Analysis, Market Research, User Story, Product Strategy, Prompt Engineering, LLM Application

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    P

    Packt

    Multimodal RAG with GPT – Build Smarter Search & AI Systems

    Skills you'll gain: Retrieval-Augmented Generation, Development Environment, Multimodal Prompts, Embeddings, User Interface (UI), Generative AI, LLM Application, AI Personalization, Image Analysis, Vector Databases

    Intermediate · Course · 1 - 3 Months

  • Status: Free
    Free
    D

    DeepLearning.AI

    Large Multimodal Model Prompting with Gemini

    Skills you'll gain: Multimodal Prompts, Gemini, Prompt Engineering, Google Gemini, Prompt Patterns, LLM Application, Generative AI, Application Programming Interface (API), Image Analysis

    4.7
    Rating, 4.7 out of 5 stars
    ·
    15 reviews

    Beginner · Project · Less Than 2 Hours

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Pearson

    Programming Generative AI: Unit 3

    Skills you'll gain: Multimodal Prompts, Generative AI, Model Evaluation, Generative Model Architectures, Image Analysis, Embeddings, Transfer Learning, Computer Vision, Performance Tuning

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    J

    Johns Hopkins University

    Training AI with Humans

    Skills you'll gain: Machine Learning, Applied Machine Learning, Research, Model Evaluation, Data Ethics, Artificial Intelligence and Machine Learning (AI/ML), Research Design, Data Collection, Surveys, Human Factors, Classification Algorithms, Experimentation, Data Quality, Statistical Analysis

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    V

    Vanderbilt University

    AI Agent Architecture in Java with Generative AI

    Skills you'll gain: Generative AI Agents, AI Orchestration, LLM Application, OpenAI API, Java Programming, AI Workflows, Agentic systems, Generative AI, Prompt Patterns, Prompt Engineering, Large Language Modeling, Document Management, Secure Coding, Business Logic, Open Web Application Security Project (OWASP), Middleware, Plan Execution, Software Design Patterns, Persona Development

    4.3
    Rating, 4.3 out of 5 stars
    ·
    8 reviews

    Intermediate · Course · 1 - 3 Months

  • Status: Free
    Free
    C

    Coursera

    OpenAI for Beginners: AI Assistants for Project Managers

    Skills you'll gain: No-Code Development, Generative AI, AI Product Strategy, Application Deployment, LLM Application, AI Enablement, OpenAI, Project Management, Prompt Engineering

    3.8
    Rating, 3.8 out of 5 stars
    ·
    28 reviews

    Beginner · Guided Project · Less Than 2 Hours

  • Status: Free Trial
    Free Trial
    G

    Google Cloud

    IA generativa: para além do chatbot

    Skills you'll gain: Responsible AI, Prompt Engineering, Generative AI, AI Enablement, Artificial Intelligence, Google Cloud Platform

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    G

    Google Cloud

    IA generativa: más allá del chatbot

    Skills you'll gain: Prompt Engineering, Responsible AI, Generative AI, AI Enablement, AI Product Strategy, Google Cloud Platform, AI Workflows, LLM Application, Artificial Intelligence

    5
    Rating, 5 out of 5 stars
    ·
    10 reviews

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    M

    Macquarie University

    Create video, audio and infographics for online learning

    Skills you'll gain: Video Production, Infographics, Course Development, Developing Training Materials, Multimedia, Podcasting, Content Creation, Constructive Feedback, Design Reviews, Storyboarding, Education Software and Technology, Design, Design Elements And Principles, Digital pedagogy, Storytelling

    4.8
    Rating, 4.8 out of 5 stars
    ·
    104 reviews

    Beginner · Course · 1 - 3 Months

Searches related to multimodal ai

multimodal generative ai
build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
ai multimodal
multimodal rag with gpt – build smarter search & ai systems
1…567…292

In summary, here are 10 of our most popular multimodal ai courses

  • ChatGPT & Zapier: Agentic AI for Everyone: Vanderbilt University
  • AI Agents: From Prompts to Multi-Agent Systems: University of California, Davis
  • ChatGPT (and other AI) for Product Management & Innovation: Starweaver
  • Multimodal RAG with GPT – Build Smarter Search & AI Systems: Packt
  • Large Multimodal Model Prompting with Gemini: DeepLearning.AI
  • Programming Generative AI: Unit 3: Pearson
  • Training AI with Humans: Johns Hopkins University
  • AI Agent Architecture in Java with Generative AI: Vanderbilt University
  • OpenAI for Beginners: AI Assistants for Project Managers: Coursera
  • IA generativa: para além do chatbot: Google Cloud

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Do Not Sell/Share
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2026 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok