• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Degrees
​
Log In
Join for Free
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how models process and combine different inputs such as text, images, audio, or video. You can build skills in feature representation, alignment techniques, evaluation methods, and designing workflows that use multiple data types. Many courses introduce tools like Python libraries, model APIs, and frameworks that support building and testing multimodal AI systems.


Popular Multimodal AI Courses and Certifications


  • G

    Google Cloud

    Gemini in Google Slides - Español

    Skills you'll gain: Gemini, Generative AI, Productivity, Google Workspace, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Slides - Italiano

    Skills you'll gain: Google Gemini, Gemini, Google Workspace, Generative AI, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Meet - Français

    Skills you'll gain: Gemini, Google Workspace, Generative AI, Language Interpretation, Translation, and Studies

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Drive - Bahasa Indonesia

    Skills you'll gain: Google Gemini, Google Workspace, Prompt Engineering, Generative AI, AI Personalization, File Management

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    G

    Google Cloud

    Gemini in Google Docs - Deutsch

    Skills you'll gain: Gemini, Google Workspace, Generative AI, Prompt Engineering Tools, Prompt Engineering, Productivity

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Docs - Italiano

    Skills you'll gain: Google Gemini, Gemini, Generative AI, Productivity, Google Workspace, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    G

    Google Cloud

    Google Vids로 멋진 동영상 만들기

    Skills you'll gain: Video Editing, Generative AI, Photo/Video Production and Technology, Video Production, Multimedia, Google Workspace, Animations, Storytelling

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    P

    Pearson

    Learn JavaScript: Write Modern Code with JavaScript ESNext

    Skills you'll gain: Javascript, Scripting, Node.JS, TypeScript, Data Manipulation, JSON, Web Development Tools, Generative AI, Server Side, Data Structures, Programming Principles, Object Oriented Programming (OOP), Web Servers, Development Environment

    Intermediate · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Slides - Bahasa Indonesia

    Skills you'll gain: Gemini, Google Workspace, AI Workflows, Generative AI, AI Enablement, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Docs 繁體中文

    Skills you'll gain: Google Gemini, Google Workspace, Generative AI, Prompt Engineering, Document Management

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini for Data Scientists and Analysts - 한국어

    Skills you'll gain: Gemini, Artificial Intelligence and Machine Learning (AI/ML), Customer Insights, Applied Machine Learning, Customer Analysis, Generative AI, Data Integration, Predictive Analytics, Customer Data Management, Time Series Analysis and Forecasting, Marketing Strategies

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    G

    Google Cloud

    Accelerate App Development with Gemini CLI

    Skills you'll gain: Google Gemini, Gemini, Command-Line Interface, Code Review, Web Development Tools, Computer Programming Tools, Secure Coding, Model Context Protocol, Software Installation, Application Security, Configuration Management

    Beginner · Course · 1 - 4 Weeks

Searches related to multimodal ai

build multimodal generative ai applications
multimodal and cross-modal ai integrations
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
build a diy multimodal question answering system with vertex ai
1…271272273…292

In summary, here are 10 of our most popular multimodal ai courses

  • Gemini in Google Slides - Español: Google Cloud
  • Gemini in Google Slides - Italiano: Google Cloud
  • Gemini in Google Meet - Français: Google Cloud
  • Gemini in Google Drive - Bahasa Indonesia: Google Cloud
  • Gemini in Google Docs - Deutsch: Google Cloud
  • Gemini in Google Docs - Italiano: Google Cloud
  • Google Vids로 멋진 동영상 만들기: Google Cloud
  • Learn JavaScript: Write Modern Code with JavaScript ESNext: Pearson
  • Gemini in Google Slides - Bahasa Indonesia: Google Cloud
  • Gemini in Google Docs 繁體中文: Google Cloud

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Do Not Sell/Share
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2026 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok