Multimodal AI Courses

Multimodal AI courses can help you learn how models process and combine different inputs such as text, images, audio, or video. You can build skills in feature representation, alignment techniques, evaluation methods, and designing workflows that use multiple data types. Many courses introduce tools like Python libraries, model APIs, and frameworks that support building and testing multimodal AI systems.


Popular Multimodal AI Courses and Certifications


  • Google Cloud

    Status: Preview

    Gemini for Cloud Architects - Français

    Skills you'll gain: Gemini, Google Cloud Platform, Kubernetes, Infrastructure as Code (IaC), Cloud Infrastructure, Generative AI Agents, Cloud Computing Architecture, Application Deployment

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Slides - Italiano

    Skills you'll gain: Google Gemini, Gemini, Google Workspace, Generative AI, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini for Network Engineers - 简体中文

    Skills you'll gain: Gemini, Google Cloud Platform, Generative AI, Prompt Engineering, Virtual Networking, Virtual Private Networks (VPN), AI Enablement, Network Engineering

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Drive - 繁體中文

    Skills you'll gain: Google Gemini, Google Workspace, Cloud Storage, Prompt Engineering, File Management, Generative AI Agents

    Beginner · Course · 1 - 4 Weeks

  • Pearson

    Status: New · Free Trial

    Practical Cybersecurity Fundamentals: Unit 5

    Skills you'll gain: AI Security, MITRE ATT&CK Framework, Responsible AI, Threat Modeling, Cybersecurity, Information Assurance, Threat Detection, Data Ethics, Artificial Intelligence, ChatGPT, Model Evaluation, Information Privacy

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Feature Engineering - 한국어

    Skills you'll gain: Feature Engineering, Data Preprocessing, Tensorflow, Data Pipelines, Data Store, Keras (Neural Network Library), Applied Machine Learning, Model Evaluation, Data Transformation, Data Modeling, Machine Learning

    Intermediate · Course · 1 - 3 Months

  • Google Cloud

    Gemini for Data Scientists and Analysts - Deutsch

    Skills you'll gain: Google Gemini, Google Cloud Platform, Customer Analysis, Generative AI, Marketing Analytics, Analytics, Advanced Analytics, Customer Insights, Data-Driven Decision-Making, Data Analysis, Predictive Modeling, Forecasting

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Meet - Français

    Skills you'll gain: Gemini, Google Workspace, Generative AI, Language Interpretation, Translation, and Studies

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Slides - Español

    Skills you'll gain: Gemini, Generative AI, Productivity, Google Workspace, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Sheets - 日本語版

    Skills you'll gain: Gemini, Prompt Engineering, Google Sheets, Google Workspace, Generative AI, Spreadsheet Software, AI Enablement

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Slides - Français

    Skills you'll gain: Gemini, Generative AI, Google Workspace, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Meet - Bahasa Indonesia

    Skills you'll gain: Gemini, Google Workspace, Generative AI, Language Interpretation, Translation, and Studies, Technical Communication

    Beginner · Course · 1 - 4 Weeks

Searches related to multimodal AI

build multimodal generative ai applications
multimodal and cross-modal ai integrations
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
build a diy multimodal question answering system with vertex ai

In summary, here are 10 of our most popular multimodal AI courses:

  • Gemini for Cloud Architects - Français: Google Cloud
  • Gemini in Google Slides - Italiano: Google Cloud
  • Gemini for Network Engineers - 简体中文: Google Cloud
  • Gemini in Google Drive - 繁體中文: Google Cloud
  • Practical Cybersecurity Fundamentals: Unit 5: Pearson
  • Feature Engineering - 한국어: Google Cloud
  • Gemini for Data Scientists and Analysts - Deutsch: Google Cloud
  • Gemini in Google Meet - Français: Google Cloud
  • Gemini in Google Slides - Español: Google Cloud
  • Gemini in Google Sheets - 日本語版: Google Cloud

© 2026 Coursera Inc. All rights reserved.