
Multimodal AI Courses

Multimodal AI courses can help you learn how models process and combine different inputs such as text, images, audio, or video. You can build skills in feature representation, alignment techniques, evaluation methods, and designing workflows that use multiple data types. Many courses introduce tools like Python libraries, model APIs, and frameworks that support building and testing multimodal AI systems.


Popular Multimodal AI Courses and Certifications


  • Status: Free Trial

    Google Cloud

    IA générative : au-delà du chatbot

    Skills you'll gain: Prompt Engineering, Responsible AI, Generative AI, AI Product Strategy, Google Cloud Platform, Business Strategy, Artificial Intelligence

    Beginner · Course · 1 - 3 Months

  • Status: Preview

    Google Cloud

    Create Image Captioning Models - 简体中文

    Skills you'll gain: Image Analysis, Generative AI, Model Evaluation, Convolutional Neural Networks, Generative Model Architectures, Deep Learning

    Advanced · Course · 1 - 4 Weeks

  • Status: Free Trial

    Real Madrid Graduate School Universidad Europea

    Tendencias e innovaciones en los medios deportivos

    Skills you'll gain: AI Personalization, Augmented and Virtual Reality (AR/VR), Live Streaming, Media Production, Media and Communications, Digital Media Strategy, Media Strategy, Revenue Management, Content Creation, Web Content, Brand Management, E-Commerce, Social Media Strategy, Social Media, Community Organizing, Innovation, Data-Driven Decision-Making, Video Production, User Centered Design, Game Theory

    Beginner · Course · 1 - 4 Weeks

  • Status: New

    Packt

    Data Science Model Deployments and Cloud Computing on GCP

    Skills you'll gain: Docker (Software), CI/CD, Model Deployment, Cloud Deployment, Cloud Development, Application Performance Management, Google App Engine

    Intermediate · Course · 1 - 3 Months

  • Status: Preview

    Google Cloud

    Create Image Captioning Models - 繁體中文

    Skills you'll gain: Image Analysis, Model Evaluation, Generative AI, Convolutional Neural Networks, Deep Learning, Embeddings, Vision Transformer (ViT)

    Advanced · Course · 1 - 4 Weeks

  • Google Cloud

    Como criar apps de IA generativa no Google Cloud

    Skills you'll gain: Retrieval-Augmented Generation, Generative AI, LLM Application, Large Language Modeling, Google Cloud Platform, Prompt Engineering, Application Design, Embeddings, Prototyping, Solution Architecture

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial

    Real Madrid Graduate School Universidad Europea

    Trends and Innovations in Sports Media

    Skills you'll gain: Driving engagement, AI Personalization, Media Production, Media Strategy, Media and Communications, Community Organizing, Augmented and Virtual Reality (AR/VR), Digital Media Strategy, Content Creation, Social Media Strategy, Web Content, Social Media, Revenue Management, Innovation, Brand Management, Data-Driven Decision-Making, E-Commerce, Video Production, User Centered Design, Game Theory

    Beginner · Course · 1 - 4 Weeks

  • University of Maryland, College Park

    Create Program Changes with Power Skills & Digital Enablers

    Skills you'll gain: Leadership Studies, Program Management, Stakeholder Management, Organizational Leadership, Digital Transformation, Organizational Change, AI Enablement, Change Management, Workforce Development, Collaboration

    Beginner · Course · 1 - 3 Months

  • Google Cloud

    Gemini for Application Developers - Deutsch

    Skills you'll gain: Gemini, Generative AI, Google Cloud Platform, Application Development, Prompt Patterns, Code Review, Natural Language Processing

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini for Data Scientists and Analysts - 한국어

    Skills you'll gain: Gemini, Artificial Intelligence and Machine Learning (AI/ML), Customer Insights, Applied Machine Learning, Customer Analysis, Generative AI, Data Integration, Predictive Analytics, Customer Data Management, Time Series Analysis and Forecasting, Marketing Strategies

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini for Application Developers - 繁體中文

    Skills you'll gain: Google Gemini, Google Cloud Platform, Generative AI, Prompt Engineering Tools, AI Workflows, Cloud Services, Application Development, Code Review

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview

    Google Cloud

    Attention Mechanism - Português Brasileiro

    Skills you'll gain: Transfer Learning, Recurrent Neural Networks (RNNs), Machine Learning Methods, Artificial Intelligence and Machine Learning (AI/ML), Embeddings, Text Mining

    Intermediate · Course · 1 - 4 Weeks

In summary, here are 10 of our most popular multimodal AI courses:

  • IA générative : au-delà du chatbot: Google Cloud
  • Create Image Captioning Models - 简体中文: Google Cloud
  • Tendencias e innovaciones en los medios deportivos: Real Madrid Graduate School Universidad Europea
  • Data Science Model Deployments and Cloud Computing on GCP: Packt
  • Create Image Captioning Models - 繁體中文: Google Cloud
  • Como criar apps de IA generativa no Google Cloud: Google Cloud
  • Trends and Innovations in Sports Media: Real Madrid Graduate School Universidad Europea
  • Create Program Changes with Power Skills & Digital Enablers: University of Maryland, College Park
  • Gemini for Application Developers - Deutsch: Google Cloud
  • Gemini for Data Scientists and Analysts - 한국어: Google Cloud

© 2026 Coursera Inc. All rights reserved.