• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Degrees
​
Log In
Join for Free
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how models process and combine different inputs such as text, images, audio, or video. You can build skills in feature representation, alignment techniques, evaluation methods, and designing workflows that use multiple data types. Many courses introduce tools like Python libraries, model APIs, and frameworks that support building and testing multimodal AI systems.


Popular Multimodal AI Courses and Certifications


  • G

    Google Cloud

    Gemini for Data Scientists and Analysts-Português Brasileiro

    Skills you'll gain: Google Gemini, Marketing Strategies, Customer Analysis, Big Data, Forecasting, Customer Insights, Predictive Analytics, Analytics, Target Audience, Data Analysis, Marketing Analytics, Artificial Intelligence, Google Cloud Platform, Generative AI

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Drive - 日本語版

    Skills you'll gain: Google Gemini, Gemini, Google Workspace, Prompt Engineering, Gmail, Generative AI, File Management

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini for Data Scientists and Analysts - 한국어

    Skills you'll gain: Gemini, Artificial Intelligence and Machine Learning (AI/ML), Customer Insights, Applied Machine Learning, Customer Analysis, Generative AI, Data Integration, Predictive Analytics, Customer Data Management, Time Series Analysis and Forecasting, Marketing Strategies

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Work with Gemini Models in BigQuery - Deutsch

    Skills you'll gain: Google Gemini, Generative AI, Google Cloud Platform, SQL, Artificial Intelligence and Machine Learning (AI/ML), Big Data, Predictive Modeling, Customer Relationship Management

    Intermediate · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Google Threat Intelligence - Português Brasileiro

    Skills you'll gain: Cyber Threat Intelligence, Cyber Threat Hunting, Threat Detection, Threat Management, Incident Response, Cybersecurity, Google Gemini, Computer Security Incident Management, Vulnerability Management, AI Security, Continuous Monitoring, Infrastructure Security

    Intermediate · Course · 1 - 3 Months

  • G

    Google Cloud

    Pesquisa vetorial e embeddings

    Skills you'll gain: Retrieval-Augmented Generation, Embeddings, Vector Databases, Semantic Web, Artificial Intelligence, Prompt Engineering, Generative AI, Google Cloud Platform, Responsible AI

    Intermediate · Course · 1 - 4 Weeks

  • G

    Google Cloud

    運用 BigQuery 建立嵌入項目、向量搜尋和 RAG

    Skills you'll gain: Generative AI, Google Gemini, Retrieval-Augmented Generation, Embeddings, Prompt Engineering, Vector Databases, Large Language Modeling, LLM Application, Google Cloud Platform

    Advanced · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini for Security Engineers - Deutsch

    Skills you'll gain: Cloud Deployment, Google Gemini, Google Cloud Platform, AI Security, Generative AI, Cloud Security, System Configuration, Vulnerability Assessments

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Docs - Bahasa Indonesia

    Skills you'll gain: Gemini, Google Workspace, Generative AI, Prompt Engineering

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini for Network Engineers - Bahasa Indonesia

    Skills you'll gain: Gemini, Google Cloud Platform, Generative AI, Prompt Engineering, Virtual Networking, General Networking, Network Architecture

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Docs - 한국어

    Skills you'll gain: Gemini, Generative AI, Prompt Engineering, Google Workspace, Document Management, Technical Writing

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    G

    Google Cloud

    Create Image Captioning Models - 繁體中文

    Skills you'll gain: Image Analysis, Model Evaluation, Generative AI, Convolutional Neural Networks, Deep Learning, Embeddings, Vision Transformer (ViT)

    Advanced · Course · 1 - 4 Weeks

Searches related to multimodal ai

multimodal generative ai
build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
ai multimodal
multimodal rag with gpt – build smarter search & ai systems
1…281282283…292

In summary, here are 10 of our most popular multimodal ai courses

  • Gemini for Data Scientists and Analysts-Português Brasileiro: Google Cloud
  • Gemini in Google Drive - 日本語版: Google Cloud
  • Gemini for Data Scientists and Analysts - 한국어: Google Cloud
  • Work with Gemini Models in BigQuery - Deutsch: Google Cloud
  • Google Threat Intelligence - Português Brasileiro: Google Cloud
  • Pesquisa vetorial e embeddings: Google Cloud
  • 運用 BigQuery 建立嵌入項目、向量搜尋和 RAG: Google Cloud
  • Gemini for Security Engineers - Deutsch: Google Cloud
  • Gemini in Google Docs - Bahasa Indonesia: Google Cloud
  • Gemini for Network Engineers - Bahasa Indonesia: Google Cloud

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Do Not Sell/Share
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2026 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok