• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Degrees
Log In
Join for Free
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how models process and combine different inputs such as text, images, audio, or video. You can build skills in feature representation, alignment techniques, evaluation methods, and designing workflows that use multiple data types. Many courses introduce tools like Python libraries, model APIs, and frameworks that support building and testing multimodal AI systems.


Popular Multimodal AI Courses and Certifications


  • G

    Google Cloud

    Build an application to send Chat Prompts using the Gemini model

    Skills you'll gain: Gemini, Generative AI, LLM Application, Prompt Engineering, Google Cloud Platform, AI Enablement, AI Personalization

    Beginner · Project · Less Than 2 Hours

  • Status: Free Trial
    Free Trial
    C

    Coursera

    GenAI for Compensation: Smarter Pay Equity Analysis

    Skills you'll gain: Responsible AI, Google Gemini, Anthropic Claude, Human Resources, Artificial Intelligence, Human Resource Strategy, Forecasting, Mitigation

    Intermediate · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    G

    Google Cloud

    Introduction to Large Language Models - 한국어

    Skills you'll gain: Prompt Engineering, Large Language Modeling, Generative AI, Google Gemini, LLM Application

    Beginner · Course · 1 - 4 Weeks

  • P

    Packt

    Harnessing Open Source LLMs and ChatGPT with Minimal Code

    Skills you'll gain: ChatGPT, OpenAI API, Model Deployment, LLM Application, Tool Calling, Large Language Modeling, Prompt Engineering, Application Programming Interface (API), No-Code Development, Python Programming, Software Installation, Open Source Technology, Development Environment, Data Science

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    C

    Coursera

    OpenCL Programming

    Skills you'll gain: Distributed Computing, Scalability, Performance Tuning, C++ (Programming Language), System Programming, Computer Architecture, Cross Platform Development, Hardware Architecture, Application Development, Algorithms, C (Programming Language), Development Environment

    Beginner · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    B

    Board Infinity

    Mastering Cloud FinOps

    Skills you'll gain: Cloud Management, Dashboard, Resource Utilization, Financial Management, Cost Management, Anomaly Detection, Serverless Computing, Cloud Computing Architecture, Forecasting, Containerization, Change Management, Presentations, Generative AI, Case Studies

    4
    Rating, 4 out of 5 stars
    ·
    6 reviews

    Intermediate · Course · 1 - 4 Weeks

  • P

    Packt

    Vector Databases Deep Dive

    Skills you'll gain: NoSQL, Database Systems, Databases, Data Infrastructure, Data Storage Technologies, Scalability, Artificial Intelligence and Machine Learning (AI/ML), Algorithms

    Intermediate · Course · 1 - 3 Months

  • Status: Preview
    Preview
    U

    University of Glasgow

    Intellectual Autonomy

    Skills you'll gain: Critical Thinking, Analytical Skills, Research, Persuasive Communication, Ethical Standards And Conduct, Artificial Intelligence, Human Learning, Psychology, Learning Strategies

    Intermediate · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    U

    University of Glasgow

    6G Evolution: Blockchain, Semantic Communications & Radar

    Skills you'll gain: Federated Learning, Emerging Technologies, Generative AI, Internet Of Things, Software-Defined Networking, Digital Communications, Network Architecture, Zero Trust Network Access, Artificial Intelligence and Machine Learning (AI/ML), Distributed Computing, Artificial Intelligence, Information Technology, Health Technology, Electronics Engineering, Electrical Engineering, Machine Learning, Trustworthiness

    Intermediate · Course · 1 - 3 Months

  • Status: Preview
    Preview
    U

    Universidad de los Andes

    Visión artificial contemporánea

    Skills you'll gain: Computer Vision, Vision Transformer (ViT), Convolutional Neural Networks, Image Analysis, Generative AI, Deep Learning, Augmented and Virtual Reality (AR/VR), Artificial Intelligence and Machine Learning (AI/ML), Computer Graphics, Autoencoders, Unsupervised Learning, 3D Assets

    Build toward a degree

    4.8
    Rating, 4.8 out of 5 stars
    ·
    6 reviews

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free
    Free
    A

    Amazon Web Services

    Machine Learning Essentials for Business and Technical Decision Makers

    Skills you'll gain: Applied Machine Learning, Machine Learning, AI Enablement, MLOps (Machine Learning Operations), Technology Roadmaps, Data-Driven Decision-Making, Artificial Intelligence and Machine Learning (AI/ML), Business Analytics, Business Solutions, Organizational Strategy, AI Product Strategy, Organizational Change, Product Roadmaps, Feasibility Studies, System Requirements, Solution Design, Training Programs

    4.8
    Rating, 4.8 out of 5 stars
    ·
    11 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    C

    Coursera

    GenAI for Product Marketers: Data-Driven Campaigns

    Skills you'll gain: AI Personalization, Marketing Analytics, A/B Testing, Marketing Strategies, Marketing Effectiveness, Product Marketing, Campaign Management, Marketing, Advertising Campaigns, Performance Measurement, Generative AI, Target Audience, Google Ads, Predictive Analytics, AI Enablement, ChatGPT, Analytics, Automation

    Beginner · Course · 1 - 4 Weeks

Searches related to multimodal ai

build multimodal generative ai applications
multimodal and cross-modal ai integrations
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
analyze multimodal ai for business insights
multimodal rag with gpt – build smarter search & ai systems
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
1…230231232…295

In summary, here are 10 of our most popular multimodal ai courses

  • Build an application to send Chat Prompts using the Gemini model: Google Cloud
  • GenAI for Compensation: Smarter Pay Equity Analysis: Coursera
  • Introduction to Large Language Models - 한국어: Google Cloud
  • Harnessing Open Source LLMs and ChatGPT with Minimal Code: Packt
  • OpenCL Programming: Coursera
  • Mastering Cloud FinOps: Board Infinity
  • Vector Databases Deep Dive: Packt
  • Intellectual Autonomy: University of Glasgow
  • 6G Evolution: Blockchain, Semantic Communications & Radar: University of Glasgow
  • Visión artificial contemporánea: Universidad de los Andes

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Do Not Sell/Share
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2026 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok