
Results for "multimodal"


  • Status: Free Trial

    Codio

    Multimodal Generative AI: Vision, Speech, and Assistants

    Skills you'll gain: OpenAI API, OpenAI, Image Analysis, Generative AI, ChatGPT, LLM Application, Multimodal Prompts, Tool Calling, Application Programming Interface (API), Large Language Modeling, Artificial Intelligence, AI Integrations, Natural Language Processing, Computer Vision, File Management

    Rating: 3.9 out of 5 stars · 7 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: Free

    DeepLearning.AI

    Building Multimodal Search and RAG

    Skills you'll gain: Retrieval-Augmented Generation, Multimodal Prompts, LLM Application, Embeddings, Large Language Modeling, Generative AI, Vector Databases, Image Analysis, Applied Machine Learning

    Rating: 4.5 out of 5 stars · 43 reviews

    Intermediate · Project · Less Than 2 Hours

  • Status: Free Trial

    IBM

    Build Multimodal Generative AI Applications

    Skills you'll gain: Multimodal Prompts, LLM Application, OpenAI API, AI powered creativity, Embeddings, AI Integrations, Large Language Modeling, Decision Intelligence, Retrieval-Augmented Generation, Prompt Engineering, Flask (Web Framework), Application Deployment, Web Development, Software Development

    Rating: 4.7 out of 5 stars · 53 reviews

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial

    IBM

    IBM RAG and Agentic AI

    Skills you'll gain: Prompt Engineering, AI Orchestration, AI Workflows, LangChain, Retrieval-Augmented Generation, Agentic Workflows, Tool Calling, LangGraph, LLM Application, Prompt Patterns, Agentic systems, Multimodal Prompts, Model Context Protocol, Generative AI, AI Security, Generative AI Agents, Vector Databases, OpenAI API, AI Integrations, Software Development

    Rating: 4.6 out of 5 stars · 897 reviews

    Advanced · Professional Certificate · 3 - 6 Months

  • Status: New · Free Trial

    Coursera

    End-to-End Multimodal AI: Fine-Tuning, Fusion, and MLOps

    Skills you'll gain: API Design, MLOps (Machine Learning Operations), Restful API, Fine-tuning, OAuth, Model Deployment, Technical Communication, Model Training, Model Evaluation, Transfer Learning, Vision Transformer (ViT), Model Optimization, AI Workflows, Artificial Intelligence and Machine Learning (AI/ML), Machine Learning Software, Solution Architecture, Machine Learning, Data Architecture, Machine Learning Algorithms, Data Science

    Intermediate · Course · 3 - 6 Months

  • Status: Free Trial

    University of Colorado Boulder

    Modern AI Models for Vision and Multimodal Understanding

    Skills you'll gain: Vision Transformer (ViT), Recurrent Neural Networks (RNNs), Generative Model Architectures, Embeddings, Digital Signal Processing, Transfer Learning, Machine Learning Methods, Classification Algorithms

    Build toward a degree

    Rating: 4.4 out of 5 stars · 29 reviews

    Advanced · Course · 1 - 4 Weeks

  • Status: Free Trial

    Packt

    Multimodal RAG with GPT – Build Smarter Search & AI Systems

    Skills you'll gain: Retrieval-Augmented Generation, Development Environment, Multimodal Prompts, Embeddings, User Interface (UI), OpenAI API, Generative AI, LLM Application, Program Development, Software Development Tools, Prompt Engineering, UI Components, Image Analysis, AI Workflows, Large Language Modeling, AI Integrations, Artificial Intelligence, Vector Databases, Applied Machine Learning, Data Processing

    Intermediate · Course · 1 - 3 Months

  • Status: New · Free Trial

    Coursera

    Fine-tune Multimodal Models with Transfer Learning

    Skills you'll gain: Model Optimization, Fine-tuning, Multimodal Prompts, Generative Model Architectures, PyTorch (Machine Learning Library), Model Training, Data Processing, Tensorflow, Knowledge Transfer, Keras (Neural Network Library), Deep Learning, Artificial Neural Networks

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial

    Packt

    Retrieval Augmented Generation

    Skills you'll gain: Retrieval-Augmented Generation, Large Language Modeling, LLM Application, Development Environment, Prompt Patterns, Multimodal Prompts, Tool Calling, Embeddings, Generative AI Agents, Vector Databases, User Interface (UI), Prompt Engineering Tools, OpenAI API, Generative AI, AI Workflows, Program Development, Agentic systems, Plot (Graphics), Software Development Tools, Augmented Reality

    Rating: 4.3 out of 5 stars · 17 reviews

    Intermediate · Specialization · 3 - 6 Months

  • Status: Free

    DeepLearning.AI

    Introducing Multimodal Llama 3.2

    Skills you'll gain: Tool Calling, LLM Application, Multimodal Prompts, Prompt Patterns, Prompt Engineering, Token Optimization, Large Language Modeling

    Rating: 4.5 out of 5 stars · 13 reviews

    Beginner · Project · Less Than 2 Hours

  • Status: Preview

    University of Illinois Urbana-Champaign

    Multimodal Literacies: Communication and Learning in the Era of Digital Media

    Skills you'll gain: Differentiated Instruction, Teaching, Instructional Strategies, Oral Comprehension, Digital pedagogy, Literacy, Oral Expression, Pedagogy, digital literacy, Learning Styles, Media and Communications, Multimedia, Higher Education, Cultural Diversity, Language Learning, Writing

    Rating: 4.7 out of 5 stars · 180 reviews

    Mixed · Course · 1 - 4 Weeks

  • Google Cloud

    Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API

    Skills you'll gain: Gemini, Google Gemini, Multimodal Prompts, Retrieval-Augmented Generation, Data Store, Embeddings, Metadata Management, Image Analysis, Large Language Modeling, Prompt Engineering, Cloud Computing, Artificial Intelligence

    Rating: 3.9 out of 5 stars · 10 reviews

    Intermediate · Project · Less Than 2 Hours


In summary, here are 10 of our most popular multimodal courses

  • Multimodal Generative AI: Vision, Speech, and Assistants: Codio
  • Building Multimodal Search and RAG: DeepLearning.AI
  • Build Multimodal Generative AI Applications: IBM
  • IBM RAG and Agentic AI: IBM
  • End-to-End Multimodal AI: Fine-Tuning, Fusion, and MLOps: Coursera
  • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
  • Multimodal RAG with GPT – Build Smarter Search & AI Systems: Packt
  • Fine-tune Multimodal Models with Transfer Learning: Coursera
  • Retrieval Augmented Generation: Packt
  • Introducing Multimodal Llama 3.2: DeepLearning.AI

© 2026 Coursera Inc. All rights reserved.