Results for "multimodal"


  • Codio

    Multimodal Generative AI: Vision, Speech, and Assistants

    Skills you'll gain: OpenAI API, OpenAI, Image Analysis, Generative AI, ChatGPT, LLM Application, Multimodal Prompts, Tool Calling, Application Programming Interface (API), Large Language Modeling, Artificial Intelligence, AI Integrations, Natural Language Processing, Computer Vision, File Management

    ★ 3.9 (7) · Beginner · Course · 1 - 4 Weeks

    Status: Free Trial
    Category: Credit offered
  • DeepLearning.AI

    Building Multimodal Search and RAG

    Skills you'll gain: Retrieval-Augmented Generation, Multimodal Prompts, LLM Application, Embeddings, Large Language Modeling, Generative AI, Vector Databases, Image Analysis, Applied Machine Learning

    ★ 4.5 (43) · Intermediate · Project · Less Than 2 Hours

    Category: Free
    Category: Credit offered
  • IBM

    Build Multimodal Generative AI Applications

    Skills you'll gain: Multimodal Prompts, LLM Application, OpenAI API, AI powered creativity, Embeddings, AI Integrations, Large Language Modeling, Decision Intelligence, Retrieval-Augmented Generation, Prompt Engineering, Flask (Web Framework), Application Deployment, Web Development, Software Development

    ★ 4.7 (53) · Intermediate · Course · 1 - 4 Weeks

    Status: Free Trial
    Category: Credit offered
  • IBM

    IBM RAG and Agentic AI

    Skills you'll gain: Prompt Engineering, AI Orchestration, AI Workflows, LangChain, Retrieval-Augmented Generation, Agentic Workflows, Tool Calling, LangGraph, LLM Application, Prompt Patterns, Agentic systems, Multimodal Prompts, Model Context Protocol, Generative AI, AI Security, Generative AI Agents, Vector Databases, OpenAI API, AI Integrations, Software Development

    ★ 4.6 (897) · Advanced · Professional Certificate · 3 - 6 Months

    Status: Free Trial
    Category: Credit offered
  • Coursera

    End-to-End Multimodal AI: Fine-Tuning, Fusion, and MLOps

    Skills you'll gain: API Design, MLOps (Machine Learning Operations), Restful API, Fine-tuning, OAuth, Model Deployment, Technical Communication, Model Training, Model Evaluation, Transfer Learning, Vision Transformer (ViT), Model Optimization, AI Workflows, Artificial Intelligence and Machine Learning (AI/ML), Machine Learning Software, Solution Architecture, Machine Learning, Data Architecture, Machine Learning Algorithms, Data Science

    Intermediate · Course · 3 - 6 Months

    Category: New
    Status: Free Trial
    Category: Credit offered
  • University of Colorado Boulder

    Modern AI Models for Vision and Multimodal Understanding

    Skills you'll gain: Vision Transformer (ViT), Recurrent Neural Networks (RNNs), Generative Model Architectures, Embeddings, Digital Signal Processing, Transfer Learning, Machine Learning Methods, Classification Algorithms

    ★ 4.4 (29) · Advanced · Course · 1 - 4 Weeks

    Status: Free Trial
    Category: Build toward a degree

  • Packt

    Multimodal RAG with GPT – Build Smarter Search & AI Systems

    Skills you'll gain: Retrieval-Augmented Generation, Development Environment, Multimodal Prompts, Embeddings, User Interface (UI), OpenAI API, Generative AI, LLM Application, Program Development, Software Development Tools, Prompt Engineering, UI Components, Image Analysis, AI Workflows, Large Language Modeling, AI Integrations, Artificial Intelligence, Vector Databases, Applied Machine Learning, Data Processing

    Intermediate · Course · 1 - 3 Months

    Status: Free Trial
    Category: Credit offered
  • Coursera

    Fine-tune Multimodal Models with Transfer Learning

    Skills you'll gain: Model Optimization, Fine-tuning, Multimodal Prompts, Generative Model Architectures, PyTorch (Machine Learning Library), Model Training, Data Processing, Tensorflow, Knowledge Transfer, Keras (Neural Network Library), Deep Learning, Artificial Neural Networks

    Intermediate · Course · 1 - 4 Weeks

    Category: New
    Status: Free Trial
    Category: Credit offered
  • Packt

    Retrieval Augmented Generation

    Skills you'll gain: Retrieval-Augmented Generation, Large Language Modeling, LLM Application, Development Environment, Prompt Patterns, Multimodal Prompts, Tool Calling, Embeddings, Generative AI Agents, Vector Databases, User Interface (UI), Prompt Engineering Tools, OpenAI API, Generative AI, AI Workflows, Program Development, Agentic systems, Plot (Graphics), Software Development Tools, Augmented Reality

    ★ 4.3 (17) · Intermediate · Specialization · 3 - 6 Months

    Status: Free Trial
    Category: Credit offered
  • DeepLearning.AI

    Introducing Multimodal Llama 3.2

    Skills you'll gain: Tool Calling, LLM Application, Multimodal Prompts, Prompt Patterns, Prompt Engineering, Token Optimization, Large Language Modeling

    ★ 4.5 (13) · Beginner · Project · Less Than 2 Hours

    Category: Free
    Category: Credit offered
  • University of Illinois Urbana-Champaign

    Multimodal Literacies: Communication and Learning in the Era of Digital Media

    Skills you'll gain: Differentiated Instruction, Teaching, Instructional Strategies, Oral Comprehension, Digital pedagogy, Literacy, Oral Expression, Pedagogy, digital literacy, Learning Styles, Media and Communications, Multimedia, Higher Education, Cultural Diversity, Language Learning, Writing

    ★ 4.7 (180) · Mixed · Course · 1 - 4 Weeks

    Category: Preview
    Category: Credit offered
  • Google Cloud

    Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API

    Skills you'll gain: Gemini, Google Gemini, Multimodal Prompts, Retrieval-Augmented Generation, Data Store, Embeddings, Metadata Management, Image Analysis, Large Language Modeling, Prompt Engineering, Cloud Computing, Artificial Intelligence

    ★ 3.9 (10) · Intermediate · Project · Less Than 2 Hours

    Category: Credit offered

In summary, here are 10 of our most popular multimodal courses:

  • Multimodal Generative AI: Vision, Speech, and Assistants: Codio
  • Building Multimodal Search and RAG: DeepLearning.AI
  • Build Multimodal Generative AI Applications: IBM
  • IBM RAG and Agentic AI: IBM
  • End-to-End Multimodal AI: Fine-Tuning, Fusion, and MLOps: Coursera
  • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
  • Multimodal RAG with GPT – Build Smarter Search & AI Systems: Packt
  • Fine-tune Multimodal Models with Transfer Learning: Coursera
  • Retrieval Augmented Generation: Packt
  • Introducing Multimodal Llama 3.2: DeepLearning.AI

Other topics to explore

  • Arts and Humanities: 338 courses
  • Business: 1095 courses
  • Computer Science: 668 courses
  • Data Science: 425 courses
  • Information Technology: 145 courses
  • Health: 471 courses
  • Math and Logic: 70 courses
  • Personal Development: 137 courses
  • Physical Science and Engineering: 413 courses
  • Social Sciences: 401 courses
  • Language Learning: 150 courses

© 2026 Coursera Inc. All rights reserved.