Ready to level up your GenAI skills? Step into the exciting world of multimodal AI, where language, images, and speech come together to build smarter, more interactive applications.

Build Multimodal Generative AI Applications
This course is part of the IBM RAG and Agentic AI Professional Certificate.


Instructors: Hailey Quach +1 more
11,527 already enrolled
58 reviews
What you'll learn
Gain the job-ready skills you need to build multimodal generative AI applications in just 3 weeks
Understand the fundamental concepts and challenges in multimodal AI, including the integration of text, speech, images, and video
Build multimodal AI applications using state-of-the-art models and frameworks such as IBM’s Granite, Meta’s Llama, OpenAI’s Whisper, DALL·E and Sora
Develop multimodal AI solutions, including chatbots and image/video generation models, using IBM watsonx.ai, Hugging Face, Flask and Gradio
Skills you'll gain
- Multimodal Prompts
- Retrieval-Augmented Generation
- Software Development
- LLM Application
- Web Development
- AI-Powered Creativity
- Large Language Modeling
- Decision Intelligence
- AI Integrations
- Embeddings
- Application Deployment
Tools you'll learn
- OpenAI API
- Prompt Engineering
- Flask (Web Framework)
Details to know

6 assignments
Build your Software Development expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from IBM

There are 3 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Learner reviews
- 5 stars: 81.03%
- 4 stars: 10.34%
- 3 stars: 5.17%
- 2 stars: 0%
- 1 star: 3.44%
Reviewed on Oct 26, 2025
Wow, it was a next-level experience to learn multimodal GenAI development. Truly amazing.
