Wenn Sie sich für diesen Kurs anmelden, werden Sie auch für dieses berufsbezogene Zertifikat angemeldet.
Lernen Sie neue Konzepte von Branchenexperten
Gewinnen Sie ein Grundverständnis bestimmter Themen oder Tools
Erwerben Sie berufsrelevante Kompetenzen durch praktische Projekte
Erwerben Sie ein Berufszertifikat von Microsoft zur Vorlage
In diesem Kurs gibt es 4 Module
Learn to build AI that sees, hears, and understands the world in an integrated way. This course takes you beyond single-modality models, teaching you to architect applications that connect different data types like text, images, and speech.
Starting with text-to-image generation, you will progress to integrating various AI components and orchestrating the full power of Azure AI Services to build sophisticated, cross-modal solutions. By the end, you'll be equipped to design the next generation of intelligent, multi-faceted AI applications.
This module introduces the foundational concepts of multimodal AI. You will learn the architectural patterns for combining different AI components, such as text and image models, and progress from basic integration to building complex systems that can reason across multiple data types.
Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025.
Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
Das ist alles enthalten
4 Videos9 Lektüren7 Aufgaben
Infos zu Modulinhalt anzeigen
4 Videos•Insgesamt 18 Minuten
Introduction to Microsoft Generative AI engineering certification•4 Minuten
Introduction to multimodal and cross-modal integrations course•3 Minuten
Understanding multimodal AI•5 Minuten
Advanced multimodal applications•5 Minuten
9 Lektüren•Insgesamt 95 Minuten
Course syllabus and recommended background•5 Minuten
Components of multimodal AI setup•15 Minuten
Visualizing a multimodal workflow•15 Minuten
Architectural choices in multimodal AI: Single model vs. chained pipelines•10 Minuten
Analyzing your first multimodal integration•10 Minuten
Advanced integration strategies and use cases•10 Minuten
Insights on advanced multimodal AI•10 Minuten
Case study: Designing a multimodal product search•10 Minuten
Module 1 summary: From architectural theory to practical integration•10 Minuten
7 Aufgaben•Insgesamt 195 Minuten
First steps with a true multimodal model•15 Minuten
Building your first multimodal pipeline•30 Minuten
Multimodal integration: Practice Quiz•30 Minuten
Building a multimodal system•30 Minuten
Architecting a complex multimodal solution•30 Minuten
Advanced multimodal skills: Practice Quiz•30 Minuten
Module 1 evaluation: Graded Quiz•30 Minuten
Text-to-image generation
Modul 2•4 Stunden abzuschließen
Moduldetails
This module provides a deep dive into the popular and creative task of generating images from text descriptions. You will explore the models that power this technology, like DALL·E, and learn both basic and advanced prompting techniques to craft and refine specific, high-quality visual outputs.
Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025.
Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
Das ist alles enthalten
5 Videos5 Lektüren5 Aufgaben
Infos zu Modulinhalt anzeigen
5 Videos•Insgesamt 19 Minuten
Module 2 introduction: From words to worlds with text-to-image models•6 Minuten
From text to image in practice•4 Minuten
Text-to-image model comparisons•3 Minuten
Mastering text-to-image control•3 Minuten
Module 2 summary: From architecture to artistic control•3 Minuten
5 Lektüren•Insgesamt 50 Minuten
Exploration of text-to-image practices•10 Minuten
Insights from text-to-image applications•10 Minuten
Advanced text-to-image techniques•10 Minuten
Advanced text-to-image insights•10 Minuten
Case study: A creative workflow for a marketing campaign•10 Minuten
5 Aufgaben•Insgesamt 180 Minuten
Generating and refining images with text-to-image prompts•30 Minuten
Solving text-to-image challenges: Practice Quiz•30 Minuten
Module 2 evaluation: Graded Quiz•30 Minuten
Cross-modal applications with Azure AI vision
Modul 3•6 Stunden abzuschließen
Moduldetails
This module focuses on practical implementation using a powerful, specialized tool. You will leverage the features of Azure AI Vision to build and optimize cross-modal applications like image captioning and visual search. You'll learn how this single service can analyze visual content to generate rich textual descriptions and extract embedded text (OCR), providing the core components for sophisticated multimodal solutions.
Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025.
Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
Das ist alles enthalten
7 Videos6 Lektüren7 Aufgaben
Infos zu Modulinhalt anzeigen
7 Videos•Insgesamt 28 Minuten
An overview of the Azure AI services toolkit•5 Minuten
Module 3 introduction: The multiple applications of Azure AI Vision•3 Minuten
Bringing sight to your applications with Azure AI Vision•5 Minuten
Getting started with Azure AI Vision•4 Minuten
Exploring cross-modal features in Vision Studio•4 Minuten
Refining cross-modal applications•6 Minuten
Module 3 summary: From a single feature to a complete vision solution•2 Minuten
6 Lektüren•Insgesamt 60 Minuten
Prototyping vs. production: The role of Vision Studio•10 Minuten
Cross-modal AI implementation insights•10 Minuten
Interpreting OCR results with the SDK•10 Minuten
Advanced strategies for cross-modal AI•10 Minuten
Optimizing multimodal workflows•10 Minuten
Case study: Building an automated inventory checker•10 Minuten
7 Aufgaben•Insgesamt 255 Minuten
Exploring cross-modal techniques•30 Minuten
Extract text from images•60 Minuten
Cross-modal techniques quiz: Practice Quiz•30 Minuten
Chaining vision skills with the Python SDK•30 Minuten
Extending a multimodal application•45 Minuten
Advanced cross-modal skills: Practice Quiz•30 Minuten
Module 3 evaluation: Graded Quiz•30 Minuten
Advanced AI integration with Azure services
Modul 4•5 Stunden abzuschließen
Moduldetails
This capstone module builds upon your deep expertise in Azure AI Vision. You will learn to integrate your vision applications with other powerful Azure AI Services, such as Language and Speech, to create comprehensive, end-to-end solutions. The focus will be on orchestrating these distinct services to develop a sophisticated application that solves a real-world business problem, demonstrating your ability to design and build a complete multimodal system from the ground up.
Important Notice on the Azure Interface: The screencast videos and screenshots were last updated in late 2025.
Please be aware that Microsoft may have updated the Azure interface since then. If the steps shown in the course materials look different from your current Azure environment, please follow the most up-to-date interface, as the underlying concepts and learning objectives remain the same.
Das ist alles enthalten
6 Videos5 Lektüren5 Aufgaben
Infos zu Modulinhalt anzeigen
6 Videos•Insgesamt 26 Minuten
Module 4 introduction: Building an end-to-end solution•3 Minuten
Orchestrating Azure AI services: A demonstration•6 Minuten
Setting up your environment for integration•6 Minuten
Demonstrating text-to-speech with the SDK•6 Minuten
Module 4 summary: Orchestrating a full AI solution•2 Minuten
Course summary•3 Minuten
5 Lektüren•Insgesamt 60 Minuten
Integrating Azure AI services•15 Minuten
Managing multimodal workflows•10 Minuten
Adding speech to your application•15 Minuten
Analyzing your end-to-end application•10 Minuten
Production considerations for multimodal apps•10 Minuten
5 Aufgaben•Insgesamt 210 Minuten
Integrating Vision with the language service•60 Minuten
Designing multimodal workflows: Practice Quiz•30 Minuten
Building an end-to-end multimodal application•60 Minuten
Analyzing multimodal solutions: Practice Quiz•30 Minuten
Module 4 evaluation: Graded Quiz•30 Minuten
Erwerben Sie ein Karrierezertifikat.
Fügen Sie dieses Zeugnis Ihrem LinkedIn-Profil, Lebenslauf oder CV hinzu. Teilen Sie sie in Social Media und in Ihrer Leistungsbeurteilung.
Our goal at Microsoft is to empower every individual and organization on the planet to achieve more.
In this next revolution of digital transformation, growth is being driven by technology. Our integrated cloud approach creates an unmatched platform for digital transformation. We address the real-world needs of customers by seamlessly integrating Microsoft 365, Dynamics 365, LinkedIn, GitHub, Microsoft Power Platform, and Azure to unlock business value for every organization—from large enterprises to family-run businesses. The backbone and foundation of this is Azure.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.