Wenn Sie sich für diesen Kurs anmelden, müssen Sie auch ein bestimmtes Programm auswählen.
Lernen Sie neue Konzepte von Branchenexperten
Gewinnen Sie ein Grundverständnis bestimmter Themen oder Tools
Erwerben Sie berufsrelevante Kompetenzen durch praktische Projekte
Erwerben Sie ein Berufszertifikat zur Vorlage
In diesem Kurs gibt es 2 Module
Transform how AI systems understand and connect different data modalities. This course empowers machine learning professionals to build cutting-edge cross-modal retrieval systems that bridge the gap between text and images. You'll master the technical implementation of approximate nearest-neighbor search algorithms and design sophisticated attention mechanisms that fuse visual and textual information. Through hands-on work with production-scale tools like FAISS and real datasets like Flickr30K, you'll develop the expertise to create intelligent systems that understand content across modalities—enabling breakthrough applications in search, recommendation, and content understanding that mirror how humans naturally process diverse information types.
Learners will build foundational understanding of cross-modal retrieval systems and implement approximate nearest-neighbor search algorithms using FAISS for production-scale similarity search across multimodal embeddings.
Das ist alles enthalten
1 Video2 LektĂĽren1 Aufgabe1 Unbewertetes Labor
Infos zu Modulinhalt anzeigen
1 Video•Insgesamt 7 Minuten
Fundamentals of Cross-Modal Retrieval Systems•7 Minuten
2 Lektüren•Insgesamt 18 Minuten
FAISS Architecture and Index Types for Production Systems•10 Minuten
Implementing FAISS Indexing for Cross-Modal Search•8 Minuten
1 Aufgabe•Insgesamt 3 Minuten
Cross-Modal Retrieval and FAISS Implementation Assessment•3 Minuten
1 Unbewertetes Labor•Insgesamt 15 Minuten
Building Production-Scale Cross-Modal Retrieval with FAISS•15 Minuten
Learners will design and implement sophisticated attention-based fusion algorithms that intelligently combine visual and textual embeddings, mastering the creation of multimodal neural architectures for advanced cross-modal AI applications.
Das ist alles enthalten
2 LektĂĽren3 Aufgaben
Infos zu Modulinhalt anzeigen
2 Lektüren•Insgesamt 18 Minuten
Architecture and Mathematics of Attention-Based Multimodal Fusion•10 Minuten
Coursera brings together a diverse network of subject matter experts who have demonstrated their expertise through professional industry experience or strong academic backgrounds. These instructors design and teach courses that make practical, career-relevant skills accessible to learners worldwide.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
Finanzielle UnterstĂĽtzung verfĂĽgbar, weitere Informationen
Âą Einige Aufgaben in diesem Kurs werden mit AI bewertet. FĂĽr diese Aufgaben werden Ihre Daten in Ăśbereinstimmung mit Datenschutzhinweis von Courseraverwendet.