Do I need any specific software or tools to complete the course successfully?

Only a modern web browser is required to complete this course and all hands-on labs. You will be provided access to cloud-based environments to complete the labs at no charge.

What will I get if I subscribe to this Certificate?

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Generative AI Language Modeling with Transformers

This course is part of multiple programs.

Instructor: Joseph Santarcangelo

29,379 already enrolled

2 modules

Gain insight into a topic and learn the fundamentals.

150 reviews

Intermediate level

Recommended experience

Flexible schedule

9 hours to complete

Learn at your own pace

90%

Most learners liked this course

What you'll learn

Explain the role of attention mechanisms in transformer models for capturing contextual relationships in text
Describe the differences in language modeling approaches between decoder-based models like GPT and encoder-based models like BERT
Implement key components of transformer models, including positional encoding, attention mechanisms, and masking, using PyTorch
Apply transformer-based models for real-world NLP tasks, such as text classification and language translation, using PyTorch and Hugging Face tools

Skills you'll gain

Model Training
Large Language Modeling
Applied Machine Learning
Natural Language Processing
Model Optimization
Embeddings
Transfer Learning
Generative Model Architectures
Data Preprocessing

Tools you'll learn

Generative AI
PyTorch (Machine Learning Library)

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

6 assignments

Taught in English

Build your subject-matter expertise

This course is available as part of

When you enroll in this course, you must also select a specific program.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 2 modules in this course

This course provides a practical introduction to using transformer-based models for natural language processing (NLP) applications. You will learn to build and train models for text classification using encoder-based architectures like Bidirectional Encoder Representations from Transformers (BERT), and explore core concepts such as positional encoding, word embeddings, and attention mechanisms.

The course covers multi-head attention, self-attention, and causal language modeling with GPT for tasks like text generation and translation. You will gain hands-on experience implementing transformer models in PyTorch, including pretraining strategies such as masked language modeling (MLM) and next sentence prediction (NSP). Through guided labs, you’ll apply encoder and decoder models to real-world scenarios. This course is designed for learners interested in generative AI engineering and requires prior knowledge of Python, PyTorch, and machine learning. Enroll now to build your skills in NLP with transformers!
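As a rough illustration of the self-attention and causal modeling ideas named above, here is a minimal sketch of scaled dot-product attention in plain Python. It is not taken from the course labs, which use PyTorch; the function names and the list-of-lists representation are assumptions chosen so the example runs without any dependencies.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_dot_product_attention(Q, K, V, causal=False):
    """Attention over lists of row vectors (hypothetical helper, not course code).

    Each output row is a weighted average of the rows of V, with weights
    softmax(q . k / sqrt(d_k)). With causal=True, position i may only
    attend to positions j <= i, as in GPT-style decoders.
    """
    d_k = len(K[0])
    out = []
    for i, q in enumerate(Q):
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k) for k in K]
        if causal:
            # Future positions get -inf so their softmax weight is exactly 0.
            scores = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
        weights = softmax(scores)
        out.append([sum(w * v[c] for w, v in zip(weights, V))
                    for c in range(len(V[0]))])
    return out

# Self-attention: queries, keys, and values all come from the same sequence.
x = [[1.0, 0.0], [0.0, 1.0]]
full = scaled_dot_product_attention(x, x, x)
masked = scaled_dot_product_attention(x, x, x, causal=True)
```

With the causal flag set, the first position can only see itself, so its output is simply the first value vector. Multi-head attention runs several such attention computations in parallel on different learned projections of the input and concatenates the results.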

In this module, you will learn how transformers process sequential data using positional encoding and attention mechanisms. You will explore how to implement positional encoding in PyTorch and understand how attention helps models focus on relevant parts of input sequences. You'll dive deeper into self-attention and scaled dot-product attention with multiple heads to see how they contribute to language modeling tasks. The module also explains how the transformer architecture leverages these mechanisms efficiently. Through hands-on labs, you’ll implement these concepts and build transformer encoder layers in PyTorch. Finally, you'll apply transformer models for text classification, including building a data pipeline, defining the model, and training it, while also exploring techniques to optimize transformer training performance.
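To make the positional encoding idea concrete, here is a small sketch in plain Python. It assumes the sinusoidal scheme from the original Transformer paper (sine on even dimensions, cosine on odd dimensions, base 10000); the module description does not say which variant the labs implement, and the function name is hypothetical.

```python
import math

def positional_encoding(seq_len, d_model):
    """Return a seq_len x d_model table of sinusoidal position encodings.

    Assumed scheme (not confirmed by the course page):
      PE[pos][2k]   = sin(pos / 10000^(2k / d_model))
      PE[pos][2k+1] = cos(pos / 10000^(2k / d_model))
    """
    table = []
    for pos in range(seq_len):
        row = []
        for i in range(d_model):
            # Dimensions 2k and 2k+1 share one frequency; i - i % 2 equals 2k.
            angle = pos / (10000 ** ((i - i % 2) / d_model))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table

# Each row is added to the token embedding at that position, which gives
# an otherwise order-blind attention layer access to word order.
pe = positional_encoding(seq_len=4, d_model=6)
```

Position 0 always encodes to alternating 0 and 1 values, because sin(0) = 0 and cos(0) = 1. In PyTorch the same table is usually built once as a tensor and stored as a non-trainable buffer on the module.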

Included

6 videos, 4 readings, 2 assignments, 2 app items, 2 plugins

6 videos (total 40 minutes)

Course Introduction (3 minutes)
Positional Encoding (7 minutes)
Attention Mechanism (7 minutes)
Self-attention Mechanism (7 minutes)
From Attention to Transformers (7 minutes)
Transformers for Classification: Encoder (9 minutes)

4 readings (total 17 minutes)

Course Overview (5 minutes)
Specialization Overview (7 minutes)
Optimization Techniques for Efficient Transformer Training (3 minutes)
Summary and Highlights (2 minutes)

2 assignments (total 45 minutes)

Graded Quiz: Fundamental Concepts of Transformer Architecture (30 minutes)
Practice Quiz: Positional Encoding, Attention, and Application in Classification (15 minutes)

2 app items (total 105 minutes)

Hands-on Lab: Attention Mechanism and Positional Encoding (45 minutes)
Hands-on Lab: Applying Transformers for Classification (60 minutes)

2 plugins (total 7 minutes)

Helpful Tips for Course Completion (2 minutes)
Reading: Beginner's Guide to Transformer Model Fundamentals (5 minutes)

In this module, you will learn how decoder-based models like GPT are trained using causal language modeling and implemented in PyTorch for both training and inference. You will explore encoder-based models, such as Bidirectional Encoder Representations from Transformers (BERT), and understand their pretraining strategies using masked language modeling (MLM) and next sentence prediction (NSP), along with data preparation techniques in PyTorch. You will also examine how transformer architectures are applied to machine translation, including their implementation using PyTorch. Through hands-on labs, you will gain practical experience with decoder models, encoder models, and translation tasks. The module concludes with a cheat sheet, glossary, and summary to help consolidate your understanding of key concepts.
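The two pretraining objectives mentioned above differ mainly in how training targets are built from raw text. The sketch below contrasts them in plain Python. It is an illustration only: the helper names are hypothetical, the 15% masking rate follows the BERT paper, and the masking is simplified (real BERT pretraining also sometimes replaces a selected token with a random token or leaves it unchanged).

```python
import random

def causal_lm_pair(tokens):
    """Decoder-style (GPT-like) objective: predict token t+1 from tokens up to t.

    The input is the sequence without its last token; the target is the
    same sequence shifted left by one position.
    """
    return tokens[:-1], tokens[1:]

def mlm_example(tokens, mask_prob=0.15, mask_token="[MASK]", seed=0):
    """Encoder-style (BERT-like) objective: hide some tokens, predict only those.

    Simplified sketch: every selected token becomes mask_token.
    """
    rng = random.Random(seed)  # seeded so the example is reproducible
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            inputs.append(mask_token)
            labels.append(tok)    # loss is computed at this position
        else:
            inputs.append(tok)
            labels.append(None)   # position is ignored by the loss
    return inputs, labels

tokens = ["the", "cat", "sat", "down"]
x, y = causal_lm_pair(tokens)
# x is ["the", "cat", "sat"] and y is ["cat", "sat", "down"]
masked_inputs, masked_labels = mlm_example(tokens)
```

In the causal case every position contributes to the loss and the model sees only the left context; in the masked case only the hidden positions contribute, but the model sees context on both sides.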

Included

10 videos, 6 readings, 4 assignments, 4 app items, 3 plugins

10 videos (total 67 minutes)

Language Modeling with the Decoders and GPT-like Models (7 minutes)
Training Decoder Models (7 minutes)
Decoder Models- PyTorch Implementation-Causal LM (6 minutes)
Decoder Models: PyTorch Implementation Using Training and Inference (5 minutes)
Encoder Models with BERT: Pretraining Using MLM (6 minutes)
Encoder Models with BERT: Pretraining Using NSP (6 minutes)
Data Preparation for BERT with PyTorch (9 minutes)
Pretraining BERT Models with PyTorch (8 minutes)
Transformer Architecture for Language Translation (5 minutes)
Transformer Architecture for Translation: PyTorch Implementation (8 minutes)

6 readings (total 9 minutes)

Summary and Highlights (1 minute)
Summary and Highlights (1 minute)
Summary and Highlights (1 minute)
Course Conclusion (2 minutes)
Thanks from the Course team (2 minutes)
Congratulations and Next Steps (2 minutes)

4 assignments (total 63 minutes)

Graded Quiz: Advanced Concepts of Transformer Architecture (30 minutes)
Practice Quiz: Decoder Models (12 minutes)
Practice Quiz: Encoder Models (12 minutes)
Practice Quiz: Application of Transformers for Translation (9 minutes)

4 app items (total 180 minutes)

Hands-on Lab: Decoder GPT-like Models (45 minutes)
Hands-on Lab: Pretraining BERT Models (60 minutes)
Hands-on Lab: Data Preparation for BERT (45 minutes)
Lab: Transformers for Translation (30 minutes)

3 plugins (total 25 minutes)

Reading: Getting Started with Advanced Concepts of Transformer Models (7 minutes)
Cheat Sheet: Language Modeling with Transformers (15 minutes)
Course Glossary: Language Modeling with Transformers (3 minutes)

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings

(32 ratings)

Joseph Santarcangelo

IBM

37 courses, 2,501,352 learners

Offered by

IBM

Learner reviews

5 stars: 74.17%
4 stars: 13.24%
3 stars: 4.63%
2 stars: 1.98%
1 star: 5.96%

Showing 3 of 150

Reviewed on Dec 29, 2024

This course gives me a wide picture of what transformers can be.

Reviewed on Nov 4, 2025

Excellent course to understand about AI/ML/GenAI. The videos are not very detailed and just the right amount to skim through the details.

Reviewed on Sep 1, 2025

I loved this course. It is very informative and has a lot of examples. It will take some time to master all this information.

Frequently asked questions

It will take only two weeks to complete this course if you spend 3–5 hours of study time per week.

Basic knowledge of Python and familiarity with machine learning and neural network concepts are recommended. Familiarity with text preprocessing steps and with N-gram, Word2Vec, and sequence-to-sequence models is beneficial, and knowledge of evaluation metrics such as bilingual evaluation understudy (BLEU) is an advantage.

This course is part of the Generative AI Engineering Essentials with LLMs PC specialization. Completing the specialization gives you the skills and confidence to take on jobs such as AI Engineer, NLP Engineer, Machine Learning Engineer, Deep Learning Engineer, and Data Scientist.

To access the course materials and assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. It also means that you will not be able to purchase a Certificate experience.