This course introduces you to the Transformer architecture and the Bidirectional Encoder Representations from Transformers (BERT) model. You learn about the main components of the Transformer architecture, such as the self-attention mechanism, and how it is used to build the BERT model. You also learn about the different tasks that BERT can be used for, such as text classification, question answering, and natural language inference. This course is estimated to take approximately 45 minutes to complete.



Transformer Models and BERT Model

Instructor: Google Cloud Training
Access provided by University of Colorado
11,706 already enrolled
(121 reviews)
What you'll learn
- Understand the main components of the Transformer architecture. 
- Learn how a BERT model is built using Transformers. 
- Use BERT to solve different natural language processing (NLP) tasks. 
Skills you'll gain
Details to know

Add to your LinkedIn profile
1 assignment
See how employees at top companies are mastering in-demand skills

There is 1 module in this course
In this module you will learn about the main components of the Transformer architecture, such as the self-attention mechanism, and how it is used to build the BERT model. You also learn about the different tasks that BERT can be used for, such as text classification, question answering, and natural language inference.
What's included
2 videos1 reading1 assignment
Instructor

Offered by
Why people choose Coursera for their career




Learner reviews
121 reviews
- 5 stars57.02% 
- 4 stars20.66% 
- 3 stars7.43% 
- 2 stars5.78% 
- 1 star9.09% 
Showing 3 of 121
Reviewed on Mar 12, 2024
Excellent and concise presentation of Transformer and BERT models. The course designer may consider adding programming assignments to illustrate the concepts and to reinforce student learning.
Reviewed on May 13, 2025
I am looking for free resources for experimentation, or a lighter model that can run on my laptop.
Reviewed on Jun 24, 2023
very clear and detailed explanation of the transformers with practical example of training BERT model





