This course introduces the Transformer architecture and the Bidirectional Encoder Representations from Transformers (BERT) model. You learn about the main components of the Transformer architecture, such as the self-attention mechanism, and how they are used to build the BERT model. You also learn about the tasks that BERT can be used for, such as text classification, question answering, and natural language inference. The course takes approximately 45 minutes to complete.



Transformer Models and BERT Model

Instructor: Google Cloud Training
11,735 already enrolled
(121 reviews)
What you'll learn
Understand the main components of the Transformer architecture.
Learn how a BERT model is built using Transformers.
Use BERT to solve different natural language processing (NLP) tasks.
There is 1 module in this course
In this module, you learn about the main components of the Transformer architecture, such as the self-attention mechanism, and how they are used to build the BERT model. You also learn about the tasks that BERT can be used for, such as text classification, question answering, and natural language inference.
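As a rough illustration of the self-attention mechanism named above, here is a minimal sketch in plain Python. It is not taken from the course materials. It assumes the simplest possible setup: queries, keys, and values are the token vectors themselves, whereas a real Transformer layer learns a separate projection matrix for each. The names `softmax`, `self_attention`, and `toy_tokens` are hypothetical and chosen only for this example.

```python
import math


def softmax(scores):
    """Turn a list of raw scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections.

    Assumption for illustration: queries, keys, and values are the
    token vectors themselves (no learned projection matrices).
    """
    dim = len(tokens[0])
    outputs = []
    for query in tokens:
        # Score this token against every token, scaled by sqrt(dim).
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Each output is a weighted average of all token vectors.
        mixed = [
            sum(w * value[i] for w, value in zip(weights, tokens))
            for i in range(dim)
        ]
        outputs.append(mixed)
    return outputs


# Three toy 2-dimensional "token embeddings" (made-up values).
toy_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
contextual = self_attention(toy_tokens)
for vector in contextual:
    print([round(x, 3) for x in vector])
```

Each output vector is a weighted average of every input vector, which is how a token's representation comes to reflect its context on both sides. BERT stacks many such layers, each with learned projections and multiple attention heads.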
What's included
2 videos, 1 reading, 1 assignment
Learner reviews
- 5 stars: 56.55%
- 4 stars: 21.31%
- 3 stars: 7.37%
- 2 stars: 5.73%
- 1 star: 9.01%
Reviewed on Mar 12, 2024
Excellent and concise presentation of Transformer and BERT models. The course designer may consider adding programming assignments to illustrate the concepts and to reinforce student learning.
Reviewed on May 13, 2025
I am looking for free resources for experimentation, or a lighter model that can run on my laptop.
Reviewed on Jul 16, 2023
The course was amazing and gave me a good overview of the BERT model and concepts like encoding and decoding, but it is not for beginners.