IBM

Generative AI for NLP with PyTorch Specialization

Ends soon! Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

IBM

Generative AI for NLP with PyTorch Specialization

Build Generative AI NLP Skills With PyTorch.

Get hands-on with PyTorch, Hugging Face, transformers, and NLP in an applied model project

IBM Skills Network Team
Fateme Akbari
Kang Wang

Instructors: IBM Skills Network Team

Included with Coursera Plus

Ask Coursera

Get in-depth knowledge of a subject

from 2,107 reviews of courses in this program

Advanced level

Recommended experience

4 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Get in-depth knowledge of a subject

from 2,107 reviews of courses in this program

Advanced level

Recommended experience

4 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Build, train, and fine-tune NLP models in PyTorch through a portfolio-ready capstone with an LSTM and DistilBERT comparison

  • Develop deep and convolutional neural networks in PyTorch using gradient descent, dropout, batch normalization, and GPU acceleration

  • Apply attention mechanisms, tokenization, and multi-head attention to fine-tune pretrained transformers including BERT and DistilBERT

  • Design end-to-end NLP pipelines and compare RNN, LSTM, and transformer architectures on real text classification tasks

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English
Recently updated!

June 2026

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Advance your subject-matter expertise

  • Learn in-demand skills from university and industry experts
  • Master a subject or tool with hands-on projects
  • Develop a deep understanding of key concepts
  • Earn a career certificate from IBM

Specialization - 4 course series

Introduction to Neural Networks and PyTorch

Introduction to Neural Networks and PyTorch

Course 1, 19 hours

What you'll learn

  • Get hands-on building, training, and evaluating PyTorch models you can showcase in your professional portfolio

  • Gain practical experience with tensors, datasets, and automatic differentiation using PyTorch core tools, including autograd and DataLoader

  • Develop linear regression models using gradient descent, mini-batch optimization, and training/validation splits to evaluate model performance

  • ·Apply cross-entropy loss, sigmoid-based classification, and advanced optimization techniques to build logistic regression models in PyTorch

Skills you'll gain

Category: PyTorch (Machine Learning Library)
Category: Logistic Regression
Category: Data Preprocessing
Category: Regression Analysis
Category: Machine Learning
Category: Data Processing
Category: Tensorflow
Category: Probability & Statistics
Category: Deep Learning
Category: Statistical Methods
Category: Supervised Learning
Category: Applied Machine Learning
Category: Predictive Modeling
Deep Learning with PyTorch

Deep Learning with PyTorch

Course 2, 21 hours

What you'll learn

  • Get hands-on experience using PyTorch to build and deploy AI systems and complete a portfolio-worthy project.

  • Develop and train shallow neural networks with various architectures and apply Softmax regression in multi-class classification problems.

  • Explore deep neural networks, including techniques such as dropout, weight initialization, and batch normalization.

  • Gain practical experience with convolutional neural networks, exploring layers, activation functions, and more.

Skills you'll gain

Category: PyTorch (Machine Learning Library)
Category: Deep Learning
Category: Logistic Regression
Category: Artificial Neural Networks
Category: Classification Algorithms
Category: Convolutional Neural Networks
Category: Model Evaluation
Category: Artificial Intelligence and Machine Learning (AI/ML)
Category: Applied Machine Learning
Category: Model Optimization
Category: Transfer Learning
Category: Image Analysis
Category: Model Training
Generative AI Language Modeling with Transformers

Generative AI Language Modeling with Transformers

Course 3, 9 hours

What you'll learn

  • Explain the role of attention mechanisms in transformer models for capturing contextual relationships in text

  • Describe the differences in language modeling approaches between decoder-based models like GPT and encoder-based models like BERT

  • Implement key components of transformer models, including positional encoding, attention mechanisms, and masking, using PyTorch

  • Apply transformer-based models for real-world NLP tasks, such as text classification and language translation, using PyTorch and Hugging Face tools

Skills you'll gain

Category: PyTorch (Machine Learning Library)
Category: Generative AI
Category: Large Language Modeling
Category: Generative Model Architectures
Category: Natural Language Processing
Category: Applied Machine Learning
Category: Model Training
Category: Transfer Learning
Category: Embeddings
Category: Data Preprocessing
Category: Model Optimization
Generative AI for NLP with PyTorch Capstone Project

Generative AI for NLP with PyTorch Capstone Project

Course 4, 11 hours

What you'll learn

  • Get hands-on experience using PyTorch to build NLP models in a portfolio-worthy capstone project that demonstrates your skills to employers.

  • Design and implement an end-to-end NLP workflow, including text preparation, tokenization, model training, and evaluation.

  • Apply sequential and transformer-based architectures to text classification tasks and adapt pretrained models to domain-specific data.

  • Compare model performance using relevant metrics and communicate design decisions, results, and trade-offs through a capstone submission.

Skills you'll gain

Category: Deep Learning
Category: Model Training
Category: Artificial Neural Networks
Category: Generative AI
Category: PyTorch (Machine Learning Library)
Category: Machine Learning Algorithms
Category: Natural Language Processing
Category: Data Preprocessing
Category: Hugging Face
Category: Machine Learning
Category: Transfer Learning
Category: Recurrent Neural Networks (RNNs)
Category: Model Optimization
Category: Model Evaluation
Category: Fine-tuning
Category: Data Processing
Category: Generative Model Architectures
Category: Large Language Modeling

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

IBM Skills Network Team
95 Courses2,054,753 learners

Offered by

IBM

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions