The demand for technical generative AI (GenAI) skills is increasing, and businesses are actively seeking AI engineers who can work with large language models (LLMs). This IBM course is designed to build job-ready skills that can accelerate your AI career.



Generative AI Engineering and Fine-Tuning Transformers
This course is part of multiple programs.



Instructors: Joseph Santarcangelo
15,166 already enrolled
(90 reviews)
What you'll learn
- Sought-after, job-ready skills businesses need for working with transformer-based LLMs in generative AI engineering 
- How to perform parameter-efficient fine-tuning (PEFT) using methods like LoRA and QLoRA to optimize model training 
- How to use pretrained transformer models for language tasks and fine-tune them for specific downstream applications 
- How to load models, run inference, and train models using the Hugging Face and PyTorch frameworks 
Details to know

4 assignments
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 2 modules in this course
In this module, you will work hands-on with large language models (LLMs) using the industry-standard tools Hugging Face and PyTorch. You’ll explore the differences between these frameworks, learn how to load pretrained models and run inference with them, and understand how LLMs are pretrained and fine-tuned. Hands-on labs give you experience implementing these techniques so that you can develop and optimize generative AI models for a range of applications. By the end of this module, you’ll be able to use and fine-tune LLMs for specific tasks and performance requirements.
What's included
5 videos, 4 readings, 2 assignments, 4 app items
In this module, you will explore cutting-edge methods for fine-tuning large language models using parameter-efficient fine-tuning (PEFT) techniques. You’ll gain an understanding of adapters, low-rank adaptation (LoRA), and quantization, along with practical applications of PyTorch and Hugging Face libraries. The hands-on labs and readings will deepen your knowledge of soft prompts, quantized LoRA (QLoRA), and key terminology. You will also have access to a concise cheat sheet and a glossary that reinforce essential techniques, terms, and tools introduced throughout the course.
What's included
4 videos, 5 readings, 2 assignments, 2 app items, 4 plugins
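To make the ideas in this module more concrete, here is a minimal, framework-free sketch of low-rank adaptation (LoRA) and of the quantize-then-adapt idea behind QLoRA. It is not taken from the course labs, which use PyTorch and Hugging Face. The function names, the toy matrix sizes, and the use of simple absmax int8 quantization (QLoRA itself uses a 4-bit format) are all assumptions made for illustration.

```python
# Framework-free sketch of LoRA and QLoRA-style quantization.
# Assumptions (not from the course): every helper name below, the toy
# sizes, and absmax int8 quantization standing in for QLoRA's 4-bit format.


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def lora_param_counts(d_out, d_in, rank):
    """Return (full fine-tuning params, LoRA params) for one weight matrix."""
    full = d_out * d_in
    lora = rank * (d_in + d_out)  # B is d_out x rank, A is rank x d_in
    return full, lora


def lora_effective_weight(w, a, b, alpha, rank):
    """Compute W + (alpha / rank) * (B @ A), the merged LoRA weight.

    W stays frozen; only the small matrices A and B would be trained.
    """
    scale = alpha / rank
    delta = matmul(b, a)
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


def quantize_absmax_int8(w):
    """Quantize a matrix to integer codes in [-127, 127] with one scale."""
    absmax = max(abs(v) for row in w for v in row)
    scale = absmax / 127 if absmax else 1.0
    codes = [[round(v / scale) for v in row] for row in w]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate floating-point weights."""
    return [[c * scale for c in row] for row in codes]


if __name__ == "__main__":
    w = [[1.0, 2.0], [3.0, 4.0]]   # frozen pretrained weight (toy values)
    a = [[1.0, 0.0]]               # rank x d_in adapter matrix
    b = [[1.0], [2.0]]             # d_out x rank adapter matrix

    # Plain LoRA: full-precision base weight plus the low-rank update.
    print(lora_effective_weight(w, a, b, alpha=2, rank=1))

    # QLoRA-style: store the base weight quantized, dequantize it on the
    # fly, and keep the adapter matrices in floating point.
    codes, scale = quantize_absmax_int8(w)
    print(lora_effective_weight(dequantize(codes, scale), a, b, alpha=2, rank=1))
```

With a 4096 x 4096 weight matrix and rank 8, for example, lora_param_counts(4096, 4096, 8) returns (16777216, 65536), so the adapter trains roughly 0.4% of the parameters that full fine-tuning would update for that matrix.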
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Learner reviews
90 reviews
- 5 stars: 74.72%
- 4 stars: 12.08%
- 3 stars: 6.59%
- 2 stars: 3.29%
- 1 star: 3.29%
Reviewed on Nov 16, 2024
The coding part in the labs provided in this course was very helpful and helped me to stabilize my learning.
Reviewed on Jan 16, 2025
The labs all too often failed on environment issues - packages, version alignment, etc. This should be seamless in your controlled environment.
Reviewed on Jan 1, 2025
The course is good but lacks depth on complex subjects.





