When you enroll in this course, you'll also be asked to select a specific program.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate from IBM
There are 6 modules in this course
Python is a core skill in machine learning, and this course equips you with the tools to apply it effectively. You’ll learn key ML concepts, build models with scikit-learn, and gain hands-on experience using Jupyter Notebooks.
Start with regression techniques like linear, multiple linear, polynomial, and logistic regression. Then move into supervised models such as decision trees, K-Nearest Neighbors, and support vector machines. You’ll also explore unsupervised learning, including clustering methods and dimensionality reduction with PCA, t-SNE, and UMAP.
Through real-world labs, you’ll practice model evaluation, cross-validation, regularization, and pipeline optimization. A final project on rainfall prediction and a course-wide exam will help you apply and reinforce your skills.
Enroll now to start building machine learning models with confidence using Python.
In this module, you will explore foundational machine learning concepts that prepare you for hands-on modeling with Python. You will explain the relevance of Python and scikit-learn in machine learning, summarize the IBM AI Engineering certification path, and classify common types of learning algorithms. You’ll outline the stages of the machine learning model lifecycle and describe what a typical day looks like for a machine learning engineer. You will also compare key roles in the AI field, identify widely used open-source tools, and learn to utilize scikit-learn to build and evaluate simple models.
What's included
8 videos3 readings4 assignments
Show info about module content
8 videos•Total 52 minutes
Course Introduction•3 minutes
IBM AI Engineering PC Overview •8 minutes
An Overview of Machine Learning•8 minutes
Machine Learning Model Lifecycle•2 minutes
A Day in the life of a Machine Learning Engineer•8 minutes
Data Scientist vs AI Engineer•11 minutes
Tools for Machine Learning•9 minutes
Scikit-learn Machine Learning Ecosystem•5 minutes
3 readings•Total 20 minutes
Course Overview•5 minutes
Helpful Tips for Course Completion•10 minutes
Module 1 Summary and Highlights•5 minutes
4 assignments•Total 57 minutes
Practice Quiz: Exploring Machine Learning Concepts•12 minutes
Practice Quiz: Understanding ML Engineering and AI differences•12 minutes
Practice Quiz: Essential Tools and Ecosystems for ML•12 minutes
Graded Quiz: Introduction to Machine Learning•21 minutes
Linear and Logistic Regression
Module 2•3 hours to complete
Module details
In this module, you will explore two essential regression techniques used in machine learning—linear and logistic regression. You’ll explain the role of regression in predicting outcomes, describe the differences between simple and multiple linear regression, and apply both using scikit-learn on real-world data. You will also interpret how polynomial and non-linear regression models capture complex patterns. The module introduces logistic regression as a classification method and guides you in training and testing classification models effectively. To support your learning, you’ll receive a Cheat Sheet: Linear and Logistic Regression that summarizes key concepts, formulas, and use cases.
What's included
6 videos2 readings3 assignments3 app items
Show info about module content
6 videos•Total 38 minutes
Introduction to Regression•4 minutes
Introduction to Simple Linear Regression•5 minutes
Introduction to Multiple Linear Regression•8 minutes
Polynomial and Non-Linear Regression•7 minutes
Introduction to Logistic Regression•7 minutes
Training a Logistic Regression Model•6 minutes
2 readings•Total 15 minutes
Module 2 Summary and Highlights•5 minutes
Cheat Sheet: Linear and Logistic Regression•10 minutes
3 assignments•Total 41 minutes
Practice Quiz: Linear Regression •10 minutes
Practice Quiz: Logistic Regression•10 minutes
Graded Quiz: Linear and Logistic Regression•21 minutes
3 app items•Total 60 minutes
Lab: Simple Linear Regression•15 minutes
Lab: Multiple Linear Regression•15 minutes
Lab: Logistic Regression•30 minutes
Building Supervised Learning Models
Module 3•4 hours to complete
Module details
In this module, you will build and evaluate a range of supervised machine learning models to solve both classification and regression problems. You’ll start by describing how classification models predict categorical outcomes, and implement multi-class classification strategies using real-world data. You’ll then explore how decision trees make predictions and apply them to both classification and regression tasks. The module also covers using support vector machines (SVM) for fraud detection, applying K-Nearest Neighbors (KNN) for customer classification, and training ensemble models like Random Forest and XGBoost to improve accuracy and efficiency. You’ll differentiate bias and variance in model performance and explore how ensemble methods help balance this tradeoff. To support your learning, you’ll receive a Cheat Sheet: Building Supervised Learning Models with key terms, model types, and evaluation tips.
What's included
6 videos3 readings3 assignments6 app items
Show info about module content
6 videos•Total 39 minutes
Classification•6 minutes
Decision Trees•7 minutes
Regression Trees•6 minutes
Supervised Learning with SVMs•7 minutes
Supervised Learning with KNN•6 minutes
Bias, Variance, and Ensemble Models •6 minutes
3 readings•Total 23 minutes
Errata: Regression Trees Video•3 minutes
Module 3 Summary and Highlights•5 minutes
Cheat Sheet: Building Supervised Learning Models•15 minutes
3 assignments•Total 41 minutes
Practice Quiz: Classification and Regression•10 minutes
Practice Quiz: Other Supervised Learning Models•10 minutes
Graded Quiz: Building Supervised Learning Models•21 minutes
6 app items•Total 160 minutes
Lab: Multi-class Classification•30 minutes
Lab: Decision Trees•25 minutes
Lab: Regression Trees•30 minutes
Lab: Credit Card Fraud Detection with Decision Trees and SVM•30 minutes
Lab: K-Nearest Neighbors Classifier•25 minutes
Lab: Random Forests and XGBoost•20 minutes
Building Unsupervised Learning Models
Module 4•3 hours to complete
Module details
In this module, you will learn how unsupervised learning techniques uncover hidden patterns in data without using labeled responses. You’ll describe clustering concepts and apply K-Means to real-world customer segmentation tasks. You’ll also compare DBSCAN and HDBSCAN models to identify dense clusters in spatial data. Moving beyond clustering, you’ll explore dimensionality reduction as a tool for simplifying high-dimensional datasets. You’ll apply PCA to uncover key components and use advanced techniques like t-SNE and UMAP to visualize data structure. To support your learning, you’ll receive a Cheat Sheet: Building Unsupervised Learning Models, highlighting core methods, practical use cases, and comparison guidelines.
What's included
5 videos2 readings3 assignments4 app items
Show info about module content
5 videos•Total 31 minutes
Clustering Strategies and Real-World Applications•7 minutes
K-means and More on K-means•7 minutes
DBSCAN and HDBSCAN Clustering•7 minutes
Clustering, Dimension Reduction, and Feature Engineering•5 minutes
Dimension Reduction Algorithms•5 minutes
2 readings•Total 20 minutes
Module 4 Summary and Highlights•5 minutes
Cheat Sheet: Building Unsupervised Learning Models•15 minutes
3 assignments•Total 41 minutes
Practice Quiz: Clustering•10 minutes
Practice Quiz: Dimension Reduction & Feature Engineering•10 minutes
Graded Quiz: Building Unsupervised Learning Models •21 minutes
4 app items•Total 115 minutes
Lab: K-Means•25 minutes
Lab: Comparing DBSCAN and HDBSCAN•30 minutes
Lab: Applications of Principal Component Analysis (PCA)•30 minutes
Lab: t-SNE and UMAP•30 minutes
Evaluating and Validating Machine Learning Models
Module 5•4 hours to complete
Module details
In this module, you will learn how to assess the effectiveness of machine learning models using industry-standard evaluation and validation techniques. You’ll explain key classification and regression metrics, evaluate models using real-world data, and interpret results with tools like confusion matrices and feature importance charts. You'll explore how to assess clustering quality in unsupervised learning and apply cross-validation to reduce overfitting. The module also introduces regularization methods to improve model generalization and reduce feature complexity. Finally, you'll build complete machine learning pipelines and optimize them with GridSearchCV, while identifying common pitfalls like data leakage. To support your learning, you’ll receive a Cheat Sheet: Evaluating and Validating Machine Learning Models covering key metrics, techniques, and model tuning strategies.
What's included
6 videos2 readings3 assignments5 app items
Show info about module content
6 videos•Total 39 minutes
Classification Metrics and Evaluation Techniques•6 minutes
Regression Metrics and Evaluation Techniques•5 minutes
Evaluating Unsupervised Learning Models: Heuristics and Techniques•7 minutes
Cross-Validation and Advanced Model Validation Techniques•6 minutes
Regularization in Regression and Classification•7 minutes
Data Leakage and Other Pitfalls•7 minutes
2 readings•Total 20 minutes
Module 5 Summary and Highlights•5 minutes
Cheat Sheet: Evaluating and Validating Machine Learning Models•15 minutes
3 assignments•Total 41 minutes
Practice Quiz: Evaluating Machine Learning Models•10 minutes
Practice Quiz: Best Practices for Ensuring Model Generalizability•10 minutes
Graded Quiz: Evaluating and Validating Machine Learning Models•21 minutes
5 app items•Total 160 minutes
Lab: Evaluating Classification Models•25 minutes
Lab: Evaluating Random Forest Performance•30 minutes
Lab: Evaluating K-means Clustering•30 minutes
Lab: Regularization in Linear Regression•30 minutes
Lab: Machine Learning Pipelines and GridSearchCV•45 minutes
Final Project and Exam
Module 6•4 hours to complete
Module details
In this final module, you will apply and demonstrate the full range of skills you have gained throughout the course. You will start with a practice project using the Titanic dataset to build and optimize classification models using pipelines, cross-validation, and hyperparameter tuning. Then, you will complete the final project by developing a rainfall prediction classifier using historical weather data. This includes data cleaning, feature engineering, model building, and evaluating performance. To conclude the course, you will take a graded final exam that tests your knowledge across all six modules. This module gives you the opportunity to showcase your learning in both practical and theoretical contexts.
What's included
1 video3 readings1 assignment3 app items
Show info about module content
1 video•Total 7 minutes
Course Wrap-up•7 minutes
3 readings•Total 13 minutes
Final Project Scenario•2 minutes
Congratulations and Next Steps•6 minutes
Thanks from the Course Team•5 minutes
1 assignment•Total 45 minutes
Final Exam•45 minutes
3 app items•Total 150 minutes
Practice Project: Titanic Survival Prediction•30 minutes
Final Project: Building a Rainfall Prediction Classifier•60 minutes
Final Project Submission and Evaluation•60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Instructor ratings
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
At IBM, we know how rapidly tech evolves and recognize the crucial need for businesses and professionals to build job-ready, hands-on skills quickly. As a market-leading tech innovator, we’re committed to helping you thrive in this dynamic landscape. Through IBM Skills Network, our expertly designed training programs in AI, software development, cybersecurity, data science, business management, and more, provide the essential skills you need to secure your first job, advance your career, or drive business success. Whether you’re upskilling yourself or your team, our courses, Specializations, and Professional Certificates build the technical expertise that ensures you, and your organization, excel in a competitive world.
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
4.7
18,371 reviews
5 stars
75.93%
4 stars
18.60%
3 stars
3.43%
2 stars
1%
1 star
1.01%
Showing 3 of 18371
R
RV
5·
Reviewed on Jan 14, 2025
good course , some part is typical more statistical part shown, even i have good understanding of ML , so new learner will find little typical. rest tutor voice and language is understandable.
F
FO
5·
Reviewed on Oct 8, 2020
I'm extremely excited with what I have learnt so far. As a newbie in Machine Learning, the exposure gained will serve as the much needed foundation to delve into its application to real life problems.
C
CA
4·
Reviewed on Dec 31, 2019
could be split in two courses to be given enough focus. it was very condensed and needed more time and explanation in each section. The instructor was very good but more details would have been nice
Why is Python the most popular language for machine learning?
Python’s popularity in machine learning stems from its simplicity, readability, and extensive libraries like TensorFlow, PyTorch, and scikit-learn, which streamline complex ML tasks. Its active community and ease of integration with other languages and tools also make Python an ideal choice for ML.
How do machine learning engineers use Python in their work?
Machine learning engineers use Python to develop algorithms, preprocess data, train models, and analyze results. With Python’s rich libraries and frameworks, they can experiment with various models, optimize performance, and deploy applications efficiently.
What are the advantages of using Python for machine learning over other languages?
Python offers a wide range of ML libraries, is beginner-friendly, and has great support for data visualization and model interpretation. It also supports rapid prototyping, making it easier to test and refine models compared to other languages like C++ or Java.
Who should take this course ?
This course is designed for aspiring and current machine learning practitioners who want to build foundational skills in Python-based machine learning, from data preparation and model development to evaluation and optimization.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.