IBM

Machine Learning with Python

Joseph Santarcangelo
Jeff Grossman

Instructors: Joseph Santarcangelo

Access provided by University of Pittsburgh

610,978 already enrolled

Gain insight into a topic and learn the fundamentals.
4.7

(17,905 reviews)

Intermediate level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
94%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.
4.7

(17,905 reviews)

Intermediate level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
94%
Most learners liked this course

What you'll learn

  • Explain key concepts, tools, and roles involved in machine learning, including supervised and unsupervised learning techniques.

  • Apply core machine learning algorithms such as regression, classification, clustering, and dimensionality reduction using Python and scikit-learn.

  • Evaluate model performance using appropriate metrics, validation strategies, and optimization techniques.

  • Build and assess end-to-end machine learning solutions on real-world datasets through hands-on labs, projects, and practical evaluations.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

15 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate from IBM

There are 6 modules in this course

In this module, you will explore foundational machine learning concepts that prepare you for hands-on modeling with Python. You will explain the relevance of Python and scikit-learn in machine learning, summarize the IBM AI Engineering certification path, and classify common types of learning algorithms. You’ll outline the stages of the machine learning model lifecycle and describe what a typical day looks like for a machine learning engineer. You will also compare key roles in the AI field, identify widely used open-source tools, and learn to utilize scikit-learn to build and evaluate simple models.

What's included

8 videos2 readings2 assignments1 plugin

In this module, you will explore two essential regression techniques used in machine learning—linear and logistic regression. You’ll explain the role of regression in predicting outcomes, describe the differences between simple and multiple linear regression, and apply both using scikit-learn on real-world data. You will also interpret how polynomial and non-linear regression models capture complex patterns. The module introduces logistic regression as a classification method and guides you in training and testing classification models effectively. To support your learning, you’ll receive a Cheat Sheet: Linear and Logistic Regression that summarizes key concepts, formulas, and use cases.

What's included

6 videos1 reading3 assignments3 app items1 plugin

In this module, you will build and evaluate a range of supervised machine learning models to solve both classification and regression problems. You’ll start by describing how classification models predict categorical outcomes, and implement multi-class classification strategies using real-world data. You’ll then explore how decision trees make predictions and apply them to both classification and regression tasks. The module also covers using support vector machines (SVM) for fraud detection, applying K-Nearest Neighbors (KNN) for customer classification, and training ensemble models like Random Forest and XGBoost to improve accuracy and efficiency. You’ll differentiate bias and variance in model performance and explore how ensemble methods help balance this tradeoff. To support your learning, you’ll receive a Cheat Sheet: Building Supervised Learning Models with key terms, model types, and evaluation tips.

What's included

6 videos1 reading3 assignments6 app items1 plugin

In this module, you will learn how unsupervised learning techniques uncover hidden patterns in data without using labeled responses. You’ll describe clustering concepts and apply K-Means to real-world customer segmentation tasks. You’ll also compare DBSCAN and HDBSCAN models to identify dense clusters in spatial data. Moving beyond clustering, you’ll explore dimensionality reduction as a tool for simplifying high-dimensional datasets. You’ll apply PCA to uncover key components and use advanced techniques like t-SNE and UMAP to visualize data structure. To support your learning, you’ll receive a Cheat Sheet: Building Unsupervised Learning Models, highlighting core methods, practical use cases, and comparison guidelines.

What's included

5 videos1 reading3 assignments4 app items1 plugin

In this module, you will learn how to assess the effectiveness of machine learning models using industry-standard evaluation and validation techniques. You’ll explain key classification and regression metrics, evaluate models using real-world data, and interpret results with tools like confusion matrices and feature importance charts. You'll explore how to assess clustering quality in unsupervised learning and apply cross-validation to reduce overfitting. The module also introduces regularization methods to improve model generalization and reduce feature complexity. Finally, you'll build complete machine learning pipelines and optimize them with GridSearchCV, while identifying common pitfalls like data leakage. To support your learning, you’ll receive a Cheat Sheet: Evaluating and Validating Machine Learning Models covering key metrics, techniques, and model tuning strategies.

What's included

6 videos1 reading3 assignments5 app items1 plugin

In this final module, you will apply and demonstrate the full range of skills you have gained throughout the course. You will start with a practice project using the Titanic dataset to build and optimize classification models using pipelines, cross-validation, and hyperparameter tuning. Then, you will complete the final project by developing a rainfall prediction classifier using historical weather data. This includes data cleaning, feature engineering, model building, and evaluating performance. To conclude the course, you will take a graded final exam that tests your knowledge across all six modules. This module gives you the opportunity to showcase your learning in both practical and theoretical contexts.

What's included

1 video3 readings1 assignment3 app items

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.7 (3,382 ratings)
Joseph Santarcangelo
IBM
36 Courses2,193,412 learners
Jeff Grossman
IBM
3 Courses673,796 learners

Offered by

IBM

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.7

17,905 reviews

  • 5 stars

    75.90%

  • 4 stars

    18.62%

  • 3 stars

    3.45%

  • 2 stars

    0.99%

  • 1 star

    1.02%

Showing 3 of 17905

FO
5

Reviewed on Oct 8, 2020

RV
5

Reviewed on Jan 14, 2025

AJ
5

Reviewed on Jul 8, 2019

Explore more from Data Science