Applied Machine Learning in Python

Applied Machine Learning in Python

This course is part of Applied Data Science with Python Specialization

Instructor: Kevyn Collins-Thompson

326,668 already enrolled

Included with

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

8,769 reviews

Intermediate level

Some related experience required

Flexible schedule

3 weeks at 10 hours a week

Learn at your own pace

92%

Most learners liked this course

4 modules

Gain insight into a topic and learn the fundamentals.

8,769 reviews

Intermediate level

Some related experience required

Flexible schedule

3 weeks at 10 hours a week

Learn at your own pace

92%

Most learners liked this course

What you'll learn

Describe how machine learning is different than descriptive statistics
Create and evaluate data clusters
Explain different approaches for creating predictive models
Build features that meet analysis needs

Skills you'll gain

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

5 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Applied Data Science with Python Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 4 modules in this course

This course will introduce the learner to applied machine learning, focusing more on the techniques and methods than on the statistics behind these methods. The course will start with a discussion of how machine learning is different than descriptive statistics, and introduce the scikit learn toolkit through a tutorial. The issue of dimensionality of data will be discussed, and the task of clustering data, as well as evaluating those clusters, will be tackled. Supervised approaches for creating predictive models will be described, and learners will be able to apply the scikit learn predictive modelling methods while understanding process issues related to data generalizability (e.g. cross validation, overfitting). The course will end with a look at more advanced techniques, such as building ensembles, and practical limitations of predictive models. By the end of this course, students will be able to identify the difference between a supervised (classification) and unsupervised (clustering) technique, identify which technique they need to apply for a particular dataset and need, engineer features to meet that need, and write python code to carry out an analysis.

This module introduces basic machine learning concepts, tasks, and workflow using an example classification problem based on the K-nearest neighbors method, and implemented using the scikit-learn library.

What's included

7 videos4 readings1 assignment1 programming assignment1 ungraded lab

7 videos Total 75 minutes

Introduction 11 minutes
What's New? 1 minute
Key Concepts in Machine Learning 14 minutes
Python Tools for Machine Learning 5 minutes
An Example Machine Learning Problem 12 minutes
Examining the Data 9 minutes
K-Nearest Neighbors Classification 24 minutes

4 readings Total 60 minutes

Syllabus 10 minutes
Help us learn more about you! 10 minutes
Notice for Auditing Learners: Assignment Submission 10 minutes
Zachary Lipton: The Foundations of Algorithmic Bias (optional) 30 minutes

1 assignment Total 20 minutes

Module 1 Quiz 20 minutes

1 programming assignment Total 180 minutes

Assignment 1 180 minutes

1 ungraded lab Total 60 minutes

Module 1 Notebook 60 minutes

This module delves into a wider variety of supervised learning methods for both classification and regression, learning about the connection between model complexity and generalization performance, the importance of proper feature scaling, and how to control model complexity by applying techniques like regularization to avoid overfitting. In addition to k-nearest neighbors, this week covers linear regression (least-squares, ridge, lasso, and polynomial regression), logistic regression, support vector machines, the use of cross-validation for model evaluation, and decision trees.

What's included

13 videos2 readings2 assignments1 programming assignment2 ungraded labs

13 videos Total 190 minutes

Introduction to Supervised Machine Learning 17 minutes
Overfitting and Underfitting 12 minutes
Supervised Learning: Datasets 5 minutes
K-Nearest Neighbors: Classification and Regression 13 minutes
Linear Regression: Least-Squares 18 minutes
Linear Regression: Ridge, Lasso, and Polynomial Regression 27 minutes
Logistic Regression 13 minutes
Linear Classifiers: Support Vector Machines 14 minutes
Multi-Class Classification 7 minutes
Kernelized Support Vector Machines 19 minutes
Cross-Validation 12 minutes
Decision Trees 20 minutes
One-Hot Encoding (Optional) 14 minutes

2 readings Total 20 minutes

A Few Useful Things to Know about Machine Learning 10 minutes
Ed Yong: Genetic Test for Autism Refuted (optional) 10 minutes

2 assignments Total 40 minutes

Module 2 Quiz 30 minutes
Assignment 2 - Follow-up 10 minutes

1 programming assignment Total 180 minutes

Assignment 2 180 minutes

2 ungraded labs Total 120 minutes

Module 2 Notebook 60 minutes
Classifier Visualization Playspace 60 minutes

This module covers evaluation and model selection methods that you can use to help understand and optimize the performance of your machine learning models.

What's included

8 videos2 readings1 assignment1 programming assignment1 ungraded lab

8 videos Total 113 minutes

Model Evaluation & Selection 22 minutes
Confusion Matrices & Basic Evaluation Metrics 14 minutes
Classifier Decision Functions 7 minutes
Precision-Recall and ROC Curves 8 minutes
Multi-Class Evaluation 10 minutes
Regression Evaluation 6 minutes
Model Selection: Optimizing Classifiers for Different Evaluation Metrics 13 minutes
Model Calibration (Optional) 31 minutes

2 readings Total 20 minutes

Practical Guide to Controlled Experiments on the Web (optional) 10 minutes
Note on Assignment 3 10 minutes

1 assignment Total 28 minutes

Module 3 Quiz 28 minutes

1 programming assignment Total 180 minutes

Assignment 3 180 minutes

1 ungraded lab Total 60 minutes

Module 3 Notebook 60 minutes

This module covers more advanced supervised learning methods that include ensembles of trees (random forests, gradient boosted trees), and neural networks (with an optional summary on deep learning). You will also learn about the critical problem of data leakage in machine learning and how to detect and avoid it.

What's included

10 videos13 readings1 assignment1 programming assignment2 ungraded labs

10 videos Total 103 minutes

Naive Bayes Classifiers 8 minutes
Random Forests 12 minutes
Gradient Boosted Decision Trees 6 minutes
Neural Networks 19 minutes
Deep Learning (Optional) 14 minutes
Data Leakage 13 minutes
Introduction 5 minutes
Dimensionality Reduction and Manifold Learning 10 minutes
Clustering 15 minutes
Conclusion 3 minutes

13 readings Total 123 minutes

Neural Networks Made Easy (optional) 10 minutes
Play with Neural Networks: TensorFlow Playground (optional) 10 minutes
Deep Learning in a Nutshell: Core Concepts (optional) 10 minutes
Assisting Pathologists in Detecting Cancer with Deep Learning (optional) 10 minutes
The Treachery of Leakage (optional) 10 minutes
Leakage in Data Mining: Formulation, Detection, and Avoidance (optional) 10 minutes
Data Leakage Example: The ICML 2013 Whale Challenge (optional) 10 minutes
Rules of Machine Learning: Best Practices for ML Engineering (optional) 10 minutes
How to Use t-SNE Effectively 10 minutes
How Machines Make Sense of Big Data: an Introduction to Clustering Algorithms 10 minutes
Post-course Survey 10 minutes
Keep Learning with Michigan Online 10 minutes
Admissions Team alert about fee waiver 3 minutes

1 assignment Total 20 minutes

Module 4 Quiz 20 minutes

1 programming assignment Total 180 minutes

Assignment 4 180 minutes

2 ungraded labs Total 120 minutes

Module 4 Notebook 60 minutes
Unsupervised Learning Notebook 60 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings

(925 ratings)

Kevyn Collins-Thompson

University of Michigan

4 Courses 328,128 learners

Offered by

University of Michigan

Explore more from Data Analysis

Status: Free Trial
Edureka
Applied Machine Learning with Python
Course
Status: Free Trial
University of Michigan
Applied Unsupervised Learning in Python
Course
Status: Free Trial
EDUCBA
Machine Learning in Python: Analyze & Apply
Course
Status: Preview
O.P. Jindal Global University
Machine Learning
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
71.89%
4 stars
20.79%
3 stars
4.80%
2 stars
1.20%
1 star
1.29%

Showing 3 of 8769

Reviewed on Oct 22, 2020

EXTREMELY USEFUL AND GOOD COURSE, CONGRATULATIONS TO ALL THE PEOPLE INVOLVE.Honestly, I never thought I could learn so much in an online course, excited for the rest of the specialization

Reviewed on Aug 19, 2018

Concise and clear presentation of the material with the majority of time focused around using TDD to learn and practice concepts through developing solutions to open ended coding challenges.

Reviewed on Nov 26, 2020

great experience and learning lots of technique to apply on real world data, and get important and insightful information from raw data. motivated to proceed further in this domain and course as well.

View more reviews

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Learn more

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Explore degrees

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Learn more

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.