IBM

Machine Learning Capstone

Yan Luo
Artem Arutyunov

Instructors: Yan Luo

Access provided by Justice Through Code at Columbia University

20,880 already enrolled

Gain insight into a topic and learn the fundamentals.
4.7

(168 reviews)

Advanced level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
96%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.
4.7

(168 reviews)

Advanced level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
96%
Most learners liked this course

What you'll learn

  • Compare and contrast different machine learning algorithms by creating recommender systems in Python

  • Predict course ratings by training a neural network and constructing regression and classification models 

  • Create recommendation systems by applying your knowledge of KNN, PCA, and non-negative matrix collaborative filtering

  • Develop a final presentation and evaluate your peers’ projects

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

6 assignments¹

AI Graded see disclaimer
Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your Machine Learning expertise

This course is part of the IBM Machine Learning with Python & Scikit-learn Professional Certificate
When you enroll in this course, you'll also be enrolled in this Professional Certificate.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate from IBM

There are 5 modules in this course

In this module, you will be introduced to the idea of recommender systems. All labs in subsequent modules are based on this concept. You will also be provided with an overview of the capstone project. You will perform exploratory data analysis to find preliminary insights such as data patterns. You will also use it to check assumptions with the help of summary statistics and graphical representations of online course-related data sets such as course titles, course genres, and course enrollments. Next, you will extract a word-count vector called a “bag of words” (BoW) from course titles and descriptions. The BoW feature is probably the simplest but most effective feature characterizing textual data. It is widely used in many textual machine learning tasks. Finally, you will apply the cosine similarity measurement to calculate the course similarity using the extracted BoW feature vectors.

What's included

2 videos2 assignments3 app items5 plugins

In this module, you will create three course recommendation systems using different methods. In lab 1, you will create a course recommendation system based on user profile and course genre matrices by computing an interest score for each course and recommend the courses with the highest interest scores. In the second lab, you will generate a course similarity matrix to create the recommendation system. In the third lab, you will implement a clustering-based recommender system algorithm using K-means clustering and principal component analysis based on group members’ course enrollment history. In labs four and five you will use collaborative filtering to make predictions about a user’s interest based on a collection of other users’ similar preferences. In lab 4, you will perform KNN-based collaborative filtering and in lab 5, you will use non-negative matrix factorization.

What's included

1 video2 assignments3 app items2 plugins

In this module, you will predict course ratings using neural networks. In the first lab, you will train neural networks to predict course ratings while simultaneously extracting users' and items' latent features. In lab 2, you will be given course interaction feature vectors as input data. Using regression analysis, you will calculate numerical rating scores that predict whether a student will audit or complete a course. Lab 3 is similar to lab 2 but instead of using regression you will use a classification model. You will extract user and item embedding feature vectors from a neural network. With those embedding feature vectors, you will create an interaction feature vector and use that to build a classification model. The model maps the interaction feature vector to a rating mode that predicts whether a learner will audit or complete a course.

What's included

1 video2 assignments5 app items1 plugin

In this module, you will review guidelines and best practices for creating successful reports. As well you may wish to review instructions on creating PowerPoint presentations and how to save a PowerPoint as a PDF.

What's included

2 videos4 plugins

In this final module, you will be introduced to Streamlit and have the opportunity to build a Streamlit app to showcase your work in previous modules. You will complete your submission of screenshots from the hands-on labs for your peers to review. Once you have completed your submission you will then review the submission of one of your peers and grade their submission.

What's included

3 readings1 peer review4 plugins

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
5.0 (47 ratings)
Yan Luo
IBM
7 Courses379,092 learners
Artem Arutyunov
IBM
1 Course20,880 learners

Offered by

IBM

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.7

168 reviews

  • 5 stars

    78.48%

  • 4 stars

    16.27%

  • 3 stars

    1.16%

  • 2 stars

    1.16%

  • 1 star

    2.90%

Showing 3 of 168

AB
4

Reviewed on Mar 23, 2024

SD
4

Reviewed on Oct 19, 2025

GV
5

Reviewed on Jun 26, 2025

Explore more from Data Science

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.