This Machine Learning Capstone course uses various Python-based machine learning libraries, such as Pandas, sci-kit-learn, and Tensorflow/Keras. You will also learn to apply your machine-learning skills and demonstrate your proficiency in them. Before taking this course, you must complete all the previous courses in the IBM Machine Learning Professional Certificate.



Machine Learning Capstone
This course is part of IBM Machine Learning with Python & Scikit-learn Professional Certificate


Instructors: Yan Luo
Access provided by EDGE Group
20,882 already enrolled
(168 reviews)
Recommended experience
What you'll learn
- Compare and contrast different machine learning algorithms by creating recommender systems in Python 
- Predict course ratings by training a neural network and constructing regression and classification models 
- Create recommendation systems by applying your knowledge of KNN, PCA, and non-negative matrix collaborative filtering 
- Develop a final presentation and evaluate your peers’ projects 
Skills you'll gain
- Exploratory Data Analysis
- Python Programming
- Data Analysis
- Unsupervised Learning
- Data Presentation
- Machine Learning
- Machine Learning Algorithms
- Scikit Learn (Machine Learning Library)
- Supervised Learning
- Artificial Neural Networks
- Regression Analysis
- Tensorflow
- Applied Machine Learning
- Keras (Neural Network Library)
Details to know

Add to your LinkedIn profile
See how employees at top companies are mastering in-demand skills

Build your Machine Learning expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from IBM

There are 5 modules in this course
In this module, you will be introduced to the idea of recommender systems. All labs in subsequent modules are based on this concept. You will also be provided with an overview of the capstone project. You will perform exploratory data analysis to find preliminary insights such as data patterns. You will also use it to check assumptions with the help of summary statistics and graphical representations of online course-related data sets such as course titles, course genres, and course enrollments. Next, you will extract a word-count vector called a “bag of words” (BoW) from course titles and descriptions. The BoW feature is probably the simplest but most effective feature characterizing textual data. It is widely used in many textual machine learning tasks. Finally, you will apply the cosine similarity measurement to calculate the course similarity using the extracted BoW feature vectors.
What's included
2 videos2 assignments3 app items5 plugins
In this module, you will create three course recommendation systems using different methods. In lab 1, you will create a course recommendation system based on user profile and course genre matrices by computing an interest score for each course and recommend the courses with the highest interest scores. In the second lab, you will generate a course similarity matrix to create the recommendation system. In the third lab, you will implement a clustering-based recommender system algorithm using K-means clustering and principal component analysis based on group members’ course enrollment history. In labs four and five you will use collaborative filtering to make predictions about a user’s interest based on a collection of other users’ similar preferences. In lab 4, you will perform KNN-based collaborative filtering and in lab 5, you will use non-negative matrix factorization.
What's included
1 video2 assignments3 app items2 plugins
In this module, you will predict course ratings using neural networks. In the first lab, you will train neural networks to predict course ratings while simultaneously extracting users' and items' latent features. In lab 2, you will be given course interaction feature vectors as input data. Using regression analysis, you will calculate numerical rating scores that predict whether a student will audit or complete a course. Lab 3 is similar to lab 2 but instead of using regression you will use a classification model. You will extract user and item embedding feature vectors from a neural network. With those embedding feature vectors, you will create an interaction feature vector and use that to build a classification model. The model maps the interaction feature vector to a rating mode that predicts whether a learner will audit or complete a course.
What's included
1 video2 assignments5 app items1 plugin
In this module, you will review guidelines and best practices for creating successful reports. As well you may wish to review instructions on creating PowerPoint presentations and how to save a PowerPoint as a PDF.
What's included
2 videos4 plugins
In this final module, you will be introduced to Streamlit and have the opportunity to build a Streamlit app to showcase your work in previous modules. You will complete your submission of screenshots from the hands-on labs for your peers to review. Once you have completed your submission you will then review the submission of one of your peers and grade their submission.
What's included
3 readings1 peer review4 plugins
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors


Offered by
Why people choose Coursera for their career




Learner reviews
168 reviews
- 5 stars78.03% 
- 4 stars16.76% 
- 3 stars1.15% 
- 2 stars1.15% 
- 1 star2.89% 
Showing 3 of 168
Reviewed on Apr 9, 2025
I learned so much by completing the machine learning capstone project. I encourage anyone who decides to take this course to explore the deeper nuances of each type of recommender system.
Reviewed on Mar 23, 2024
It was really a quite informative and well planned course. Will continue to get more Professional Certificate from IBM related to ML-DL and Generative AI
Reviewed on Jun 26, 2025
Amazing Project to work on and gain more knowledge of Machine Learning Techniques
Explore more from Data Science
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.





