Packt
Clustering and Classification with Machine Learning in R
Packt

Clustering and Classification with Machine Learning in R

Access provided by Scotiabank

Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Beginner level

Recommended experience

1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Perform basic data pre-processing and wrangling in R Studio.

  • Implement and analyze unsupervised clustering techniques, such as K-means clustering.

  • Implement supervised learning techniques and classification methods, such as Random Forests.

  • Utilize dimensional reduction techniques (PCA) and feature selection.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

12 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 10 modules in this course

In this module, we will introduce the course, outlining the fundamental concepts of clustering and classification in machine learning. We will also guide you through the installation and setup of R and R Studio, ensuring you are prepared to dive into the practical aspects of the course.

What's included

2 videos1 reading1 assignment

In this module, we will explore the different methods to import data into R from various sources. You will learn to read data from CSV and Excel files, unzipped folders, online CSVs, Google Sheets, HTML tables, and databases, setting the foundation for data manipulation and analysis.

What's included

7 videos1 assignment1 plugin

In this module, we will delve into data cleaning and preprocessing, ensuring your data is ready for analysis. You will learn to summarize and explore data using the dplyr package and create visualizations with ggplot2. Additionally, we'll cover methods to evaluate associations between variables and test for correlation.

What's included

11 videos1 assignment1 plugin

In this module, we will explore the differences between machine learning and traditional statistical analysis, providing a theoretical overview of machine learning. You will gain a foundational understanding of machine learning concepts and their relevance to data science.

What's included

2 videos1 assignment1 plugin

In this module, we will cover unsupervised learning techniques, focusing on clustering algorithms. You will learn to implement and evaluate different clustering methods, including K-Means, Fuzzy K-Means, DBSCAN, and more. We'll also discuss how to select the best algorithm for your specific data needs.

What's included

12 videos1 assignment1 plugin

In this module, we will explore techniques for reducing the dimensionality of your data. You will learn the theoretical aspects of dimension reduction and how to apply methods such as PCA, Multidimensional Scaling, and SVD in R to simplify your datasets while preserving essential information.

What's included

5 videos1 assignment1 plugin

In this module, we will focus on feature selection techniques to identify the most relevant predictors for your models. You will learn to remove correlated variables and use methods like LASSO regression, FSelector, and Boruta analysis to select important features, enhancing your model's performance.

What's included

4 videos1 assignment1 plugin

In this module, we will introduce the fundamental concepts of supervised learning. You will learn how to preprocess data for supervised learning and gain insights into various types of supervised learning problems, preparing you for more advanced classification and regression techniques.

What's included

2 videos1 assignment1 plugin

In this module, we will delve into classification techniques in supervised learning. You will learn to implement logistic regression, Decision Trees, Random Forests, and Support Vector Machines (SVM). We will also cover methods to evaluate classification accuracy and understand variable importance in your models.

What's included

18 videos1 assignment1 plugin

In this module, we will provide additional lectures focusing on advanced clustering methods. You will learn about Fuzzy C-Means Clustering, understanding its theoretical underpinnings and practical applications in R, further enhancing your clustering analysis skills.

What's included

1 video3 assignments

Instructor

Packt - Course Instructors
Packt
1,024 Courses241,154 learners

Offered by

Packt

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Explore more from Data Science