This course introduces the key steps involved in the data mining pipeline, including data understanding, data preprocessing, data warehousing, data modeling, interpretation and evaluation, and real-world applications.



Data Mining Pipeline
This course is part of the Data Mining Foundations and Practice Specialization

Instructor: Qin (Christine) Lv
11,202 already enrolled
(103 reviews)
What you'll learn
- Identify the key components of the data mining pipeline and describe how they're related. 
- Identify particular challenges presented by each component of the data mining pipeline. 
- Apply techniques to address challenges in each component of the data mining pipeline. 

There are 4 modules in this course
This week introduces the Data Mining Specialization and this course, Data Mining Pipeline. As you begin, you will be introduced to the four views of data mining and the key components of the data mining pipeline.
What's included
8 videos, 6 readings, 2 peer reviews, 1 discussion prompt
This week covers data understanding by identifying key data properties and applying techniques to characterize different datasets.
What's included
6 videos, 1 programming assignment
This week explains why data preprocessing is needed and what techniques can be used to preprocess data.
What's included
6 videos, 1 programming assignment
This week covers the key characteristics of data warehousing and the techniques to support data warehousing.
What's included
4 videos, 1 programming assignment
Build toward a degree
This course is part of the following degree program(s) offered by the University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree, and your progress can transfer with you.
Learner reviews
103 reviews
- 5 stars: 62.13%
- 4 stars: 8.73%
- 3 stars: 5.82%
- 2 stars: 4.85%
- 1 star: 18.44%
Reviewed on Oct 1, 2023
This course was recently updated. I feel it's much better than the prior version. The videos are easier to follow, and the assignments are cleaned up as well.