When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 4 modules in this course
This course covers the core techniques used in data mining, including frequent pattern analysis, classification, clustering, outlier analysis, as well as mining complex data and research frontiers in the data mining field.
This course can be taken for academic credit as part of CU Boulder’s MS in Data Science or MS in Computer Science degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more:
MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder
MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder
Course logo image courtesy of Lachlan Cormie, available here on Unsplash: https://unsplash.com/photos/jbJp18srifE
This week starts with an overview of this course, Data Mining Methods, then focuses on frequent pattern analysis, including the Apriori algorithm and FP-growth algorithm for frequent itemset mining, as well as association rules and correlation analysis.
Example: Monotonic and Anti-monotonic Constraints•10 minutes
Example: Lift Correlation•7 minutes
Example: X^2 Correlation•7 minutes
6 readings•Total 46 minutes
Course Updates and Accessibility Support•1 minute
Earn Academic Credit for Your Work! •10 minutes
Course Support•10 minutes
About This Course•10 minutes
Assessment Expectations•5 minutes
AI Citation and Acknowledgement•10 minutes
1 assignment•Total 5 minutes
AI Policy Quiz•5 minutes
1 programming assignment•Total 240 minutes
Frequent Pattern Analysis•240 minutes
1 discussion prompt•Total 10 minutes
Introduce Yourself!•10 minutes
Classification
Module 2•6 hours to complete
Module details
This week introduces supervised learning, classification, prediction, and covers several core classification methods including decision tree induction, Bayesian classification, support vector machines, neural networks, and ensemble methods. It also discusses classification model evaluation and comparison.
What's included
9 videos1 programming assignment
Show info about module content
9 videos•Total 126 minutes
Introduction to Classification •15 minutes
Decision Tree Induction, Example•20 minutes
Bayesian Classification, Example•17 minutes
Example: Decision Tree Induction Classification •10 minutes
Example: Bayesian Classification•9 minutes
Support Vector Machines•9 minutes
Neural Network •12 minutes
Ensemble, Model Evaluation•20 minutes
Model Selection•14 minutes
1 programming assignment•Total 240 minutes
Classification •240 minutes
Clustering
Module 3•6 hours to complete
Module details
This week introduces you to unsupervised learning, clustering, and covers several core clustering methods including partitioning, hierarchical, grid-based, density-based, and probabilistic clustering. Advanced topics for high-dimensional clustering, bi-clustering, graph clustering, and constraint-based clustering are also discussed.
What's included
8 videos1 reading1 programming assignment
Show info about module content
8 videos•Total 106 minutes
Introduction to Clustering•12 minutes
Partitioning Methods•16 minutes
Hierarchical and Grid Based Clustering•16 minutes
Density-Based Clustering•10 minutes
Probabilistic Clustering•11 minutes
EM Clustering•11 minutes
High Dimensional, Bi-Clustering, Graph Clustering•17 minutes
Constraint Based Clustering•13 minutes
1 reading•Total 10 minutes
EM Clustering: Further Explanation •10 minutes
1 programming assignment•Total 240 minutes
Clustering•240 minutes
Outlier Analysis
Module 4•5 hours to complete
Module details
This week discusses three different types of outliers (global, contextual, and collective) and how different methods may be used to identify and analyze such outliers. It also covers some advanced methods for mining complex data, as well as the research frontiers of the data mining field.
What's included
8 videos1 peer review
Show info about module content
8 videos•Total 110 minutes
Types of Outliers•13 minutes
Anomaly Detection Methods 1•16 minutes
Anomaly Detection Methods 2•16 minutes
Anomaly Detection Examples •13 minutes
Sequence and Time Series Data•14 minutes
Graph and Online Social Network Data•10 minutes
Web Data, KDD Conference •14 minutes
Data Mining Research Frontiers•14 minutes
1 peer review•Total 180 minutes
Peer Review: Outlier Analysis, Research Frontiers•180 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Build toward a degree
This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹
View eligible degrees
Build toward a degree
This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹
¹Successful application and enrollment are required. Eligibility requirements apply. Each institution determines the number of credits recognized by completing this content that may count towards degree requirements, considering any existing credits you may have. Click on a specific course for more information.
OK
Instructor
Instructor ratings
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
CU Boulder is a dynamic community of scholars and learners on one of the most spectacular college campuses in the country. As one of 34 U.S. public institutions in the prestigious Association of American Universities (AAU), we have a proud tradition of academic excellence, with five Nobel laureates and more than 50 members of prestigious academic academies.
A cross-listed course is offered under two or more CU Boulder degree programs on Coursera. For example, Dynamic Programming, Greedy Algorithms is offered as both CSCA 5414 for the MS-CS and DTSA 5503 for the MS-DS.
· You may not earn credit for more than one version of a cross-listed course.
· You can identify cross-listed courses by checking your program’s student handbook.
· Your transcript will be affected. Cross-listed courses are considered equivalent when evaluating graduation requirements. However, we encourage you to take your program's versions of cross-listed courses (when available) to ensure your CU transcript reflects the substantial amount of coursework you are completing directly in your home department. Any courses you complete from another program will appear on your CU transcript with that program’s course prefix (e.g., DTSA vs. CSCA).
· Programs may have different minimum grade requirements for admission and graduation. For example, the MS-DS requires a C or better on all courses for graduation (and a 3.0 pathway GPA for admission), whereas the MS-CS requires a B or better on all breadth courses and a C or better on all elective courses for graduation (and a B or better on each pathway course for admission). All programs require students to maintain a 3.0 cumulative GPA for admission and graduation.
Can I take cross-listed courses to fulfill my degree requirements?
Yes. Cross-listed courses are considered equivalent when evaluating graduation requirements. You can identify cross-listed courses by checking your program’s student handbook.
How do I upgrade and earn credit from CU Boulder?
You may upgrade and pay tuition during any open enrollment period to earn graduate-level CU Boulder credit for << this course/ courses in this specialization>>. Because << this course is / these courses are >> cross listed in both the MS in Computer Science and the MS in Data Science programs, you will need to determine which program you would like to earn the credit from before you upgrade.
MS in Data Science (MS-DS) Credit: To upgrade to the for-credit data science (DTSA) version of << this course / these courses >>, use the MS-DS enrollment form. See How It Works.
MS in Computer Science (MS-CS) Credit: To upgrade to the for-credit computer science (CSCA) version of << this course / these courses >>, use the MS-CS enrollment form. See How It Works.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.