When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 5 modules in this course
Learn the general concepts of data mining along with basic methodologies and applications. Then dive into one subfield in data mining: pattern discovery. Learn in-depth concepts, methods, and applications of pattern discovery in data mining. We will also introduce methods for data-driven phrase mining and some interesting applications of pattern discovery. This course provides you the opportunity to learn skills and content to practice and engage in scalable pattern discovery methods on massive transactional data, discuss pattern evaluation measures, and study methods for mining diverse kinds of patterns, sequential patterns, and sub-graph patterns.
The course orientation will get you familiar with the course, your instructor, your classmates, and our learning environment.
Welcome! Please tell us about yourself.•15 minutes
Module 1
5 hours to complete
Module details
Module 1 consists of two lessons. Lesson 1 covers the general concepts of pattern discovery. This includes the basic concepts of frequent patterns, closed patterns, max-patterns, and association rules. Lesson 2 covers three major approaches for mining frequent patterns. We will learn the downward closure (or Apriori) property of frequent patterns and three major categories of methods for mining frequent patterns: the Apriori algorithm, the method that explores vertical data format, and the pattern-growth approach. We will also discuss how to directly mine the set of closed patterns.
1.1. What Is Pattern Discovery? Why Is It Important?•3 minutes
1.2. Frequent Patterns and Association Rules•6 minutes
1.3. Compressed Representation: Closed Patterns and Max-Patterns•7 minutes
2.1. The Downward Closure Property of Frequent Patterns•4 minutes
2.2. The Apriori Algorithm•6 minutes
2.3. Extensions or Improvements of Apriori•8 minutes
2.4. Mining Frequent Patterns by Exploring Vertical Data Format•4 minutes
2.5. FPGrowth: A Pattern Growth Approach•8 minutes
2.6. Mining Closed Patterns•4 minutes
2 readings•Total 20 minutes
Lesson 1 Overview•10 minutes
Lesson 2 Overview•10 minutes
2 assignments•Total 60 minutes
Lesson 1 Quiz•30 minutes
Lesson 2 Quiz•30 minutes
1 programming assignment•Total 180 minutes
Frequent Itemset Mining Using Apriori•180 minutes
Module 2
2 hours to complete
Module details
Module 2 covers two lessons: Lessons 3 and 4. In Lesson 3, we discuss pattern evaluation and learn what kind of interesting measures should be used in pattern analysis. We show that the support-confidence framework is inadequate for pattern evaluation, and even the popularly used lift and chi-square measures may not be good under certain situations. We introduce the concept of null-invariance and introduce a new null-invariant measure for pattern evaluation. In Lesson 4, we examine the issues on mining a diverse spectrum of patterns. We learn the concepts of and mining methods for multiple-level associations, multi-dimensional associations, quantitative associations, negative correlations, compressed patterns, and redundancy-aware patterns.
What's included
9 videos2 readings2 assignments
Show info about module content
9 videos•Total 47 minutes
3.1. Limitation of the Support-Confidence Framework•3 minutes
3.2. Interestingness Measures: Lift and χ2•6 minutes
3.3. Null Invariance Measures•5 minutes
3.4. Comparison of Null-Invariant Measures•8 minutes
Module 3 consists of two lessons: Lessons 5 and 6. In Lesson 5, we discuss mining sequential patterns. We will learn several popular and efficient sequential pattern mining methods, including an Apriori-based sequential pattern mining method, GSP; a vertical data format-based sequential pattern method, SPADE; and a pattern-growth-based sequential pattern mining method, PrefixSpan. We will also learn how to directly mine closed sequential patterns. In Lesson 6, we will study concepts and methods for mining spatiotemporal and trajectory patterns as one kind of pattern mining applications. We will introduce a few popular kinds of patterns and their mining methods, including mining spatial associations, mining spatial colocation patterns, mining and aggregating patterns over multiple trajectories, mining semantics-rich movement patterns, and mining periodic movement patterns.
What's included
10 videos2 readings2 assignments
Show info about module content
10 videos•Total 56 minutes
5.1. Sequential Pattern and Sequential Pattern Mining•7 minutes
6.3. Mining and Aggregating Patterns over Multiple Trajectories•10 minutes
6.4. Mining Semantics-Rich Movement Patterns•3 minutes
6.5. Mining Periodic Movement Patterns•7 minutes
2 readings•Total 20 minutes
Lesson 5 Overview•10 minutes
Lesson 6 Overview•10 minutes
2 assignments•Total 60 minutes
Lesson 5 Quiz•30 minutes
Lesson 6 Quiz•30 minutes
Week 4
6 hours to complete
Module details
Module 4 consists of two lessons: Lessons 7 and 8. In Lesson 7, we study mining quality phrases from text data as the second kind of pattern mining application. We will mainly introduce two newer methods for phrase mining: ToPMine and SegPhrase, and show frequent pattern mining may be an important role for mining quality phrases in massive text data. In Lesson 8, we will learn several advanced topics on pattern discovery, including mining frequent patterns in data streams, pattern discovery for software bug mining, pattern discovery for image analysis, and pattern discovery and society: privacy-preserving pattern mining. Finally, we look forward to the future of pattern mining research and application exploration.
7.1. From Frequent Pattern Mining to Phrase Mining•4 minutes
7.2. Previous Phrase Mining Methods•11 minutes
7.3. ToPMine: Phrase Mining without Training Data•12 minutes
7.4. SegPhrase: Phrase Mining with Tiny Training Sets•14 minutes
8.1. Frequent Pattern Mining in Data Streams•20 minutes
8.2. Pattern Discovery for Software Bug Mining•12 minutes
8.3. Pattern Discovery for Image Analysis•6 minutes
8.4. Advanced Topics on Pattern Discovery: Pattern Mining and Society—Privacy Issue•14 minutes
8.5. Advanced Topics on Pattern Discovery: Looking Forward•5 minutes
2 readings•Total 20 minutes
Lesson 7 Overview•10 minutes
Lesson 8 Overview•10 minutes
2 assignments•Total 60 minutes
Lesson 7 Quiz•30 minutes
Lesson 8 Quiz•30 minutes
1 programming assignment•Total 180 minutes
Mining Contiguous Sequential Patterns in Text•180 minutes
1 plugin•Total 15 minutes
How was the course?•15 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Prepare for a degree
Taking this course by University of Illinois Urbana-Champaign may provide you with a preview of the topics, materials and instructors in a related degree program which can help you decide if the topic or university is right for you.
View eligible degrees
Prepare for a degree
Taking this course by University of Illinois Urbana-Champaign may provide you with a preview of the topics, materials and instructors in a related degree program which can help you decide if the topic or university is right for you.
The University of Illinois at Urbana-Champaign is a world leader in research, teaching and public engagement, distinguished by the breadth of its programs, broad academic excellence, and internationally renowned faculty and alumni. Illinois serves the world by creating knowledge, preparing students for lives of impact, and finding solutions to critical societal needs.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.