University of Illinois Urbana-Champaign

Pattern Discovery in Data Mining

This course is part of Data Mining Specialization

Jiawei Han

Instructor: Jiawei Han

Access provided by The National Institute of Engineering

43,738 already enrolled

Gain insight into a topic and learn the fundamentals.
4.3

(326 reviews)

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
89%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.
4.3

(326 reviews)

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
89%
Most learners liked this course

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

9 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Data Mining Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 5 modules in this course

The course orientation will get you familiar with the course, your instructor, your classmates, and our learning environment.

What's included

1 video3 readings1 assignment1 discussion prompt1 plugin

Module 1 consists of two lessons. Lesson 1 covers the general concepts of pattern discovery. This includes the basic concepts of frequent patterns, closed patterns, max-patterns, and association rules. Lesson 2 covers three major approaches for mining frequent patterns. We will learn the downward closure (or Apriori) property of frequent patterns and three major categories of methods for mining frequent patterns: the Apriori algorithm, the method that explores vertical data format, and the pattern-growth approach. We will also discuss how to directly mine the set of closed patterns.

What's included

9 videos2 readings2 assignments1 programming assignment

Module 2 covers two lessons: Lessons 3 and 4. In Lesson 3, we discuss pattern evaluation and learn what kind of interesting measures should be used in pattern analysis. We show that the support-confidence framework is inadequate for pattern evaluation, and even the popularly used lift and chi-square measures may not be good under certain situations. We introduce the concept of null-invariance and introduce a new null-invariant measure for pattern evaluation. In Lesson 4, we examine the issues on mining a diverse spectrum of patterns. We learn the concepts of and mining methods for multiple-level associations, multi-dimensional associations, quantitative associations, negative correlations, compressed patterns, and redundancy-aware patterns.

What's included

9 videos2 readings2 assignments

Module 3 consists of two lessons: Lessons 5 and 6. In Lesson 5, we discuss mining sequential patterns. We will learn several popular and efficient sequential pattern mining methods, including an Apriori-based sequential pattern mining method, GSP; a vertical data format-based sequential pattern method, SPADE; and a pattern-growth-based sequential pattern mining method, PrefixSpan. We will also learn how to directly mine closed sequential patterns. In Lesson 6, we will study concepts and methods for mining spatiotemporal and trajectory patterns as one kind of pattern mining applications. We will introduce a few popular kinds of patterns and their mining methods, including mining spatial associations, mining spatial colocation patterns, mining and aggregating patterns over multiple trajectories, mining semantics-rich movement patterns, and mining periodic movement patterns.

What's included

10 videos2 readings2 assignments

Module 4 consists of two lessons: Lessons 7 and 8. In Lesson 7, we study mining quality phrases from text data as the second kind of pattern mining application. We will mainly introduce two newer methods for phrase mining: ToPMine and SegPhrase, and show frequent pattern mining may be an important role for mining quality phrases in massive text data. In Lesson 8, we will learn several advanced topics on pattern discovery, including mining frequent patterns in data streams, pattern discovery for software bug mining, pattern discovery for image analysis, and pattern discovery and society: privacy-preserving pattern mining. Finally, we look forward to the future of pattern mining research and application exploration.

What's included

9 videos2 readings2 assignments1 programming assignment1 plugin

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Prepare for a degree

Taking this course by University of Illinois Urbana-Champaign may provide you with a preview of the topics, materials and instructors in a related degree program which can help you decide if the topic or university is right for you.

 

Instructor

Instructor ratings
4.3 (14 ratings)
Jiawei Han
University of Illinois Urbana-Champaign
4 Courses70,227 learners

Offered by

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.3

326 reviews

  • 5 stars

    56.74%

  • 4 stars

    23.92%

  • 3 stars

    11.96%

  • 2 stars

    3.98%

  • 1 star

    3.37%

Showing 3 of 326

RM
4

Reviewed on Mar 4, 2017

JG
4

Reviewed on Nov 10, 2017

VB
4

Reviewed on Aug 8, 2019

Explore more from Data Science