Do I need to take the courses in a specific order?

It is recommended that the courses in the Specialization be taken in the order outlined. In the Capstone Project, you will have the opportunity to synthesize your learning in all the courses and apply your combined skills in a final project.

Is this course really 100% online? Do I need to attend any classes in person?

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Can I just enroll in a single course?

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Can I take the course for free?

No, you cannot take this course for free. When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you cannot afford the fee, you can apply for financial aid.

Data Mining Specialization

Analyze Text, Discover Patterns, Visualize Data.

Solve real-world data mining challenges.

Instructors: John C. Hart

70,020 already enrolled

6 course series

Get in-depth knowledge of a subject

from 2,951 reviews of courses in this program

Intermediate level

Some related experience required

3 months to complete

at 10 hours a week

What you'll learn

The Data Mining Specialization teaches data mining techniques for both structured data, which conform to a clearly defined schema, and unstructured data, which exist in the form of natural language text. Specific course topics include pattern discovery, clustering, text retrieval, text mining and analytics, and data visualization. The Capstone project task is to solve real-world data mining challenges using a restaurant review data set from Yelp.

Courses 2 - 5 of this Specialization form the lecture component of courses in the online Master of Computer Science Degree in Data Science. You can apply to the degree program either before or after you begin the Specialization.

Tools you'll learn

Dashboard

Details to know

Shareable certificate

Add to your LinkedIn profile

Taught in English

Flexible schedule

Learn at your own pace

Advance your subject-matter expertise

Learn in-demand skills from university and industry experts
Master a subject or tool with hands-on projects
Develop a deep understanding of key concepts
Earn a career certificate from University of Illinois Urbana-Champaign

Specialization - 6 course series

Data Visualization

Course 1, 16 hours

What you'll learn

This course will teach you how to make more effective visualizations of data. Not only will you gain deeper insight into the data, but you will also learn how to better communicate that insight to others. You will learn new ways to display data, applying some fundamental principles of design and human cognition to choose the most effective way to display different kinds of data. This course not only teaches you how to use popular applications like Tableau to connect to data warehouses to extract and visualize relevant data, but also teaches you how Tableau works so you can use the same techniques to make effective data visualizations on your own with any visualization system.

Skills you'll gain

Data Visualization, Data Presentation, Dashboard, Graphing, Data Mapping, Tableau Software, Dashboard Creation, Plot (Graphics), Data Visualization Software

Text Retrieval and Search Engines

Course 2, 31 hours

What you'll learn

Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. Text data are unique in that they are usually generated directly by humans rather than a computer system or sensors, and are thus especially valuable for discovering knowledge about people’s opinions and preferences, in addition to many other kinds of knowledge that we encode in text.

This course will cover search engine technologies, which play an important role in any data mining application involving text data for two reasons. First, while the raw data for any particular problem may be large, often only a relatively small subset of the data is relevant, and a search engine is an essential tool for quickly finding that subset in a large text collection. Second, search engines help analysts interpret patterns discovered in the data by letting them examine the relevant original text. You will learn the basic concepts, principles, and major techniques of text retrieval, which is the underlying science of search engines.
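To make the idea of pulling a small relevant subset out of a large collection concrete, here is a minimal sketch of TF-IDF ranking in Python. The toy documents, the `tokenize` and `rank` helpers, and the scoring formula (raw term frequency times a smoothed log inverse document frequency) are illustrative assumptions, not course material or the method of any particular search engine.

```python
import math
from collections import Counter

def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()

def rank(query, docs):
    """Return document indices sorted by descending TF-IDF score for the query."""
    tokenized = [Counter(tokenize(d)) for d in docs]
    n_docs = len(docs)
    scores = []
    for i, counts in enumerate(tokenized):
        score = 0.0
        for term in tokenize(query):
            # Document frequency: how many documents contain the term.
            df = sum(1 for c in tokenized if term in c)
            if df == 0:
                continue
            idf = math.log((n_docs + 1) / df)  # smoothed so idf stays positive
            score += counts[term] * idf
        scores.append((score, i))
    # Highest score first; ties are broken by original document order.
    return [i for score, i in sorted(scores, key=lambda s: (-s[0], s[1]))]

# Made-up review snippets, used only for illustration.
docs = [
    "the pasta was great and the service was friendly",
    "parking was difficult near the restaurant",
    "great pasta great wine",
]
ranking = rank("great pasta", docs)
print(ranking)  # [2, 0, 1]
```

The third document ranks first because it repeats a query term, and the second ranks last because it matches none of them. Production retrieval functions typically also normalize for document length, which this sketch omits.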

Skills you'll gain

Natural Language Processing, Statistical Modeling, Text Mining, Probability & Statistics, Statistical Methods, Applied Machine Learning, Network Analysis, Model Optimization, Machine Learning, Data Engineering, Model Evaluation, Web Scraping, Machine Learning Algorithms, Data Mining, Unstructured Data, Web Analytics and SEO, Big Data, AI Personalization

Text Mining and Analytics

Course 3, 33 hours

What you'll learn

This course will cover the major techniques for mining and analyzing text data to discover interesting patterns, extract useful knowledge, and support decision making, with an emphasis on statistical approaches that can be generally applied to arbitrary text data in any natural language with little or no human effort.

Detailed analysis of text data requires understanding of natural language text, which is known to be a difficult task for computers. However, a number of statistical approaches have been shown to work well for the "shallow" but robust analysis of text data for pattern finding and knowledge discovery. You will learn the basic concepts, principles, and major algorithms in text mining and their potential applications.
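One example of a "shallow" statistical analysis is scoring word associations by how often two words appear in the same document. The sketch below uses pointwise mutual information (PMI) computed from document-level co-occurrence. The choice of PMI, the `pmi_scores` helper, and the toy documents are assumptions made for illustration, not a description of the course's syllabus.

```python
import math
from collections import Counter
from itertools import combinations

def pmi_scores(docs):
    """Map each co-occurring word pair to its PMI, using document-level counts."""
    n = len(docs)
    word_df = Counter()  # number of documents containing each word
    pair_df = Counter()  # number of documents containing both words of a pair
    for doc in docs:
        words = sorted(set(doc.lower().split()))
        word_df.update(words)
        pair_df.update(combinations(words, 2))  # pairs come out alphabetically ordered
    return {
        (a, b): math.log((count / n) / ((word_df[a] / n) * (word_df[b] / n)))
        for (a, b), count in pair_df.items()
    }

# Made-up documents, used only for illustration.
docs = ["spicy curry", "spicy curry rice", "sweet cake", "sweet rice"]
scores = pmi_scores(docs)
```

Here `scores[("curry", "spicy")]` is positive because the two words always occur together, while `scores[("rice", "spicy")]` is zero because they co-occur exactly as often as independence would predict. No linguistic knowledge is needed, which is what lets this kind of analysis transfer across languages.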

Skills you'll gain

Text Mining, Probability & Statistics, Unsupervised Learning, Natural Language Processing, Data Mining, Statistical Analysis, Data Analysis, Unstructured Data, Generative Model Architectures, Classification Algorithms, Probability Distribution, Applied Machine Learning, Statistical Machine Learning, Correlation Analysis, Statistical Methods, Data-Driven Decision-Making, Analytics, Model Optimization

Pattern Discovery in Data Mining

Course 4, 17 hours

What you'll learn

Learn the general concepts of data mining along with basic methodologies and applications. Then dive into one subfield in data mining: pattern discovery. Learn in-depth concepts, methods, and applications of pattern discovery in data mining. We will also introduce methods for data-driven phrase mining and some interesting applications of pattern discovery. This course provides you the opportunity to learn skills and content to practice and engage in scalable pattern discovery methods on massive transactional data, discuss pattern evaluation measures, and study methods for mining diverse kinds of patterns, sequential patterns, and sub-graph patterns.
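As a small illustration of pattern discovery on transactional data, the following sketch finds frequent itemsets with a level-wise, Apriori-style search. The `frequent_itemsets` function, the absolute `min_support` threshold, and the toy baskets are assumptions for illustration. The course description does not say which algorithms are taught, and scalable methods differ substantially from this naive version.

```python
from itertools import combinations

def frequent_itemsets(transactions, min_support):
    """Return {itemset: support count} for itemsets in >= min_support transactions."""
    transactions = [frozenset(t) for t in transactions]
    items = {item for t in transactions for item in t}
    current = [frozenset([item]) for item in items]  # level 1 candidates
    result = {}
    while current:
        # Count support for every candidate at this level.
        counts = {c: sum(1 for t in transactions if c <= t) for c in current}
        frequent = {c: n for c, n in counts.items() if n >= min_support}
        result.update(frequent)
        # Join frequent k-itemsets into (k+1)-candidates, then prune any
        # candidate with an infrequent k-subset (the Apriori property).
        k = len(next(iter(frequent))) + 1 if frequent else 0
        candidates = {a | b for a in frequent for b in frequent if len(a | b) == k}
        current = [
            c for c in candidates
            if all(frozenset(s) in frequent for s in combinations(c, k - 1))
        ]
    return result

# Made-up shopping baskets, used only for illustration.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
patterns = frequent_itemsets(baskets, min_support=2)
```

With a threshold of two transactions, the result contains the three single items plus the pairs {bread, milk} and {bread, butter}. The triple is never counted because its subset {milk, butter} is infrequent, which is the pruning that keeps level-wise search tractable.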

Skills you'll gain

Data Mining, Text Mining, Algorithms, Spatial Data Analysis, Correlation Analysis, Unsupervised Learning, Spatial Analysis, Big Data, Advanced Analytics, Model Evaluation, Information Privacy, Image Analysis

Cluster Analysis in Data Mining

Course 5, 17 hours

What you'll learn

Discover the basic concepts of cluster analysis, and then study a set of typical clustering methodologies, algorithms, and applications. This includes partitioning methods such as k-means, hierarchical methods such as BIRCH, and density-based methods such as DBSCAN/OPTICS. Moreover, learn methods for clustering validation and evaluation of clustering quality. Finally, see examples of cluster analysis in applications.
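For readers new to partitioning methods, here is a minimal sketch of k-means (Lloyd's algorithm) in plain Python. The `kmeans` and `sq_dist` helpers, the naive "first k points" initialization, and the sample points are assumptions chosen to keep the example deterministic. Practical implementations usually use randomized or k-means++ initialization.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, iterations=100):
    """Lloyd's algorithm; returns (centroids, labels)."""
    centroids = [tuple(p) for p in points[:k]]  # naive init: first k points
    labels = None
    for _ in range(iterations):
        # Assignment step: label each point with its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, label in zip(points, labels) if label == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, labels

# Two visually separated groups of made-up 2-D points.
points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5), (1.0, 0.5), (8.5, 9.0)]
centroids, labels = kmeans(points, k=2)
```

Even though both initial centroids start inside the lower-left group, the alternating assignment and update steps separate the two groups within a few iterations on this data. On harder data, a poor initialization can leave k-means in a bad local optimum, which is one reason clustering validation matters.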

Skills you'll gain

Data Mining, Unsupervised Learning, Algorithms, Machine Learning Algorithms, Model Evaluation, Verification And Validation, Statistical Methods, Applied Machine Learning, Machine Learning Methods

Data Mining Project

Course 6, 15 hours

What you'll learn

Note: You should complete all the other courses in this Specialization before beginning this course.

This six-week Project course of the Data Mining Specialization will allow you to apply the algorithms and techniques for data mining learned in the previous courses in the Specialization, including Pattern Discovery, Clustering, Text Retrieval, Text Mining, and Visualization, to solve interesting real-world data mining challenges. Specifically, you will work on a restaurant review data set from Yelp and use all the knowledge and skills you’ve learned from the previous courses to mine this data set to discover interesting and useful knowledge.

The design of the Project emphasizes:

1. simulating the workflow of a data miner in a real job setting;
2. integrating different mining techniques covered in multiple individual courses;
3. experimenting with different ways to solve a problem to deepen your understanding of techniques; and
4. allowing you to propose and explore your own ideas creatively.

The goal of the Project is to analyze and mine a large Yelp review data set to discover useful knowledge to help people make decisions in dining. The project will include the following outputs:

1. Opinion visualization: explore and visualize the review content to understand what people have said in those reviews.
2. Cuisine map construction: mine the data set to understand the landscape of different types of cuisines and their similarities.
3. Discovery of popular dishes for a cuisine: mine the data set to discover the common/popular dishes of a particular cuisine.
4. Recommendation of restaurants to help people decide where to dine: mine the data set to rank restaurants for a specific dish and predict the hygiene condition of a restaurant.

From the perspective of users, a cuisine map can help them understand what cuisines are there and see the big picture of all kinds of cuisines and their relations. Once they decide what cuisine to try, they would be interested in knowing what the popular dishes of that cuisine are and decide what dishes to have. Finally, they will need to choose a restaurant. Thus, recommending restaurants based on a particular dish would be useful. Moreover, predicting the hygiene condition of a restaurant would also be helpful.

By working on these tasks, you will gain experience with a typical workflow in data mining that includes data preprocessing, data exploration, data analysis, improvement of analysis methods, and presentation of results. You will have an opportunity to combine multiple algorithms from different courses to complete a relatively complicated mining task and experiment with different ways to solve a problem to understand the best way to solve it. We will suggest specific approaches, but you are highly encouraged to explore your own ideas since open exploration is, by design, a goal of the Project.

You are required to submit a brief report for each of the tasks for peer grading. A final consolidated report is also required, which will be peer-graded.
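As one possible starting point for the cuisine map task, the sketch below compares cuisines by the cosine similarity of their word-count vectors. The `cosine` helper, the cuisine names, and the review text are all made up for illustration. This is not the Project's prescribed approach, and the text is not drawn from the Yelp data set.

```python
import math
from collections import Counter

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical concatenated review text per cuisine (not real Yelp data).
reviews_by_cuisine = {
    "italian": "pasta pizza tomato basil pasta",
    "mediterranean": "tomato basil olive hummus",
    "japanese": "sushi rice tuna sushi",
}
vectors = {c: Counter(text.split()) for c, text in reviews_by_cuisine.items()}

# Pairwise similarity matrix, stored as a dict keyed by (cuisine, cuisine).
similarity = {
    (a, b): cosine(vectors[a], vectors[b]) for a in vectors for b in vectors
}
```

Cuisines that share vocabulary, such as the first two here, score above zero, while cuisines with no words in common score exactly zero. A matrix like this could then be fed to a clustering or visualization step to draw the map.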

Skills you'll gain

Data Analysis, Unstructured Data, Data Presentation, Model Evaluation, Data Visualization, Text Mining, Technical Writing, Data Visualization Software, Data Mining, Data Preprocessing, Exploratory Data Analysis, Algorithms, Natural Language Processing

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Prepare for a degree

Taking this Specialization by University of Illinois Urbana-Champaign may provide you with a preview of the topics, materials and instructors in a related degree program which can help you decide if the topic or university is right for you.

Instructors

John C. Hart

University of Illinois Urbana-Champaign

8 courses, 153,015 learners

Jiawei Han

University of Illinois Urbana-Champaign

4 courses, 72,832 learners

ChengXiang Zhai

University of Illinois Urbana-Champaign

4 courses, 110,367 learners

Offered by

University of Illinois Urbana-Champaign

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

How long does it take to complete the Specialization?

Time to completion can vary widely based on your schedule. Most learners are able to complete the Specialization in 4-5 months.

How often is each course in the Specialization offered?

Each course in the Specialization is offered on a regular schedule with sessions starting about once per month. If you don't complete a course on the first try, you can easily transfer to the next session, and your completed work and grades will carry over.

What background knowledge is necessary?

Comfortable with computer programming in multiple programming languages
Basic knowledge of probability and statistics

MCS courses in Coursera do not carry University of Illinois credit on their own. Each course has an enhanced for-credit component. You can earn academic credit if you combine an MCS Coursera course with the enhanced for-credit component offered on the University of Illinois platform. Some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

At completion of this Specialization in Data Mining, you will (1) know the basic concepts in pattern discovery and clustering in data mining, information retrieval, text analytics, and visualization, (2) understand the major algorithms for mining both structured and unstructured text data, and (3) be able to apply the learned algorithms to solve real-world data mining problems.