This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data.

Exploratory Data Analysis

Exploratory Data Analysis
This course is part of multiple programs.



Instructors: Roger D. Peng, PhD +2 more
184,359 already enrolled
Included with
6,091 reviews
What you'll learn
Understand analytic graphics and the base plotting system in R
Use advanced graphing systems such as the Lattice system
Make graphical displays of very high dimensional data
Apply cluster analysis techniques to locate patterns in data
Skills you'll gain
- Category: Dimensionality Reduction
- Category: Unsupervised Learning
- Category: Data Visualization
- Category: Plot (Graphics)
- Category: Data Visualization Software
- Category: Statistical Analysis
- Category: Graphing
- Category: Data Analysis
- Category: Exploratory Data Analysis
- Category: Statistical Methods
- Category: Statistical Visualization
Tools you'll learn
- Category: R (Software)
- Category: R Programming
- Category: Ggplot2
Details to know

Add to your LinkedIn profile
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 4 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors

Offered by

Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Learner reviews
- 5 stars
74.34%
- 4 stars
21.09%
- 3 stars
3.38%
- 2 stars
0.73%
- 1 star
0.44%
Showing 3 of 6091
Reviewed on Mar 8, 2017
When it comes to hierarchical and K-means clustering, the theory wasn't explained clearly. When do we use U and V for what purpose? How does D come in? I'm left confused after this.
Reviewed on Oct 31, 2019
Good introduction. The swirl exercises kind of reproduce the lectures though- felt like it might not have been the most efficient use of time to go over the exact same example again.
Reviewed on May 23, 2019
Amazing! Learing so much how to explore the data for the first time. This is a must do for anyone who wants to be a data scientist. Now I can use ggplot without any trouble. Thanks!
Frequently asked questions
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.
