Johns Hopkins University

Exploratory Data Analysis

This course is part of multiple programs.

Roger D. Peng, PhD
Jeff Leek, PhD
Brian Caffo, PhD

Instructors: Roger D. Peng, PhD

182,970 already enrolled

Gain insight into a topic and learn the fundamentals.
4.7

(6,085 reviews)

6 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
94%
Most learners liked this course

What you'll learn

  • Understand analytic graphics and the base plotting system in R

  • Use advanced graphing systems such as the Lattice system

  • Make graphical displays of very high-dimensional data

  • Apply cluster analysis techniques to locate patterns in data

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

2 assignments¹

AI-graded (see disclaimer)
Taught in English

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 4 modules in this course

This week covers the basics of analytic graphics and the base plotting system in R. We've also included some background material to help you install R if you haven't done so already.

What's included

15 videos, 6 readings, 1 assignment, 5 programming assignments, 1 peer review

Welcome to Week 2 of Exploratory Data Analysis. This week covers some of the more advanced graphing systems available in R: the Lattice system and the ggplot2 system. While the base graphics system provides many important tools for visualizing data, it was part of the original R system and lacks many features that may be desirable in a plotting system, particularly when visualizing high-dimensional data. The Lattice and ggplot2 systems also simplify plot layout, making it much less tedious.

What's included

7 videos, 1 reading, 1 assignment, 5 programming assignments

Welcome to Week 3 of Exploratory Data Analysis. This week covers some of the workhorse statistical methods for exploratory analysis. These methods include clustering and dimension reduction techniques that allow you to make graphical displays of very high-dimensional data (many, many variables). We also cover novel ways to specify colors in R so that you can use color as an important and useful dimension when making data graphics. All of this material is covered in chapters 9-12 of my book Exploratory Data Analysis with R.

What's included

12 videos, 1 reading, 4 programming assignments

This week, we'll look at two case studies in exploratory data analysis. The first involves the use of cluster analysis techniques, and the second is a more involved analysis of some air pollution data. How one goes about doing EDA is often personal, but I'm providing these videos to give you a sense of how you might proceed with a specific type of dataset.

What's included

2 videos, 2 readings, 1 programming assignment, 1 peer review

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.7 (206 ratings)
Roger D. Peng, PhD
Johns Hopkins University
37 courses, 1,662,358 learners

Offered by

Learner reviews

4.7

6,085 reviews

  • 5 stars: 74.33%

  • 4 stars: 21.09%

  • 3 stars: 3.38%

  • 2 stars: 0.73%

  • 1 star: 0.44%

Showing 3 of 6,085

EK, 5 stars, reviewed on Jun 5, 2020

MA, 5 stars, reviewed on Jul 3, 2018

CB, 5 stars, reviewed on Jan 11, 2017

¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.