Johns Hopkins University

Getting and Cleaning Data

Before you can work with data, you have to get some. This course covers the basic ways data can be obtained: from the web, from APIs, from databases, and from colleagues in various formats. It also covers the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course further covers the components of a complete data set, including raw data, processing instructions, codebooks, and processed data. Together, these are the basics needed for collecting, cleaning, and sharing data.
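
As a rough illustration of what “tidy” means in the description above, the sketch below reshapes a small wide table into one row per observation. It is not taken from the course materials (which use R); the data, the column names, and the `to_tidy` helper are all hypothetical and only assume the common definition of tidy data, where each variable is a column and each observation is a row.

```python
# Hypothetical "wide" table: one row per patient, one column per visit.
wide_rows = [
    {"patient": "A", "visit_1": 5.1, "visit_2": 4.8},
    {"patient": "B", "visit_1": 6.0, "visit_2": 5.7},
]


def to_tidy(rows, id_key, var_name, value_name):
    """Return one row per (id, variable) pair, i.e. one observation per row."""
    tidy = []
    for row in rows:
        for column, value in row.items():
            if column == id_key:
                continue  # the identifier is repeated on every output row
            tidy.append({id_key: row[id_key], var_name: column, value_name: value})
    return tidy


tidy_rows = to_tidy(wide_rows, id_key="patient", var_name="visit", value_name="score")
for tidy_row in tidy_rows:
    print(tidy_row)
```

In the long form, filtering or grouping by visit needs no knowledge of how many visit columns the original table had, which is one reason tidy layouts tend to simplify later analysis.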

Status: Data Processing
Status: Data Import/Export
Course, 20 hours

Featured reviews

RR

4.0 Reviewed Jul 9, 2017

I found the last project insufficiently explained. I struggled to understand what the task was. A clearer task description (as in Course 2) would be really appreciated.

XX

4.0 Reviewed Aug 14, 2018

The Swirl practice part is great! But there is a big gap between what we learned from the videos and Swirl and the course project! The project is much harder than what I learned from the course.

NA

5.0 Reviewed Jun 7, 2020

A very useful course. The audio quality of some lectures (especially those by the main instructor) was not good. This course completes the sister course of R programming and they work together.

HS

5.0 Reviewed May 2, 2020

This course provides an introduction to some important concepts and tools for a very important aspect of data science: cleaning and organizing data before any analysis. A must for any data scientist.

AB

5.0 Reviewed Oct 15, 2017

This course is very enlightening. The techniques demonstrated in this course are critical for gathering raw data from various sources and turning it into useful data for analysis.

AC

5.0 Reviewed Jan 1, 2019

It was pretty hard for someone like me who has a weakness in programming but it provided sufficient exposure and tasks for me to learn within my capabilities. I did enjoy its challenges.

AG

4.0 Reviewed May 7, 2020

The course was very helpful & guided but since I don't have a strong coding background I felt myself getting lost often. It would be really helpful if there is some guidance in assignments.

AT

4.0 Reviewed Nov 19, 2017

Very interesting, and I enjoyed doing the assignment, but the assignment instructions are not clear. A lot of time was wasted trying to figure out which data is which and what we are interested in.

NK

4.0 Reviewed May 18, 2020

The 'cleaning data' part was explained pretty well... I do feel he could've gone into more detail for the 'gathering data' part, especially the web scraping part. Other than that, great course!

SB

5.0 Reviewed Mar 16, 2018

A very informative and interesting course. I have learned much about cleaning data and getting it from different sources. Finally, thanks to the Coursera team for giving us the opportunity.

TW

4.0 Reviewed May 4, 2016

The assignment was excellent but challenging. The instructions weren't too clear or obvious, though, and I struggled for a while (the good kind of struggle). The lecture was okay, but the Swirl part is always fantastic.

KS

5.0 Reviewed May 21, 2019

I really liked this course and believe that my work, although seemingly noob-ish, will get much better as I see others' work from the peer review and the examples noted in the lessons.

All reviews

Showing: 20 of 1,316

William Stewart
2.0
Reviewed Feb 4, 2018
T M
1.0
Reviewed Feb 1, 2019
Matt Kerns
1.0
Reviewed Jul 17, 2018
Sebastián Lucas
2.0
Reviewed Jan 12, 2018
Bhawesh Singhania
2.0
Reviewed Apr 4, 2019
Mohammad Amir Aghaee
3.0
Reviewed May 13, 2019
Thej
1.0
Reviewed Nov 29, 2018
THI A ALLGOOD
1.0
Reviewed Feb 16, 2019
Les Schmidt
2.0
Reviewed Apr 8, 2017
Pietro Pollo
2.0
Reviewed Jan 25, 2019
Javier R Lores Gil
1.0
Reviewed Nov 17, 2018
Kyle Rozic
2.0
Reviewed Jun 1, 2020
Jennifer Sargent
1.0
Reviewed Jul 22, 2020
1.0
Reviewed Apr 9, 2020
Erin Aylsworth
5.0
Reviewed Dec 9, 2019
Mathew Knudson
4.0
Reviewed Dec 29, 2019
Viktor Kusnezh
1.0
Reviewed Mar 23, 2020
Dan Kjeldstrøm Hansen
5.0
Reviewed Feb 2, 2016
Moshe Pilsky
3.0
Reviewed Mar 13, 2019
Akshay Khatter
2.0
Reviewed Apr 9, 2018