Before you can work with data you have to get some. This course covers the basic ways to obtain data: from the web, from APIs, from databases, and from colleagues in various formats. It also covers the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speeds up downstream data analysis tasks. The course also covers the components of a complete data set, including raw data, processing instructions, codebooks, and processed data. Together, these are the basics needed for collecting, cleaning, and sharing data.
About this Course
What you will learn
Understand common data storage systems
Apply data cleaning basics to make data "tidy"
Use R for text and date manipulation
Obtain usable data from the web, APIs, and databases
Skills you will gain
- Data Manipulation
- Regular Expression (REGEX)
- R Programming
- Data Cleansing
Syllabus - What you will learn from this course
- 5 stars: 67.55%
- 4 stars: 23.62%
- 3 stars: 5.85%
- 2 stars: 1.62%
- 1 star: 1.33%
TOP REVIEWS FROM GETTING AND CLEANING DATA
This course provides an introduction to some important concepts and tools for a very important aspect of data science: cleaning and organizing data before any analysis. A must for any data scientist.
I really liked this course and believe that my work, although seemingly noob-ish, will get much better as I see others' work from the peer review and the examples noted in the lessons.
The Swirl practice part is great! But there is a big gap between what we learned from video/swirl and the course project! The project is much harder than what I learn from the course.
Loved the structure of the course. Learned a lot. The course project seemed a little funky, especially creating the codebook for an already existing set of data, but it was a useful teaching aid.