This is the fourth course in the Google Data Analytics Certificate. In this course, you’ll continue to build your understanding of data analytics and the concepts and tools that data analysts use in their work. You’ll learn how to check and clean your data using spreadsheets and SQL, as well as how to verify and report your data cleaning results. Current Google data analysts will continue to instruct and provide you with hands-on ways to accomplish common data analyst tasks with the best tools and resources.



Process Data from Dirty to Clean
This course is part of Google Data Analytics Professional Certificate

Instructor: Google Career Certificates
Access provided by Capgemini
858,088 already enrolled
(18,529 reviews)
Recommended experience
What you'll learn
Define different types of data integrity and identify risks to data integrity.
Apply basic SQL functions to clean string variables in a database.
Develop basic SQL queries for use on databases.
Describe the process of verifying data cleaning results.
Skills you'll gain
Details to know

Add to your LinkedIn profile
21 assignments
See how employees at top companies are mastering in-demand skills

Build your Data Analysis expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Google

There are 6 modules in this course
Data integrity is critical to successful analysis. In this part of the course, you’ll explore methods and steps that analysts take to check their data for integrity. This includes knowing what to do when you don’t have enough data. You’ll also learn about random samples and understand how to avoid sampling bias. All of these methods will also help you ensure your analysis is successful.
What's included
8 videos10 readings6 assignments
Every data analyst wants to analyze clean data. In this part of the course, you’ll learn the difference between clean and dirty data. Then, you’ll practice cleaning data in spreadsheets and other tools.
What's included
10 videos10 readings6 assignments1 plugin
Knowing a variety of ways to clean data can make a data analyst’s job much easier. In this part of the course, you’ll use SQL to clean data from databases. In particular, you’ll explore how SQL queries and functions can be used to clean and transform your data before an analysis.
What's included
9 videos7 readings5 assignments1 plugin
When you clean data, you make changes to the original dataset. It’s important to verify the changes you make are accurate and to let your teammates know about the changes. In this part of the course, you’ll learn to verify that data is clean and report your data cleaning results. With verified clean data, you’re ready to begin analyzing!
What's included
6 videos5 readings4 assignments
Creating an effective resume will help you in your data analytics career. In this part of the course, you’ll learn all about the job application process. Your focus will be on building a resume that highlights your strengths and relevant experience.
What's included
3 videos2 readings
Review the course glossary and prepare for the next course in the Google Data Analytics Certificate program.
What's included
1 video3 readings
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career




Learner reviews
18,529 reviews
- 5 stars
85.04%
- 4 stars
12.32%
- 3 stars
1.83%
- 2 stars
0.42%
- 1 star
0.37%
Showing 3 of 18529
Reviewed on Oct 27, 2023
Fun, concise, and on point course walking new folks through (or a great review for not so new folks) the process of identification, basic change management, and reporting for dataset validation
Reviewed on Jul 7, 2025
Great way of teaching, her lectures were outstaning and engaging, understood each and every concepts very clearly. Thank you Google and Coursera team for making us to interact with such personality...
Reviewed on Oct 26, 2021
Good content overall. However it will be nice to have the glossary for each week not mixed up with those from previous as it makes it hard to navigate and know which new words need to be learnt
Explore more from Data Science
Johns Hopkins University