Transform raw data into valuable insights using R's powerful tidyverse tools. This beginner-friendly course introduces you to essential data cleaning and manipulation techniques, making complex data tasks approachable and practical. Learn how to clean messy data, handle missing values, and prepare datasets for analysis using Microsoft's development environment and AI assistance.



Data Manipulation and Cleaning in R
This course is part of Microsoft R Programming for Everyone Professional Certificate

Instructor: Microsoft
Access provided by Chula Engineering
Recommended experience
Skills you'll gain
Details to know

Add to your LinkedIn profile
9 assignments
June 2025
See how employees at top companies are mastering in-demand skills

Build your Software Development expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Microsoft

There are 5 modules in this course
In this module, you'll get hands-on experience with dplyr, the powerhouse package for data manipulation in R. We'll work with real retail sales data as you learn to filter, arrange, and transform your data with ease. By the end of this module, you'll be confidently writing clean, efficient code using the pipe operator and essential dplyr functions that professional data analysts use daily.
What's included
4 videos9 readings2 assignments3 ungraded labs2 plugins
Data rarely comes in the perfect format we need - and that's exactly what we'll tackle in this module. Using tidyr, you'll learn to reshape data like a pro, converting between wide and long formats, and handling complex data structures. Through practical exercises with regional sales data, you'll master the tools needed to transform messy data into clean, analysis-ready formats.
What's included
3 videos4 readings2 assignments2 ungraded labs1 plugin
Text data can be particularly challenging. In this module, you'll work with stringr to clean and standardize text data effectively. Using real product descriptions and customer data, you'll learn pattern matching and advanced string manipulation techniques that make text data cleaning a breeze. You'll see how combining stringr with dplyr creates robust solutions for complex data cleaning challenges.
What's included
2 videos7 readings2 assignments3 ungraded labs2 plugins
In this module, you'll learn approaches to handling missing values, outliers, and duplicates. Working with actual order and inventory data, you'll develop strategies for maintaining data quality. You'll discover how modern AI tools can help automate your cleaning processes, making your work more efficient and consistent.
What's included
4 videos8 readings3 assignments3 ungraded labs1 plugin
The comprehensive project simulates a real-world data cleaning scenario where you'll act as a data specialist tasked with standardizing a critical organizational dataset. You'll apply all the key skills learned throughout the course in a structured, step-by-step approach.
What's included
5 readings1 programming assignment2 ungraded labs1 plugin
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Why people choose Coursera for their career









