Data cleaning courses teach techniques for identifying and correcting errors in datasets, handling missing values, and standardizing data formats. You can build skills in data validation, outlier detection, and transforming raw data into a usable format. Many courses introduce Python libraries such as Pandas and NumPy, along with tools like OpenRefine, that help you carry out data cleaning tasks efficiently and keep data quality high enough for analysis.
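
As a rough illustration of the techniques named above (standardizing formats, handling missing values, and flagging outliers), here is a minimal sketch in plain Python. The function name `clean_records`, the `name` and `amount` fields, and the `mad_factor` threshold are all hypothetical choices made for this example and are not taken from any listed course. The outlier rule (distance from the median compared with the median absolute deviation) is one common option among several.

```python
from statistics import median


def clean_records(records, mad_factor=5.0):
    """Standardize names, fill missing amounts, and flag outliers.

    `records` is a list of dicts with hypothetical "name" and "amount" keys.
    """
    # Pass 1: standardize text and parse numbers; unparseable values become None.
    parsed = []
    for rec in records:
        name = rec.get("name")
        # Collapse repeated whitespace and apply consistent capitalization.
        name = " ".join(name.split()).title() if name else None
        try:
            amount = float(rec.get("amount"))
        except (TypeError, ValueError):
            amount = None  # missing or malformed value
        parsed.append({"name": name, "amount": amount})

    # Pass 2: compute a robust center and spread from the known values only.
    known = [r["amount"] for r in parsed if r["amount"] is not None]
    center = median(known) if known else None
    mad = median([abs(x - center) for x in known]) if known else 0.0

    cleaned = []
    for r in parsed:
        was_missing = r["amount"] is None
        # Fill missing values with the median of the known values.
        amount = center if was_missing else r["amount"]
        # Flag observed values that sit far from the median.
        is_outlier = (
            not was_missing
            and mad > 0
            and abs(amount - center) > mad_factor * mad
        )
        cleaned.append(
            {
                "name": name_or_none(r["name"]),
                "amount": amount,
                "was_missing": was_missing,
                "is_outlier": is_outlier,
            }
        )
    return cleaned


def name_or_none(name):
    """Return the name unchanged; kept separate so the rule is easy to extend."""
    return name
```

Calling `clean_records` on a handful of rows returns one dict per input row, with the filled value and two flags, so that imputed and suspicious entries stay visible for later review instead of being silently changed. Libraries such as Pandas offer comparable operations on whole columns.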

Skills you'll gain: Dataflow, Data Pipelines, Apache Kafka, Real Time Data, Data Processing, Pandas (Python Package), Data Transformation, SQL, Jupyter, Google Cloud Platform, Analytics, Cloud Storage
Advanced · Course · 1 - 3 Months

Skills you'll gain: Microsoft Power Platform, Business Process Automation, Microsoft 365, Invoicing, No-Code Development, Application Design, Document Management, Data Integration
Beginner · Guided Project · Less Than 2 Hours

Google Cloud
Skills you'll gain: Google Sheets, Data Mapping, Data Integration, Database Application, Cloud Applications, No-Code Development, Relational Databases, Google Cloud Platform, Data Management
Beginner · Project · Less Than 2 Hours

Skills you'll gain: Apache Kafka, Real Time Data, Data Pipelines, JSON, Java, Docker (Software), Software Versioning
Intermediate · Course · 3 - 6 Months

École Polytechnique Fédérale de Lausanne
Skills you'll gain: Apache Spark, Scala Programming, Big Data, Data Manipulation, Distributed Computing, Data Processing, Performance Tuning, SQL, Programming Principles, Data Storage Technologies
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Azure Synapse Analytics, Data Security, Databricks, Data Quality, Microsoft Azure, Data Storage Technologies, Data Storage, Data Transformation, Database Management, Data Processing, Data Cleansing, Data Lakes, Query Languages, Performance Tuning, Data Integration, Data Pipelines, Apache Spark, Encryption, Data Analysis, Real Time Data
Intermediate · Course · 1 - 4 Weeks

University of Illinois Urbana-Champaign
Skills you'll gain: Big Data, Apache Spark, Apache Hadoop, Apache Mahout, Distributed Computing, Data Storage, Data Processing, NoSQL, Apache Kafka, Cloud Computing, Real Time Data, Databases, Analytics, Deep Learning, Scalability, Machine Learning Algorithms, Graph Theory, Machine Learning
Mixed · Course · 1 - 3 Months

Skills you'll gain: Predictive Modeling, Data Preprocessing, Django (Web Framework), Data Visualization, Model Evaluation, Machine Learning Methods, Feature Engineering, Programming Principles, Databases, Game Design, Development Environment, Data Science, Web Applications, Animation and Game Design, Application Frameworks, Scripting, Scripting Languages, Software Design Patterns, Functional Design, Data Validation
Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: Databricks, Real Time Data, PySpark, Apache Hive, Apache Spark, Big Data, Data Processing, SQL, Data Manipulation, Pandas (Python Package)
Intermediate · Guided Project · Less Than 2 Hours

Skills you'll gain: Budget Management, Account Strategy, Financial Management, Financial Data, Key Performance Indicators (KPIs), Account Management, Data Quality, Budgeting, Customer Analysis, Data Integrity, Performance Analysis, Analytics, Data-Driven Decision-Making, Strategic Decision-Making, Power BI, Customer Insights, Data Analysis, Data Management, AI Enablement, Data Collection
Advanced · Course · 1 - 4 Weeks

Google Cloud
Skills you'll gain: Data Pipelines, Google Cloud Platform, Data Processing, Data Storage, Cloud Engineering, Systems Design, Data Infrastructure, Data Management, Data Architecture, Data Integration, Data Analysis, Automation
Advanced · Course · 1 - 3 Months

Google Cloud
Skills you'll gain: Data Migration, MySQL, Google Cloud Platform, Database Management, Data Pipelines, Data Storage Technologies, Operational Databases, Cloud Management, Data Management
Intermediate · Project · Less Than 2 Hours