
Coursera
Skills you'll gain: A/B Testing, Data-Driven Decision-Making, Statistical Methods, Statistical Hypothesis Testing, Analytics, Statistics, Estimation, Decision Making, Data Analysis, Analytical Skills, Statistical Inference, Statistical Analysis, Business, Sample Size Determination, Data Collection
Intermediate · Course · 1 - 4 Weeks

University of California San Diego
Skills you'll gain: Apache Spark, Model Evaluation, Apache Hadoop, Data Integration, Exploratory Data Analysis, Big Data, Classification Algorithms, Graph Theory, Data Pipelines, Data Processing, Network Model, Model Training, Database Design, Data Modeling, Regression Analysis, Data Management, Data Infrastructure, Data Presentation, Data Mining, MongoDB
★ 4.5 (14K) · Beginner · Specialization · 3 - 6 Months

Skills you'll gain: Security Controls, Scalability, Azure Synapse Analytics, Data Pipelines, Databases, Microsoft Azure, Data Governance, Extract, Transform, Load, Data Lakes, Databricks, NoSQL, Data Management, Apache Hadoop, Big Data, Apache Spark, Dashboard Creation, Interactive Data Visualization, MLOps (Machine Learning Operations), Large Language Modeling, Applied Machine Learning
Intermediate · Professional Certificate · 3 - 6 Months

Johns Hopkins University
Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Pipelines, Data Science, Databases, Data Integration, SQL, Query Languages, File I/O, Data Architecture, Distributed Computing, Performance Tuning
★ 4.6 (9) · Intermediate · Specialization · 3 - 6 Months

University of California San Diego
Skills you'll gain: Big Data, Apache Hadoop, Scalability, Data Processing, Data Science, Distributed Computing, Unstructured Data, Data Analysis, Real Time Data, Data Quality, Data Storage
★ 4.6 (11K) · Mixed · Course · 1 - 3 Months

Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Open Source Technology, Data Transformation, Debugging
★ 4.4 (479) · Intermediate · Course · 1 - 3 Months

Cloudera
Skills you'll gain: Database Design, SQL, Apache Hive, Relational Databases, Databases, Database Management, Database Management Systems, Data Store, Big Data, Database Systems, Amazon Web Services, MySQL, Data Management, Query Languages, Amazon S3, Data Storage, Data Access, NoSQL, Cloud Storage, Data Analysis
★ 4.7 (1.4K) · Beginner · Specialization · 3 - 6 Months

Skills you'll gain: Data Flow Diagrams (DFDs), Apache Airflow, Data Pipelines, Diagram Design, Data Mapping, Data Modeling, Data Integration, Data Architecture, Data Warehousing, Apache Spark, Extract, Transform, Load, Database Development, Data Processing, Data Transformation, Configuration Management, Enterprise Security
Beginner · Course · 1 - 3 Months

Duke University
Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, GitHub Copilot, Interactive Data Visualization, Plot (Graphics), Plotly, Data Pipelines, Kubernetes, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming
★ 3.8 (123) · Intermediate · Specialization · 1 - 3 Months

Skills you'll gain: Data Validation, Data Quality, Data Integrity, Debugging, Data Pipelines, Test Automation, Root Cause Analysis, YAML, Generative AI, Test Tools, Anomaly Detection, AI Integrations, CI/CD, Python Programming, Reliability, Performance Tuning, Memory Management
Beginner · Course · 1 - 3 Months

Skills you'll gain: Data Validation, Data Quality, SQL, Data Integrity, Verification And Validation, Unit Testing
Advanced · Course · 1 - 4 Weeks

Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Cloud Infrastructure, Data Modeling, Database Design, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management
★ 4.4 (98) · Intermediate · Specialization · 1 - 3 Months