
Duke University
Skills you'll gain: PySpark, Databricks, Data Pipelines, Apache Spark, MLOps (Machine Learning Operations), Apache Hadoop, Big Data, Data Warehousing, Data Quality, Data Integration, Data Processing, Database Architecture and Administration, DevOps, Distributed Computing, Data Transformation, SQL, Python Programming
Advanced · Course · 1 - 4 Weeks

Google Cloud
Skills you'll gain: Apache Spark, PySpark, Google Cloud Platform, Cloud Management, Cloud Computing, Distributed Computing, Package and Software Management
Intermediate · Project · Less Than 2 Hours

O.P. Jindal Global University
Skills you'll gain: Big Data, Apache Spark, PySpark, Apache Hadoop, Apache Hive, Databases, NoSQL, Data Mining, Data Warehousing, Real Time Data, Cloud Computing, Data Processing, Query Languages, Distributed Computing, Applied Machine Learning, Scripting Languages
Build toward a degree
Beginner · Course · 1 - 3 Months

Skills you'll gain: Data Ethics, Generative AI, Microsoft Copilot, Data Quality, Responsible AI, Generative Adversarial Networks (GANs), Data Preprocessing, Data Cleansing, Data Processing, Data Synthesis, Data Transformation, Feature Engineering, Data Validation, Information Privacy
Intermediate · Course · 1 - 3 Months

Skills you'll gain: Data Pipelines, Apache Hadoop, Extract, Transform, Load, Data Transformation, Apache Hive, Big Data, Data Warehousing, Strategic Decision-Making, Apache Spark, Data Integration, Data Processing, Data Management, Data Analysis, Scalability
Intermediate · Course · 1 - 4 Weeks

École Polytechnique Fédérale de Lausanne
Skills you'll gain: Scala Programming, Data Structures, Distributed Computing, Algorithms, Functional Design, Scalability, Java Programming, Other Programming Languages, Performance Tuning
Intermediate · Course · 1 - 4 Weeks

Coursera
Skills you'll gain: PySpark, Matplotlib, Apache Spark, Big Data, Data Processing, Distributed Computing, Data Management, Data Visualization, Data Analysis, Data Manipulation, Data Cleansing, Query Languages, Python Programming
Intermediate · Guided Project · Less Than 2 Hours

Skills you'll gain: Apache Spark, PySpark, Applied Machine Learning, Big Data, Machine Learning Methods, Data Storage Technologies, Data Preprocessing, Data Storage, Machine Learning Algorithms, Machine Learning, Distributed Computing, Data Processing, Data Science, Statistical Methods, Model Evaluation, Descriptive Statistics
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Apache Hadoop, Big Data, Database Design, Data Processing, Distributed Computing, Scalability, Data Pipelines, Data Warehousing, Query Languages, Data Cleansing, Data Transformation, Data Management, Analytics, Business Intelligence
Mixed · Course · 1 - 4 Weeks

Johns Hopkins University
Skills you'll gain: Apache Hadoop, File Systems, Big Data, Data Infrastructure, Java, Data Structures, File Management, Systems Architecture, Data Processing, Distributed Computing, Data Storage, Development Environment, Scalability
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Enterprise Architecture, Apache Kafka, Data Architecture, Generative AI, Data Infrastructure, Root Cause Analysis, Dataflow, Solution Architecture, Data Quality, Data Pipelines, Software Architecture, Data Integration, Cloud Storage, Data Storage, Failure Analysis, Data Processing, Dependency Analysis, Real Time Data
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: PySpark, Apache Spark, Apache Hadoop, Data Pipelines, Big Data, Data Storage Technologies, Data Processing, Distributed Computing, Data Analysis Expressions (DAX), Data Storage, Data Transformation, SQL, Data Manipulation, Performance Tuning
Intermediate · Course · 1 - 3 Months