
Skills you'll gain: Data Warehousing, Data Flow Diagrams (DFDs), Data Modeling, Data Pipelines, Ansible, Cloud Security, Diagram Design, Data Validation, Database Design, Apache Airflow, Star Schema, Snowflake Schema, Interviewing Skills, Apache Spark, PySpark, CI/CD, Docker (Software), SQL, Workflow Management, Git (Version Control System)
Intermediate · Professional Certificate · 3 - 6 Months

Johns Hopkins University
Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Pipelines, Data Science, Databases, Data Integration, SQL, Query Languages, File I/O, Data Architecture, Distributed Computing, Performance Tuning
Intermediate · Specialization · 3 - 6 Months

Google Cloud
Skills you'll gain: Dataflow, Data Pipelines, Serverless Computing, Identity and Access Management, Google Cloud Platform, Site Reliability Engineering, Cloud Security, Performance Tuning, Data Security, CI/CD, Data Processing, Debugging, Real Time Data, System Monitoring, Cloud Storage, Development Testing, Unit Testing, Containerization, File I/O, Data Transformation
Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: NoSQL, Apache Spark, Apache Hadoop, MongoDB, Database Development, Database Systems, Databases, Database Management Systems, Database Management, ETL (Extract, Transform, Load), Database Software, Database Administration, PySpark, Apache Hive, Machine Learning Methods, Big Data, Machine Learning, Applied Machine Learning, Generative AI, Model Evaluation
Beginner · Specialization · 3 - 6 Months

Skills you'll gain: Data Pipelines, Apache Kafka, Apache Airflow, Data Transformation, ETL (Extract, Transform, Load), Data Processing, Data Integration, Data Warehousing, Data Cleansing, Data Lakes, Data Mart, Performance Tuning, Shell Script, Bash (Scripting Language), Command-Line Interface
Intermediate · Course · 1 - 3 Months

Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Open Source Technology, Data Transformation, Debugging
Intermediate · Course · 1 - 3 Months

Rice University
Skills you'll gain: Apache Kafka, Apache Spark, Apache Hadoop, Event-Driven Programming, Distributed Computing, Java Programming, Dataflow, Java, OS Process Management, Scala Programming, Data Structures, Scalability, Programming Principles, Server Side, Servers, Application Frameworks, Algorithms, Performance Tuning, Performance Testing, Functional Design
Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: Apache Airflow, CI/CD, Data Pipelines, Continuous Deployment, Workflow Management, Site Reliability Engineering, Data Engineering, Model Deployment, Data Quality, Version Control, PostgreSQL, Git (Version Control System), Python Programming, Debugging, SQL, Production Management, Scheduling, Unit Testing, Linux Commands, Web Servers
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Cloud Security, Apache Spark, Transaction Processing, Cloud Infrastructure, Data Lakes, PySpark, Data Security, Security Controls, Data Infrastructure, Performance Tuning, Cloud Computing, Cloud Storage, Data Storage Technologies, Data Storage, Cloud Deployment, Data Warehousing, Data Management, Infrastructure Architecture, Data Integrity, Infrastructure as Code (IaC)
Beginner · Course · 1 - 3 Months

University of Pittsburgh
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Data Pipelines, Distributed Computing, Big Data, Apache Hive, Data Processing, Data Storage, Scikit Learn (Machine Learning Library), Predictive Modeling, Scalability, Data Management, File Systems, Data Science, Data Transformation, Information Technology, Data Analysis
Build toward a degree
Intermediate · Course · 1 - 4 Weeks

KodeKloud
Skills you'll gain: MLOps (Machine Learning Operations), Apache Kafka, Apache Airflow, Apache Spark, ETL (Extract, Transform, Load), Data Lakes, Data Pipelines, Distributed Computing, Real Time Data, DevOps, Data Processing, Feature Engineering, CI/CD, Pandas (Python Package), Continuous Integration
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Scala Programming, Data Pipelines, Test Driven Development (TDD), Apache Airflow, Data Lakes, Apache Spark, CI/CD, Apache Kafka, Data Quality, Data Architecture, Performance Tuning, Data Store, Unit Testing, Data Transformation, Data Processing, Data Validation, Maintainability, Continuous Integration, Continuous Deployment, Data Integrity
Intermediate · Course · 3 - 6 Months