
Skills you'll gain: Data Warehousing, Data Flow Diagrams (DFDs), Data Modeling, Data Pipelines, Ansible, Cloud Security, Diagram Design, Data Validation, Database Design, Apache Airflow, Star Schema, Snowflake Schema, Interviewing Skills, Apache Spark, PySpark, CI/CD, Docker (Software), SQL, Workflow Management, Git (Version Control System)
Intermediate · Professional Certificate · 3 - 6 Months

Skills you'll gain: PySpark, Apache Spark, Model Evaluation, MySQL, Data Pipelines, Scala Programming, Extract, Transform, Load, Logistic Regression, Customer Analysis, Apache Hadoop, Predictive Modeling, Applied Machine Learning, Data Processing, Data Persistence, Advanced Analytics, Big Data, Apache Maven, Data Access, Apache, Python Programming
Beginner · Specialization · 1 - 3 Months

Skills you'll gain: System Monitoring, Data Quality, Performance Tuning, Apache Spark, Data Validation, Data Pipelines, Query Languages, Debugging, Data Transformation, Anomaly Detection, PySpark, Performance Analysis, Extract, Transform, Load, Failure Analysis, SQL, Data Architecture, Data Processing, Benchmarking, Root Cause Analysis, Distributed Computing
Advanced · Specialization · 3 - 6 Months

Skills you'll gain: Cloud Security, Apache Spark, Transaction Processing, Cloud Infrastructure, Data Lakes, PySpark, Data Security, Security Controls, Data Infrastructure, Performance Tuning, Cloud Computing, Cloud Storage, Data Storage Technologies, Data Storage, Cloud Deployment, Data Warehousing, Data Management, Infrastructure Architecture, Data Integrity, Infrastructure as Code (IaC)
Beginner · Course · 1 - 3 Months

Skills you'll gain: NoSQL, Apache Spark, Apache Hadoop, MongoDB, Database Development, Database Systems, Databases, Database Management Systems, Database Management, Extract, Transform, Load, Database Software, Database Administration, PySpark, Apache Hive, Machine Learning Methods, Big Data, Machine Learning, Applied Machine Learning, Generative AI, Model Evaluation
Beginner · Specialization · 3 - 6 Months

Edureka
Skills you'll gain: PySpark, Model Optimization, Data Pipelines, Dashboard Creation, Dashboard, Interactive Data Visualization, Model Training, Data Processing, Data Storage Technologies, Data Architecture, Natural Language Processing, Data Storage, Data Wrangling, Data Integration, Data Transformation, Machine Learning, Data Preprocessing, Deep Learning, Logistic Regression
Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: Apache Spark, Machine Learning, Generative AI, Model Evaluation, Supervised Learning, Apache Hadoop, Data Pipelines, Unsupervised Learning, Data Processing, Extract, Transform, Load, Predictive Modeling, Model Deployment, Classification Algorithms, Data Transformation, Regression Analysis
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Open Source Technology, Data Transformation, Debugging
Intermediate · Course · 1 - 3 Months

University of Pittsburgh
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Data Pipelines, Distributed Computing, Big Data, Apache Hive, Data Processing, Data Storage, Scikit Learn (Machine Learning Library), Predictive Modeling, Scalability, Data Management, File Systems, Data Science, Data Transformation, Information Technology, Data Analysis
Build toward a degree
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Databricks, CI/CD, Apache Spark, Microsoft Azure, Data Governance, Data Lakes, Data Architecture, Integration Testing, Continuous Integration, Continuous Deployment, Data Infrastructure, Real Time Data, Data Integration, Data Pipelines, Development Environment, Data Management, Data Processing, Automation, Data Storage, File Systems
Intermediate · Specialization · 1 - 3 Months

Skills you'll gain: Scala Programming, Data Pipelines, Test Driven Development (TDD), Apache Airflow, Data Lakes, Apache Spark, CI/CD, Apache Kafka, Data Quality, Data Architecture, Performance Tuning, Data Store, Unit Testing, Data Transformation, Data Processing, Data Validation, Maintainability, Continuous Integration, Continuous Deployment, Data Integrity
Intermediate · Course · 3 - 6 Months

Johns Hopkins University
Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Pipelines, Data Science, Databases, Data Integration, SQL, Query Languages, File I/O, Data Architecture, Distributed Computing, Performance Tuning
Intermediate · Specialization · 3 - 6 Months