
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Open Source Technology, Data Transformation, Debugging
★ 4.4 (479) · Intermediate · Course · 1 - 3 Months

Skills you'll gain: Data Warehousing, Data Flow Diagrams (DFDs), Data Modeling, Data Pipelines, Ansible, Cloud Security, Diagram Design, Data Validation, Database Design, Apache Airflow, Star Schema, Snowflake Schema, Interviewing Skills, Apache Spark, PySpark, CI/CD, Docker (Software), SQL, Workflow Management, Git (Version Control System)
Intermediate · Professional Certificate · 3 - 6 Months

Skills you'll gain: PySpark, Apache Spark, Model Evaluation, MySQL, Data Pipelines, Scala Programming, Extract, Transform, Load, Logistic Regression, Customer Analysis, Apache Hadoop, Predictive Modeling, Applied Machine Learning, Data Processing, Data Persistence, Advanced Analytics, Big Data, Apache Maven, Data Access, Apache, Python Programming
★ 4.6 (90) · Beginner · Specialization · 1 - 3 Months

Pragmatic AI Labs
Skills you'll gain: Databricks, Data Lakes, Data Engineering, Data Wrangling, Apache Spark, Data Access, Data Processing, Data Warehousing, Data Architecture, Data Management, Data Synthesis, Data Science, Data Mining, Data Integrity, Data Modeling, Data Presentation, Data Entry, Data Storage, SQL, Python Programming
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: System Monitoring, Data Quality, Performance Tuning, Apache Spark, Data Validation, Data Pipelines, Query Languages, Debugging, Data Transformation, Anomaly Detection, PySpark, Performance Analysis, Extract, Transform, Load, Failure Analysis, SQL, Data Architecture, Data Processing, Benchmarking, Root Cause Analysis, Distributed Computing
Advanced · Specialization · 3 - 6 Months

Edureka
Skills you'll gain: PySpark, Apache Spark, Data Management, Distributed Computing, Apache Hadoop, Data Processing, Data Manipulation, Data Analysis, Exploratory Data Analysis, Python Programming
★ 3.7 (50) · Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Databricks, CI/CD, Apache Spark, Microsoft Azure, Data Governance, Data Lakes, Data Architecture, Integration Testing, Continuous Integration, Continuous Deployment, Real Time Data, Data Integration, Data Pipelines, IT Automation, Development Environment, Data Management, Automation, Data Storage, Metadata Management, File Systems
★ 4.4 (50) · Intermediate · Specialization · 1 - 3 Months

Johns Hopkins University
Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Science, Databases, Data Integration, SQL, Query Languages, File I/O, Data Architecture, Data Manipulation, Distributed Computing, Performance Tuning
★ 4.6 (9) · Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: Pandas (Python Package), Pivot Tables And Charts, Data Manipulation, Data Import/Export, NumPy, Time Series Analysis and Forecasting, Business Reporting, Data Wrangling, Jupyter, Data Visualization, Microsoft Excel, Plot (Graphics), Data Transformation, Data Analysis, Data Cleansing, Data Preprocessing, Analytics, Performance Reporting, Data Processing, Python Programming
★ 4.7 (15) · Beginner · Specialization · 1 - 3 Months

Skills you'll gain: PySpark, MySQL, Data Pipelines, Apache Spark, Data Access, Data Processing, Data Engineering, SQL, Data Transformation, Data Manipulation, Distributed Computing, Data Import/Export, Programming Principles, Python Programming, Debugging
★ 4.5 (41) · Mixed · Course · 1 - 4 Weeks

Skills you'll gain: Data Pipelines, Apache Kafka, Apache Airflow, Data Transformation, Extract, Transform, Load, Data Processing, Data Integration, Data Warehousing, Data Cleansing, Data Lakes, Data Mart, Performance Tuning, Shell Script, Bash (Scripting Language), Command-Line Interface
★ 4.5 (457) · Intermediate · Course · 1 - 3 Months

Skills you'll gain: Apache Kafka, Data Transformation, Real Time Data, Fraud Detection, Data Pipelines, Apache Spark, Power BI, PySpark, Performance Tuning, Grafana, Disaster Recovery, Data Architecture, Prometheus (Software), Data Integrity, Scalability, Data Processing, Data Governance, Event-Driven Programming, System Monitoring, Docker (Software)
Intermediate · Specialization · 3 - 6 Months