
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Development Environment, Distributed Computing, Performance Tuning, Open Source Technology, Data Transformation, Debugging
★ 4.4 (479) · Intermediate · Course · 1 - 3 Months

Pragmatic AI Labs
Skills you'll gain: Databricks, Data Lakes, Data Engineering, Data Wrangling, Apache Spark, Data Access, Data Processing, Data Warehousing, Data Architecture, Data Management, Data Synthesis, Data Science, Data Mining, Data Integrity, Data Modeling, Data Presentation, Data Entry, Data Storage, SQL, Python Programming
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Pandas (Python Package), Pivot Tables And Charts, Data Manipulation, Data Import/Export, NumPy, Time Series Analysis and Forecasting, Business Reporting, Data Wrangling, Jupyter, Data Visualization, Microsoft Excel, Plot (Graphics), Data Transformation, Data Analysis, Data Cleansing, Data Preprocessing, Analytics, Performance Reporting, Data Processing, Python Programming
★ 4.7 (15) · Beginner · Specialization · 1 - 3 Months

Skills you'll gain: Apache Airflow, Service Level Agreement, Data Pipelines, Workflow Management, Dependency Analysis, Scalability, Service Level, System Monitoring
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Scala Programming, Data Pipelines, Test Driven Development (TDD), Apache Airflow, Data Lakes, Apache Spark, CI/CD, Apache Kafka, Data Quality, Data Architecture, Performance Tuning, Data Store, Unit Testing, Data Transformation, Data Processing, Data Validation, Maintainability, Continuous Integration, Continuous Deployment, Data Integrity
Intermediate · Course · 3 - 6 Months

Coursera
Skills you'll gain: Data Pipelines, Apache Airflow, Data Integration, Cloud Deployment, Extract, Transform, Load, Cloud Computing, Data Infrastructure, Data Processing, Data Cleansing, Data Warehousing, Code Reusability
Beginner · Course · 1 - 4 Weeks

Skills you'll gain: Apache Kafka, Apache Hadoop, Apache Spark, Real Time Data, Scala Programming, Data Integration, Command-Line Interface, Apache Hive, Big Data, Applied Machine Learning, Data Processing, System Design and Implementation, Apache Cassandra, Data Pipelines, Java, Distributed Computing, IntelliJ IDEA, Java Programming, Application Deployment, Enterprise Application Management
★ 4.6 (15) · Intermediate · Specialization · 3 - 6 Months

Skills you'll gain: NoSQL, Apache Spark, Apache Hadoop, MongoDB, Database Development, Database Systems, Databases, Database Management Systems, Database Management, Extract, Transform, Load, Database Software, Database Administration, PySpark, Apache Hive, Machine Learning Methods, Big Data, Machine Learning, Applied Machine Learning, Generative AI, Model Evaluation
★ 4.5 (840) · Beginner · Specialization · 3 - 6 Months

Skills you'll gain: Cloud Security, Apache Spark, Transaction Processing, Cloud Infrastructure, Data Lakes, PySpark, Data Security, Security Controls, Data Infrastructure, Performance Tuning, Cloud Computing, Cloud Storage, Data Storage Technologies, Data Storage, Cloud Deployment, Data Warehousing, Data Management, Infrastructure Architecture, Data Integrity, Infrastructure as Code (IaC)
Beginner · Course · 1 - 3 Months

Skills you'll gain: PySpark, Apache Spark, Customer Analysis, Big Data, Data Processing, Advanced Analytics, Statistical Modeling, Text Mining, Customer Insights, Risk Modeling, Data Preprocessing, Unstructured Data, Simulation and Simulation Software, Data Manipulation, Marketing Analytics
★ 4.6 (14) · Mixed · Course · 1 - 4 Weeks

Pearson
Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Data Lakes, Analytics, Data Pipelines, Data Processing, Data Import/Export, Linux Commands, Linux, File Systems, Data Management, Distributed Computing, Command-Line Interface, Relational Databases, Software Installation, Java, C++ (Programming Language)
Intermediate · Specialization · 1 - 4 Weeks

Google Cloud
Skills you'll gain: Dataflow, Data Pipelines, Serverless Computing, Identity and Access Management, Google Cloud Platform, Site Reliability Engineering, Cloud Security, Performance Tuning, Data Security, CI/CD, Data Processing, Debugging, Real Time Data, System Monitoring, Cloud Storage, Development Testing, Unit Testing, Containerization, File I/O, Data Transformation
★ 4 (131) · Intermediate · Specialization · 3 - 6 Months