PySpark courses can help you learn data manipulation, distributed computing, and data analysis techniques. You can build skills in working with large datasets, performing transformations, and executing machine learning algorithms. Many courses introduce tools like Apache Spark and its libraries, that support processing big data efficiently and integrating with AI applications.

University of Colorado Boulder
Skills you'll gain: Matplotlib, Seaborn, Plot (Graphics), Pandas (Python Package), NumPy, Data Visualization Software, Data Visualization, Data Manipulation, Data Science, Histogram, Package and Software Management, Data Import/Export, Python Programming
Intermediate · Course · 1 - 4 Weeks

Johns Hopkins University
Skills you'll gain: Bioinformatics, Data Structures, Jupyter, Python Programming, Programming Principles, Object Oriented Programming (OOP), File I/O, Computational Logic, Package and Software Management, Data Manipulation
Mixed · Course · 1 - 4 Weeks

Skills you'll gain: Continuous Integration, Test Automation, Authentications, CI/CD, Test Script Development, API Testing, Software Testing, Unit Testing, Behavior-Driven Development, Web Development Tools, Test Case, GitHub, User Interface (UI)
Intermediate · Course · 3 - 6 Months

University of Colorado Boulder
Skills you'll gain: Pandas (Python Package), NumPy, Data Structures, Data Import/Export, Data Manipulation, Data Cleansing, Statistical Methods, Data Analysis, Exploratory Data Analysis
Intermediate · Course · 1 - 3 Months

University of California San Diego
Skills you'll gain: Apache Hadoop, Big Data, Data Analysis, Apache Spark, Data Science, PySpark, Data Infrastructure, Data Processing, Distributed Computing, Performance Tuning, Scalability, Data Storage, Python Programming
Mixed · Course · 1 - 3 Months

Skills you'll gain: Microsoft Azure, MLOps (Machine Learning Operations), Databricks, Cloud Computing, Data Ethics, Model Deployment, Data Pipelines, Data Preprocessing, Responsible AI, Machine Learning, Model Evaluation, Information Privacy, Continuous Monitoring, Data Security, Performance Tuning
Intermediate · Course · 1 - 3 Months

Skills you'll gain: Model Deployment, Apache Spark, Data Pipelines, MLOps (Machine Learning Operations), PySpark, IBM Cloud, Jupyter, Docker (Software), Machine Learning, Data Science, Python Programming, Scalability, Design Thinking
Advanced · Course · 1 - 4 Weeks

Skills you'll gain: Extract, Transform, Load, Apache Spark, Data Pipelines, PySpark, Apache Hadoop, Data Transformation, MySQL, Data Manipulation, Java Platform Enterprise Edition (J2EE), Data Import/Export, Data Persistence, Development Environment, Software Installation
Mixed · Course · 1 - 4 Weeks

Skills you'll gain: Apache Spark, PySpark, Applied Machine Learning, Big Data, Machine Learning Methods, Data Storage Technologies, Data Preprocessing, Data Storage, Machine Learning Algorithms, Machine Learning, Distributed Computing, Data Processing, Data Science, Statistical Methods, Model Evaluation, Descriptive Statistics
Intermediate · Course · 1 - 4 Weeks

DeepLearning.AI
Skills you'll gain: Data Modeling, Data Transformation, Data Warehousing, Data Preprocessing, Apache Hadoop, Data Pipelines, Apache Spark, Feature Engineering, Star Schema, Real Time Data, Data Access, Machine Learning
Intermediate · Course · 1 - 4 Weeks

Johns Hopkins University
Skills you'll gain: Tidyverse (R Package), Web Scraping, Data Manipulation, R Programming, Data Transformation, Data Cleansing, Big Data, Statistical Programming, Text Mining, Data Import/Export, Data Analysis, Data Structures, Performance Tuning, Application Programming Interface (API)
Intermediate · Course · 1 - 3 Months

University of Leeds
Skills you'll gain: Jupyter, Programming Principles, Computer Programming, Python Programming, Development Environment, Scripting, Software Installation, Data Structures, Data Science
Beginner · Course · 1 - 4 Weeks