PySpark courses can help you learn data manipulation, distributed computing, and data analysis techniques. You can build skills in working with large datasets, performing transformations, and running machine learning algorithms. Many courses introduce Apache Spark and its libraries, which support efficient big data processing and integration with AI applications.

Skills you'll gain: Data Import/Export, NumPy, Pandas (Python Package), Pivot Tables And Charts, Business Reporting, Data Manipulation, Analytics, Performance Reporting, Data Cleansing, Data Analysis, Data Transformation, Data Management, Linear Algebra
Mixed · Course · 1 - 4 Weeks

Skills you'll gain: Version Control, Git (Version Control System), Selenium (Software), Test Automation, Jenkins, Test Script Development, Software Versioning, Continuous Integration, Test Tools, Continuous Delivery, Test Data, Software Testing, CI/CD, Code Reusability, Software Design Patterns, Command-Line Interface, File I/O
Advanced · Course · 1 - 3 Months

Skills you'll gain: Alteryx, Predictive Modeling, People Analytics, Scripting, R Programming, Predictive Analytics, Data Science, Advanced Analytics, Scripting Languages, R (Software), Trend Analysis, Data Preprocessing, Data Integration, Exploratory Data Analysis, Data Manipulation, Data Visualization Software, Data Analysis, Model Evaluation, Employee Retention, Machine Learning
Intermediate · Guided Project · Less Than 2 Hours

Johns Hopkins University
Skills you'll gain: Open Source Technology, Package and Software Management, Unit Testing, R (Software), GitHub, Version Control, Rmarkdown, Cross Platform Development, Software Versioning, Software Documentation, Test Case, Testability, R Programming, Code Reusability, Knitr, Continuous Integration, Program Development, Build Tools, Git (Version Control System), Development Testing
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Application Development, User Interface (UI), Program Development, Software Development Life Cycle, UI Components, Data Management, User Interface (UI) Design, Software Design, File I/O, Application Design, Development Environment, Data Import/Export, Application Frameworks, Data Persistence
Intermediate · Course · 1 - 4 Weeks

Skills you'll gain: Model Deployment, Snowflake Schema, Data Preprocessing, Applied Machine Learning, Machine Learning Methods, Model Training, MLOps (Machine Learning Operations), Machine Learning, Predictive Modeling, Data Pipelines, Data Transformation, Data Science, Python Programming, Regression Analysis
Intermediate · Guided Project · Less Than 2 Hours

Skills you'll gain: Database Application, Database Software, Integrated Development Environments, Computer Networking, Database Management, Real Time Data, Application Development, Package and Software Management
Mixed · Course · 1 - 3 Months

Skills you'll gain: Apache Hadoop, Real Time Data, Apache Spark, Apache Kafka, Data Integration, AWS Kinesis, Apache Hive, Data Pipelines, Big Data, Applied Machine Learning, Systems Design, System Design and Implementation, Distributed Computing, Query Languages, Data Processing, NoSQL, MongoDB, Data Import/Export, SQL, Scalability
Intermediate · Course · 1 - 3 Months

Google Cloud
Skills you'll gain: Apache Spark, Google Cloud Platform, Cloud Management, Apache Hadoop, Cloud Computing
Beginner · Project · Less Than 2 Hours

Google Cloud
Skills you'll gain: Apache Spark, Apache Hadoop, Google Cloud Platform, Data Processing, Command-Line Interface, Cloud Management, Cloud Computing
Beginner · Project · Less Than 2 Hours

Google Cloud
Skills you'll gain: Model Evaluation, Apache Spark, Google Cloud Platform, Logistic Regression, Predictive Modeling, Big Data, Model Training, Data Preprocessing, Applied Machine Learning
Intermediate · Project · Less Than 2 Hours

Google Cloud
Skills you'll gain: Apache Spark, Google Cloud Platform, Cloud Management, Cloud Computing, Distributed Computing, Package and Software Management
Intermediate · Project · Less Than 2 Hours