About this Course

30,700 recent views
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Flexible deadlines
Reset deadlines in accordance to your schedule.
Intermediate Level
Approx. 13 hours to complete
English

What you will learn

  • U​se the collaborative Databricks workspace and write SQL code that executes against a cluster of machines

  • Use Spark UI to analyze performance and identify bottlenecks

  • Create an end-to-end pipeline that reads data, transforms it, and saves the result

  • B​uild a linear regression model and make predictions using SparkSQL

Skills you will gain

Data ScienceApache SparkSQL
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Flexible deadlines
Reset deadlines in accordance to your schedule.
Intermediate Level
Approx. 13 hours to complete
English

Offered by

Placeholder

University of California, Davis

Syllabus - What you will learn from this course

Week
1

Week 1

3 hours to complete

Introduction to Spark

3 hours to complete
6 videos (Total 32 min), 3 readings, 2 quizzes
Week
2

Week 2

2 hours to complete

Spark Core Concepts

2 hours to complete
6 videos (Total 25 min), 2 readings, 2 quizzes
Week
3

Week 3

3 hours to complete

Engineering Data Pipelines

3 hours to complete
7 videos (Total 43 min), 2 readings, 2 quizzes
Week
4

Week 4

4 hours to complete

Machine Learning Applications of Spark

4 hours to complete
7 videos (Total 35 min), 2 readings, 3 quizzes

Reviews

TOP REVIEWS FROM DISTRIBUTED COMPUTING WITH SPARK SQL

View all reviews

About the Learn SQL Basics for Data Science Specialization

Learn SQL Basics for Data Science

Frequently Asked Questions

More questions? Visit the Learner Help Center.