About this Course

Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn on your own schedule.
Flexible deadlines
Reset deadlines in accordance with your schedule.
Intermediate Level
Approx. 12 hours to complete
English

Skills you will gain

Data Science, Apache Spark, SQL

Offered by

University of California, Davis

Syllabus - What you will learn from this course

Week 1

3 hours to complete

Introduction to Spark

6 videos (Total 32 min), 3 readings, 2 quizzes
6 videos
Why Distributed Computing? (7 min)
Spark DataFrames (6 min)
The Databricks Environment (8 min)
SQL in Notebooks (3 min)
Import Data (2 min)
3 readings
A Note From UC Davis (10 min)
Readings and Resources (40 min)
Assignment #1 - Queries in Spark SQL (30 min)
2 practice exercises
Assignment #1 Quiz - Queries in Spark SQL (30 min)
Module 1 Quiz (30 min)

Week 2

2 hours to complete

Spark Core Concepts

6 videos (Total 25 min), 2 readings, 2 quizzes
6 videos
Spark Terminology (3 min)
Caching (5 min)
Shuffle Partitions (7 min)
Spark UI (3 min)
Broadcast Joins (3 min)
2 readings
Readings (30 min)
Assignment #2 - Spark Internals (30 min)
2 practice exercises
Assignment #2 Quiz - Spark Internals (30 min)
Module 2 Quiz (30 min)

Week 3

3 hours to complete

Engineering Data Pipelines

7 videos (Total 43 min), 2 readings, 2 quizzes
7 videos
Spark as a Connector (6 min)
Accessing Data (10 min)
File Formats (8 min)
Schemas and Types (4 min)
Writing Data (6 min)
Managed and Unmanaged Tables (4 min)
2 readings
Readings (1 h)
Assignment #3 - Engineering Data Pipelines (30 min)
2 practice exercises
Assignment #3 Quiz - Engineering Data Pipelines (30 min)
Module 3 Quiz (30 min)

Week 4

4 hours to complete

Machine Learning Applications of Spark

7 videos (Total 35 min), 2 readings, 3 quizzes
7 videos
Applications of Machine Learning (4 min)
Machine Learning Fundamentals (6 min)
Linear Regression (6 min)
Training Linear Regression Model (8 min)
Applying Machine Learning with UDFs (4 min)
Course Summary (3 min)
2 readings
Readings (1 h)
Assignment #4 - Logistic Regression Classifier (10 min)
2 practice exercises
Assignment #4 Quiz - Logistic Regression Classifier (30 min)
Module 4 Quiz (30 min)

Reviews

TOP REVIEWS FROM DISTRIBUTED COMPUTING WITH SPARK SQL

About the Learn SQL Basics for Data Science Specialization
