About this Course

89,790 recent views
Flexible deadlines
Reset deadlines in accordance to your schedule.
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Intermediate Level
Approx. 14 hours to complete
English

What you will learn

  • U​se the collaborative Databricks workspace to write scalable Spark SQL code that executes against a cluster of machines

  • Inspect the Spark UI to analyze query performance and identify bottlenecks

  • Create an end-to-end pipeline that reads data, transforms it, and saves the result

  • B​uild a medallion (bronze, silver, gold) lakehouse architecture with Delta Lake to ensure the reliability, scalability, and performance of your data

Skills you will gain

Data ScienceApache SparkDelta LakeSQL
Flexible deadlines
Reset deadlines in accordance to your schedule.
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Intermediate Level
Approx. 14 hours to complete
English

Offered by

Placeholder

University of California, Davis

Syllabus - What you will learn from this course

Content RatingThumbs Up87%(1,038 ratings)Info
Week
1

Week 1

3 hours to complete

Introduction to Spark

3 hours to complete
6 videos (Total 43 min), 3 readings, 2 quizzes
Week
2

Week 2

3 hours to complete

Spark Core Concepts

3 hours to complete
6 videos (Total 36 min), 2 readings, 2 quizzes
Week
3

Week 3

4 hours to complete

Engineering Data Pipelines

4 hours to complete
7 videos (Total 62 min), 2 readings, 2 quizzes
Week
4

Week 4

4 hours to complete

Data Lakes, Warehouses and Lakehouses

4 hours to complete
8 videos (Total 52 min), 2 readings, 3 quizzes

Reviews

TOP REVIEWS FROM DISTRIBUTED COMPUTING WITH SPARK SQL

View all reviews

About the Learn SQL Basics for Data Science Specialization

Learn SQL Basics for Data Science

Frequently Asked Questions

More questions? Visit the Learner Help Center.