Wenn Sie sich für diesen Kurs anmelden, werden Sie auch für diese Spezialisierung angemeldet.
Lernen Sie neue Konzepte von Branchenexperten
Gewinnen Sie ein Grundverständnis bestimmter Themen oder Tools
Erwerben Sie berufsrelevante Kompetenzen durch praktische Projekte
Erwerben Sie ein Berufszertifikat zur Vorlage
In diesem Kurs gibt es 4 Module
Learn to build data pipelines on the Databricks Lakehouse Platform — from architecture concepts to hands-on Spark and Delta Lake. This beginner course starts with why the lakehouse pattern replaced separate data warehouses and data lakes, then moves directly into the Databricks workspace where you'll configure compute, write PySpark and SQL queries, and manage data with Unity Catalog's three-level namespace.
Week by week, you'll progress from navigating the platform to transforming DataFrames with select, filter, groupBy, and joins, then to creating Delta Lake tables with ACID transactions, schema enforcement, and time travel. You'll perform real DML operations — INSERT, UPDATE, DELETE, and MERGE — and learn to schedule production pipelines using Databricks Jobs with DAG-based orchestration.
The course runs entirely on Databricks Free Edition, so there's no cloud billing. Six hands-on labs reinforce each module: explore the workspace, write notebook-based transformations, build Delta tables, and wire up an automated workflow. By the end, you'll have built a complete data engineering pipeline from raw ingestion through Delta Lake to scheduled production jobs.
This module introduces the lakehouse paradigm and the Databricks platform. You'll learn about the structure of lakehouse architecture, explore the Databricks workspace and its core tools, and understand how compute and storage work together.
Das ist alles enthalten
6 Videos7 Lektüren1 Aufgabe
Infos zu Modulinhalt anzeigen
6 Videos•Insgesamt 24 Minuten
Data Architecture Evolution•5 Minuten
Lakehouse Architecture•5 Minuten
Databricks and the Lakehouse•3 Minuten
Databricks Overview•4 Minuten
Workspace, Catalog & Data•4 Minuten
Compute Resources•4 Minuten
7 Lektüren•Insgesamt 7 Minuten
About This Course•1 Minute
Key Terms•1 Minute
Reflection•1 Minute
Key Terms•1 Minute
Reflection•1 Minute
Key Terms•1 Minute
Reflection•1 Minute
1 Aufgabe•Insgesamt 2 Minuten
Quiz: Lakehouse Architecture & Platform•2 Minuten
Apache Spark on Databricks
Modul 2•1 Stunde abzuschließen
Moduldetails
This module covers notebooks and hands-on data manipulation using PySpark. You'll create and organize notebooks, load data from the Catalog, and write PySpark transformations to select, filter, aggregate, and join datasets.
Das ist alles enthalten
6 Videos6 Lektüren1 Aufgabe
Infos zu Modulinhalt anzeigen
6 Videos•Insgesamt 28 Minuten
Using Notebooks•4 Minuten
Magic Commands & Utilities•4 Minuten
Loading & Previewing Data•5 Minuten
Spark Core Concepts•3 Minuten
Select & Filter Operations•7 Minuten
GroupBy, Aggregations & Joins•5 Minuten
6 Lektüren•Insgesamt 6 Minuten
Key Terms•1 Minute
Reflection•1 Minute
Databricks Free Edition•1 Minute
Key Terms•1 Minute
Lazy Evaluation•1 Minute
Reflection•1 Minute
1 Aufgabe•Insgesamt 2 Minuten
Quiz: Spark Fundamentals•2 Minuten
Delta Lake Essentials
Modul 3•1 Stunde abzuschließen
Moduldetails
This module introduces Delta Lake, where you'll create Delta tables, perform transactional operations like updates, deletes, and merges, use time travel to query previous versions, and see how Delta Lake connects to governance and automation features.
Das ist alles enthalten
6 Videos7 Lektüren1 Aufgabe
Infos zu Modulinhalt anzeigen
6 Videos•Insgesamt 25 Minuten
What Is Delta Lake•4 Minuten
Delta Lake Concepts•4 Minuten
Creating Delta Tables•6 Minuten
Insert, Update & Merge•5 Minuten
Time Travel•3 Minuten
Jobs, Dashboards & Workflows•4 Minuten
7 Lektüren•Insgesamt 7 Minuten
Key Terms•1 Minute
Reflection•1 Minute
Key Terms•1 Minute
Reflection•1 Minute
Hands-On: MERGE, Updates, and Time Travel (Pure Python Mental Model)•1 Minute
Key Terms•1 Minute
Reflection•1 Minute
1 Aufgabe•Insgesamt 3 Minuten
Quiz: Delta Lake & Workflows•3 Minuten
Capstone
Modul 4•6 Minuten abzuschließen
Moduldetails
Build an end-to-end lakehouse data pipeline integrating every concept from the course. Starting from raw data files, you will construct a complete medallion architecture (bronze → silver → gold) with Delta Lake, implement incremental MERGE logic, and orchestrate the pipeline as a scheduled Databricks Job. Six hands-on lab notebooks guide you through the project using the course GitHub repository.
Do I need a paid Databricks account to take this course?
o. The entire course runs on Databricks Free Edition, which gives you full platform access — notebooks, clusters, Delta Lake, Unity Catalog, and Jobs — with zero cloud billing. You can start immediately without a credit card.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.