Wenn Sie sich fĂĽr diesen Kurs anmelden, werden Sie auch fĂĽr diese Spezialisierung angemeldet.
Lernen Sie neue Konzepte von Branchenexperten
Gewinnen Sie ein Grundverständnis bestimmter Themen oder Tools
Erwerben Sie berufsrelevante Kompetenzen durch praktische Projekte
Erwerben Sie ein Berufszertifikat zur Vorlage
In diesem Kurs gibt es 13 Module
You will design and implement enterprise-grade data models, from traditional star schemas to modern lakehouse architectures. This comprehensive course equips you with the skills to build cost-effective, scalable data solutions that drive business intelligence and analytics.
You'll gain hands-on experience creating dimensional models with surrogate keys, optimizing database schemas through partitioning and clustering, and implementing slowly changing dimensions for historical data tracking. The course covers advanced topics like semantic metrics layers, multi-cluster warehouse architectures, and open-source table formats for data lakes.
What makes this course unique is its end-to-end approach to modern data architecture. You'll work with real-world scenarios, from analyzing storage costs to designing data ingestion pipelines that span from raw files to analytics-ready tables.
By completion, you'll confidently architect data solutions that balance performance, cost, and scalability—skills essential for senior data engineering and architecture roles in today's data-driven organizations.
You will examine existing snowflake schemas to pinpoint performance bottlenecks caused by redundant lookup paths and develop systematic approaches for identifying optimization opportunities.
You will construct optimized star-schema dimensional models with proper fact and dimension table structures, implementing surrogate keys and design patterns that maximize query performance for analytical workloads.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 13 Minuten
Star Schema Architectural Principles and Design Patterns•8 Minuten
Building Star Schema Tables with Surrogate Keys•5 Minuten
1 Lektüre•Insgesamt 10 Minuten
Comprehensive Guide to Fact and Dimension Table Design•10 Minuten
2 Aufgaben•Insgesamt 15 Minuten
Enterprise Star Schema Design Challenge•10 Minuten
Star Schema Implementation Validation•5 Minuten
Create Semantic Metrics Layer
Modul 3•1 Stunde abzuschließen
Moduldetails
You will develop standardized semantic metrics layers that ensure consistent business logic across analytics platforms, eliminate metric drift, and provide a unified source of truth for enterprise reporting.
Metrics Standardization Concepts and Implementation Patterns•9 Minuten
Implementing Metrics Definitions with Standardized Business Logic•5 Minuten
1 Lektüre•Insgesamt 8 Minuten
Semantic Layer Architecture for Enterprise Analytics•8 Minuten
2 Aufgaben•Insgesamt 15 Minuten
Semantic Layer Concepts and Implementation Validation•3 Minuten
Comprehensive Semantic Layer Design and Implementation•12 Minuten
Apply Partitioning and Clustering Strategies
Modul 4•1 Stunde abzuschließen
Moduldetails
You will implement advanced partitioning and clustering techniques using SQL DDL commands to optimize query performance for large-scale datasets.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 8 Minuten
Why Database Performance Hits a Wall at Scale•2 Minuten
Partitioning and Clustering Fundamentals for Performance Optimization•6 Minuten
1 Lektüre•Insgesamt 12 Minuten
DDL Syntax and Implementation Patterns for Partitioning and Clustering•12 Minuten
2 Aufgaben•Insgesamt 21 Minuten
Design and Implement a Partitioned Data Warehouse Table•18 Minuten
Partitioning and Clustering Strategy Assessment•3 Minuten
Analyze Normalization vs Performance Trade-offs
Modul 5•1 Stunde abzuschließen
Moduldetails
You will evaluate database normalization levels against query performance requirements to make strategic denormalization decisions for optimizing analytical workloads.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 12 Minuten
Denormalization Strategies for Analytical Workloads•6 Minuten
Analyzing Query Performance Impact of Normalization Levels•6 Minuten
1 Lektüre•Insgesamt 12 Minuten
Normalization Forms and Performance Impact Analysis•12 Minuten
2 Aufgaben•Insgesamt 23 Minuten
Develop a Schema Refactoring Proposal with Performance Justification•20 Minuten
Normalization vs Performance Trade-off Analysis•3 Minuten
Create Entity-Relationship Diagrams
Modul 6•1 Stunde abzuschließen
Moduldetails
You will design and document comprehensive Entity-Relationship diagrams that effectively communicate complex data structures and relationships for enterprise data systems.
Das ist alles enthalten
3 Videos1 LektĂĽre2 Aufgaben1 Unbewertetes Labor
Infos zu Modulinhalt anzeigen
3 Videos•Insgesamt 18 Minuten
When Data Models Become Mission-Critical Communication Tools•3 Minuten
Professional ER Diagram Design and Documentation Standards•6 Minuten
Building Comprehensive ER Diagrams with Professional Tools•9 Minuten
1 Lektüre•Insgesamt 12 Minuten
ER Diagram Components and Advanced Modeling Techniques•12 Minuten
2 Aufgaben•Insgesamt 18 Minuten
ER Diagram Design Knowledge Check•3 Minuten
Comprehensive ER Diagram Design and Documentation Assessment •15 Minuten
1 Unbewertetes Labor•Insgesamt 18 Minuten
Enterprise Data Modeling: Creating Comprehensive ER Diagrams for Complex Data Structures•18 Minuten
Implement Data Pipelines for Historical Changes
Modul 7•1 Stunde abzuschließen
Moduldetails
You will build automated SCD Type 2 pipelines using MERGE statements and window functions to preserve historical data integrity in enterprise environments.
Das ist alles enthalten
2 Videos2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 10 Minuten
Why Historical Data Pipelines Matter for Enterprise Analytics•3 Minuten
SCD Type 2 Implementation with MERGE Statements•7 Minuten
2 Aufgaben•Insgesamt 21 Minuten
Implement Complete SCD Type 2 Pipeline for Product Dimensions•18 Minuten
SCD Implementation Knowledge Check•3 Minuten
Analyze Storage and Compute Cost Trends
Modul 8•1 Stunde abzuschließen
Moduldetails
You will conduct comprehensive cost analysis of data lifecycle patterns to develop strategic archiving recommendations that balance storage economics with business value.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 18 Minuten
Cost Trend Analysis Techniques for Data Archiving•8 Minuten
Calculating Storage Costs and ROI for Archiving Strategies•9 Minuten
1 Lektüre•Insgesamt 10 Minuten
Data Lifecycle Cost Analysis and Storage Economics•10 Minuten
2 Aufgaben•Insgesamt 21 Minuten
Develop Comprehensive Data Archiving Strategy with Cost-Benefit Analysis•18 Minuten
Cost Analysis and Archiving Strategy Knowledge Check•3 Minuten
Create Multi-Cluster Warehouse Architecture
Modul 9•1 Stunde abzuschließen
Moduldetails
You will design scalable multi-cluster data warehouse architectures that isolate workloads for optimal performance while implementing comprehensive cost control and resource management policies.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 9 Minuten
The Business Case for Multi-Cluster Architecture•3 Minuten
Designing Workload Isolation and Resource Management Policies•6 Minuten
1 Lektüre•Insgesamt 10 Minuten
Multi-Cluster Architecture Design Principles and Implementation Patterns•10 Minuten
You will develop analytical frameworks to evaluate and compare the technical capabilities of Delta Lake, Apache Iceberg, and Apache Hudi for specific business requirements.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 13 Minuten
Feature Comparison Matrix: Schema Evolution and Time Travel•5 Minuten
Building Technical Capability Comparison Matrices•8 Minuten
1 Lektüre•Insgesamt 10 Minuten
Comprehensive Analysis of Delta Lake, Iceberg, and Hudi Capabilities•10 Minuten
2 Aufgaben•Insgesamt 23 Minuten
Develop Strategic Table Format Recommendation Framework•20 Minuten
Table Format Analysis and Selection Knowledge Check•3 Minuten
Data Ingestion Pipeline Implementation
Modul 12•1 Stunde abzuschließen
Moduldetails
You will architect and implement automated data ingestion pipelines that orchestrate data movement across medallion architecture zones within lakehouse platforms.
Das ist alles enthalten
2 Videos1 LektĂĽre2 Aufgaben
Infos zu Modulinhalt anzeigen
2 Videos•Insgesamt 10 Minuten
The Cost of Manual Data Pipeline Management•3 Minuten
COPY INTO Commands and Incremental Loading Strategies•7 Minuten
1 Lektüre•Insgesamt 10 Minuten
Medallion Architecture and Pipeline Orchestration Patterns•10 Minuten
2 Aufgaben•Insgesamt 18 Minuten
Data Ingestion Pipeline Implementation Knowledge Check•3 Minuten
Lakehouse Data Pipeline Implementation Assessment•15 Minuten
Project: Data Modeling and Lakehouse Architecture with SQL
Modul 13•2 Stunden abzuschließen
Moduldetails
You will design and implement a comprehensive data lakehouse architecture that integrates dimensional modeling, schema optimization, cost management, and multi-format data ingestion. This project synthesizes advanced SQL skills to create a production-ready data engineering solution.
Das ist alles enthalten
4 LektĂĽren1 Aufgabe
Infos zu Modulinhalt anzeigen
4 Lektüren•Insgesamt 90 Minuten
Why This Project Matters•10 Minuten
Project Requirements•10 Minuten
Graded Assignment: Enterprise Data Lakehouse Architecture•60 Minuten
Solution Key•10 Minuten
1 Aufgabe•Insgesamt 15 Minuten
Graded Quiz: Enterprise Data Lakehouse
Architecture•15 Minuten
Erwerben Sie ein Karrierezertifikat.
FĂĽgen Sie dieses Zeugnis Ihrem LinkedIn-Profil, Lebenslauf oder CV hinzu. Teilen Sie sie in Social Media und in Ihrer Leistungsbeurteilung.
Coursera brings together a diverse network of subject matter experts who have demonstrated their expertise through professional industry experience or strong academic backgrounds. These instructors design and teach courses that make practical, career-relevant skills accessible to learners worldwide.
What is a SQL-based lakehouse architecture in this course?
In this course, a SQL-based lakehouse architecture is a way to organize file-based storage, warehouse tables, and analytics models so they work as one system. The emphasis is on designing that system end to end, including dimensional models, ingestion paths, performance tuning, and consistent metrics.
When would you use a lakehouse architecture?
You would use it when your data lives partly in files and partly in structured tables, but you still want one analytical setup for querying and modeling. In this course, it is treated as a practical approach for combining raw data ingestion, historical tracking, and analytics-ready tables in one design.
How does a lakehouse architecture fit into a broader workflow?
It sits between raw data collection and business analysis, giving you the structure that turns incoming data into dependable analytical tables. The course shows how it connects ingestion, dimensional modeling, historical change handling, and metric standardization into a repeatable workflow.
How is a lakehouse architecture different from a traditional data warehouse?
A traditional data warehouse centers on loading everything into warehouse tables first, while a lakehouse also works with file-based storage through structured querying and external table patterns. This course shows that the lakehouse approach still relies on warehouse design principles such as star schemas, surrogate keys, and performance optimization.
Do you need any prerequisites before learning SQL-based lakehouse architecture?
A working knowledge of SQL and core data warehousing ideas, such as joins, fact tables, and dimension tables, is helpful. Because the course is advanced, it focuses more on architecture decisions, optimization, and historical data management than on beginner database basics.
What tools, platforms, or methods are used in this course?
The course centers on SQL in a modern warehouse and lakehouse setting, with a focus on dimensional modeling and external-table-based ingestion. Performance tuning and historical change handling are also part of the implementation work.
What specific tasks will you practice or complete in this course?
You’ll design dimensional models with the right grain and surrogate keys, then optimize those schemas for analytical use. You’ll also build ingestion paths from file storage, manage historical changes in dimensions, and define consistent business metrics.
Finanzielle UnterstĂĽtzung verfĂĽgbar, weitere Informationen
Âą Einige Aufgaben in diesem Kurs werden mit AI bewertet. FĂĽr diese Aufgaben werden Ihre Daten in Ăśbereinstimmung mit Datenschutzhinweis von Courseraverwendet.