Northeastern University
Data Warehousing and Integration Part 1

Découvrez de nouvelles compétences avec 120 $ de réduction sur les cours dispensés par des experts de l'industrie. Économisez maintenant.

Ce cours n'est pas disponible en Français (France)

Nous sommes actuellement en train de le traduire dans plus de langues.
Northeastern University

Data Warehousing and Integration Part 1

Venkat Krishnamurthy

Instructeur : Venkat Krishnamurthy

Inclus avec Coursera Plus

Obtenez un aperçu d'un sujet et apprenez les principes fondamentaux.
2 semaines à compléter
à 10 heures par semaine
Planning flexible
Apprenez à votre propre rythme
Obtenez un aperçu d'un sujet et apprenez les principes fondamentaux.
2 semaines à compléter
à 10 heures par semaine
Planning flexible
Apprenez à votre propre rythme

Compétences que vous acquerrez

  • Catégorie : SQL
  • Catégorie : Business Intelligence
  • Catégorie : Database Design
  • Catégorie : Data Integration
  • Catégorie : Data Analysis
  • Catégorie : Data Modeling
  • Catégorie : Data Governance
  • Catégorie : Star Schema
  • Catégorie : Extract, Transform, Load
  • Catégorie : Data Mining
  • Catégorie : Data Quality
  • Catégorie : Relational Databases
  • Catégorie : Data Warehousing
  • Catégorie : Data Mart
  • Catégorie : Data Architecture

Détails à connaître

Certificat partageable

Ajouter à votre profil LinkedIn

Récemment mis à jour !

août 2025

Évaluations

13 devoirs

Enseigné en Anglais

Découvrez comment les employés des entreprises prestigieuses maîtrisent des compétences recherchées

 logos de Petrobras, TATA, Danone, Capgemini, P&G et L'Oreal

Il y a 7 modules dans ce cours

This module introduces data warehousing and business intelligence, emphasizing their role in enhancing organizational decision-making. Data warehouses transform raw data into actionable insights using processes like ETL (Extract, Transform, and Load), supported by tools such as OLAP for querying and data mining. While operational databases (OLTP) are suited for daily transactions, OLAP databases are optimized for complex analytics.

Inclus

3 vidéos6 lectures1 devoir

This module builds on the foundations of database design from the previous module, focussing on relational database modeling, normalization, and SQL. The readings will guide you in translating a conceptual EER diagram into a relational model, ensuring adherence to normalization principles and aiming for Third Normal Form (3NF). We’ll also emphasize understanding primary keys and foreign keys for maintaining data integrity and establishing table relationships. You will also have the opportunity to create and critique relational models. We’ll then explore SQL basics, covering syntax (SELECT, INSERT, UPDATE, DELETE), querying techniques (WHERE, ORDER BY, JOIN), and operations involving functions and aggregates (COUNT, SUM, AVG, MIN, MAX), which are fundamental in database querying and management.

Inclus

3 lectures2 devoirs1 élément d'application

This module provides an introduction to data warehouse concepts. Data warehouses are based on a multidimensional model. We will look closely into the multidimensional model and its representation as data cubes (also known as hypercubes). We’ll examine how different aspects of data are categorized into facts, measures, and dimensions. Dimensions such as Product, Time, and Customer are organized hierarchically within a cube, allowing data to be analyzed at various levels of detail. Measures such as Quantity and Sales Amount are stored within these cubes, and analysts can navigate through different levels of detail using "rolling up" and "drilling down" techniques. We will also explore key concepts such as granularity, dimension schema, and member hierarchies, which are essential in understanding how data is structured and analyzed in multidimensional models. Finally, we will learn to use techniques such as disjointness, completeness, and correctness to ensure data accuracy and integrity when aggregating information in data cubes, collectively known as summarizability.

Inclus

2 vidéos5 lectures2 devoirs1 élément d'application

In this module we’ll explore conceptual modeling with multidimensional models, visualized using MultiDim. This approach helps us organize data into facts and dimensions and understand the relationships between them, which is essential for designing data warehouses. We’ll explore topics such as dimensions (e.g., date, customer) and measures (e.g., quantity, total sales) in more detail. We’ll also explore the difference between primary events and secondary events and learn how they are used. Finally, we will look at another categorization of Measures into Flow: Level and Unit Measures.

Inclus

2 vidéos4 lectures3 devoirs

In this module, we’ll dive into conceptual modeling of hierarchies within data warehouses, exploring their definitions, characteristics, and significance. Balanced hierarchies have a uniform structure where each child has one parent and all branches are of the same length, making data analysis consistent and efficient. In contrast, unbalanced hierarchies have varying branch lengths and missing aggregation levels, offering flexibility to model real-world scenarios like product categories and geographical hierarchies. You’ll also be introduced to generalized hierarchies, which involve "is-a" relationships between supertypes and subtypes, allowing for detailed data representation but requiring careful management of aggregation and specialization. We’ll also explore alternative hierarchies, showcasing different ways to organize the same dimension, such as calendar vs. fiscal views of time. Finally, we’ll look at parallel hierarchies, both independent and dependent, as tools for analyzing data from multiple perspectives, representing complex organizational structures. Understanding these hierarchy types is crucial for effective data management and analysis in data warehousing.

Inclus

4 vidéos3 lectures2 devoirs

In this module, you’ll explore logical modeling in data warehousing, which is the process of designing a structured, abstract representation of data to be stored, focusing on how data is organized, related, and optimized for efficient querying and analysis. Building on what you learned in the previous modules, you'll take the next step in data warehouse design: translating a conceptual model into a logical model for implementation. The module will focus on the relational representation of data warehouses, including the study of various schema implementations: star, snowflake, starflake, and constellation. You'll also examine the rules for mapping a multidimensional conceptual model to a relational model, highlighting the role and importance of different types of keys in this process. We'll also discuss strategies for maintaining consistency in a data warehouse. Finally, you'll explore how to pre-populate certain dimensions, like time, to streamline operations and improve query performance.

Inclus

6 vidéos11 lectures2 devoirs1 élément d'application

Designing a data warehouse is a complex process that requires transitioning from high-level conceptual models to detailed logical models. This transition is critical because it bridges the gap between understanding business needs and translating them into a technical framework that effectively supports those needs. In this module, you’ll expand on the logical modeling process covered in the previous module, with a particular focus on dimensional model design and the intricacies of hierarchy modeling. As you delve deeper, you’ll encounter logical modeling for advanced concepts such as many-to-many dimensions, links between facts, and facts with multiple granularities. We’ll also explore the concept of Slowly Changing Dimensions (SCDs), which are essential for managing historical data in your warehouse. You’ll learn how to implement different SCD types to accurately track and manage changes in dimension data over time. Finally, we’ll touch on SQL for OLAP, focusing on advanced concepts like aggregation and window functions, and you’ll learn how to use SQL to query and analyze data warehouses.

Inclus

5 vidéos11 lectures1 devoir

Obtenez un certificat professionnel

Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.

Instructeur

Venkat Krishnamurthy
Northeastern University
3 Cours309 apprenants

Offert par

En savoir plus sur Data Analysis

Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?

Felipe M.
Étudiant(e) depuis 2018
’Pouvoir suivre des cours à mon rythme à été une expérience extraordinaire. Je peux apprendre chaque fois que mon emploi du temps me le permet et en fonction de mon humeur.’
Jennifer J.
Étudiant(e) depuis 2020
’J'ai directement appliqué les concepts et les compétences que j'ai appris de mes cours à un nouveau projet passionnant au travail.’
Larry W.
Étudiant(e) depuis 2021
’Lorsque j'ai besoin de cours sur des sujets que mon université ne propose pas, Coursera est l'un des meilleurs endroits où se rendre.’
Chaitanya A.
’Apprendre, ce n'est pas seulement s'améliorer dans son travail : c'est bien plus que cela. Coursera me permet d'apprendre sans limites.’
Coursera Plus

Ouvrez de nouvelles portes avec Coursera Plus

Accès illimité à 10,000+ cours de niveau international, projets pratiques et programmes de certification prêts à l'emploi - tous inclus dans votre abonnement.

Faites progresser votre carrière avec un diplôme en ligne

Obtenez un diplôme auprès d’universités de renommée mondiale - 100 % en ligne

Rejoignez plus de 3 400 entreprises mondiales qui ont choisi Coursera pour les affaires

Améliorez les compétences de vos employés pour exceller dans l’économie numérique

Foire Aux Questions