In this course, you will learn about the raw ingredients and processes that are used to physically store data on disk and in memory. You’ll explore different storage systems, including object, block, and file storage, as well as databases, that are built on top of these raw ingredients. You’ll also get a chance to use the Cypher language to query a Neo4j graph database, and perform vector similarity search, a key feature behind generative AI and large language models. You will explore the evolution of data storage abstractions, from data warehouses, to data lakes, and data lakehouses, while comparing the advantages and drawbacks of each architectural paradigm. With hands-on practice, you will design a simple data lake using Amazon Glue, and build a data lakehouse using AWS LakeFormation and Apache Iceberg. In the last week of this course, you’ll see how queries work behind the scenes, practice writing more advanced SQL queries, compare the query performance in row vs column-oriented storage, and perform streaming queries using Apache Flink.





Data Storage and Queries
This course is part of DeepLearning.AI Data Engineering Professional Certificate

Instructor: Joe Reis
Top Instructor
Access provided by Somaiya Vidyavihar University
6,806 already enrolled
(74 reviews)
Recommended experience
What you'll learn
Design storage architectures for various use cases, and select appropriate technologies to implement these architectures
Practice common query patters and identify ways to improve query performance and enhance the value of your data systems
Skills you'll gain
Details to know

Add to your LinkedIn profile
3 assignments
See how employees at top companies are mastering in-demand skills

Build your Cloud Computing expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from DeepLearning.AI

There are 3 modules in this course
What's included
16 videos12 readings1 assignment1 programming assignment1 ungraded lab
What's included
16 videos2 readings1 assignment1 programming assignment1 ungraded lab
What's included
15 videos4 readings1 assignment1 programming assignment2 ungraded labs
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Why people choose Coursera for their career




Learner reviews
74 reviews
- 5 stars
84%
- 4 stars
10.66%
- 3 stars
2.66%
- 2 stars
2.66%
- 1 star
0%
Showing 3 of 74
Reviewed on May 24, 2025
Excellent course, Iceberg is still a new thing but the way the tutor take us from the need of data lake to data lake house and then iceberg it's great.
Reviewed on Apr 24, 2025
This is a really excellent course covering a number of topics that anyone going into data engineering should be familiar with.
Reviewed on Sep 15, 2025
Solid! bit of all but too in depth nor too practice oriented
Explore more from Information Technology
Google Cloud
Universidad Nacional Autónoma de México
DeepLearning.AI