The fundamental-level course is typically designed for individuals with a basic understanding of data storage and processing concepts but little to no prior experience with building data lakes on AWS specifically. After a brief introduction to Data Lakes, we'll introduce data ingestion, cataloging and preparation, concluding with an overview of querying data with Amazon Athena. The course will continue with an AWS Lake Formation overview, including a hands-on lab where you'll build a data lake. We'll then introduce data processing and analytics leveraing AWS Glue before diving into automated data lake creatiokn using Lake Formation blueprints. Finally, we'll close with Modern Data Architectures on AWS with a lab that covers publishing and consuming data products as a service.



Building Data Lakes on AWS
This course is part of AWS Cloud Solutions Architect Professional Certificate


Instructors: Rafael Lopes
Access provided by Yale
37,242 already enrolled
(303 reviews)
Recommended experience
What you'll learn
- Apply data lake methodologies in planning and designing a data lake. 
- Describe the components and services required for building an AWS data lake. 
- Compare the ways data can be ingested, stored, and transformed in a data lake. 
- Explain how to secure a data lake with appropriate permissions. 
Skills you'll gain
- Data Science
- Data Transformation
- Data Storage
- Amazon Web Services
- Data Import/Export
- Data Architecture
- Data Lakes
- AWS Identity and Access Management (IAM)
- Data Processing
- Data Management
- Data Governance
- Machine Learning
- Data Warehousing
- Query Languages
- Amazon S3
- Data Engineering
- Data Infrastructure
- Data Visualization
Details to know

Add to your LinkedIn profile
7 assignments
See how employees at top companies are mastering in-demand skills

Build your Data Analysis expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from Amazon Web Services

There are 6 modules in this course
This module provides an overview of data lakes, their purpose, and how they differ from data warehouses. It also covers the components and architectures involved in data lakes.
What's included
5 videos1 reading2 assignments1 plugin
This module focuses on the processes of ingesting data into a data lake, cataloging the data, and preparing it for analysis. It covers topics such as data lake storage, data ingestion methods, crawling and cataloging data, data formatting, partitioning, compression, and querying data with Amazon Athena.
What's included
10 videos1 reading1 assignment1 plugin
This module introduces AWS Lake Formation, a service that helps build and manage data lakes on AWS. It covers the basic permission model, and provides an overview of the service’s features and capabilities.
What's included
3 videos1 reading1 assignment1 app item
This module covers data transformation techniques and tools like AWS Glue for processing and analyzing data in the data lake. It includes hands-on demos and a technical talk on Glue and Athena Federated Queries.
What's included
7 videos1 assignment1 discussion prompt
This module explores advanced features and configurations of AWS Lake Formation, including blueprints, workflows, and fine-grained access control. It also covers data visualization with Amazon QuickSight.
What's included
5 videos1 assignment1 app item
This module introduces the concept of modern data architecture and its implementation on AWS. It covers data movement scenarios, data sharing models, and relevant readings.
What's included
6 videos3 readings1 assignment1 app item1 plugin
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors


Offered by
Why people choose Coursera for their career




Learner reviews
303 reviews
- 5 stars80.52% 
- 4 stars13.20% 
- 3 stars3.63% 
- 2 stars0.99% 
- 1 star1.65% 
Showing 3 of 303
Reviewed on Feb 23, 2025
Great must learn course of data scientist professional.
Reviewed on Jul 31, 2022
thank for such nicely design course for learning and handson
Reviewed on Feb 7, 2023
The best aws course is taught by two excellent instructors who use relevant examples, demonstrations, graphics, and content that is carefully organized.





