In this course, you will be introduced to the data engineering lifecycle, from data generation in source systems, to ingestion, transformation, storage, and serving data to downstream stakeholders. You’ll study the key undercurrents that affect all stages of the lifecycle, and start developing a framework for how to think like a data engineer. To gain hands-on practice, you’ll gather stakeholder needs, translate those needs into system requirements, and choose tools and technologies to build systems that provide business value. By the end of this course you’ll be spinning up batch and streaming data pipelines to serve product recommendations on the AWS cloud!





Introduction to Data Engineering
This course is part of DeepLearning.AI Data Engineering Professional Certificate

Instructor: Joe Reis
Top Instructor
Access provided by Palo Alto Networks
35,283 already enrolled
(434 reviews)
Recommended experience
What you'll learn
- Gain a comprehensive understanding of the data engineering lifecycle and its undercurrents 
- Gather stakeholder needs and translate them into system requirements 
- Design and implement batch and streaming data pipelines on AWS 
Skills you'll gain
Details to know

Add to your LinkedIn profile
6 assignments
See how employees at top companies are mastering in-demand skills

Build your Cloud Computing expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from DeepLearning.AI

There are 4 modules in this course
Gain a high-level overview of the data engineering lifecycle and key undercurrents to understand how data engineers add business value to organizations. Start developing a mental framework for thinking like a data engineer, starting with gathering stakeholder needs and translating them into system requirements. Learn the basics of working on the cloud from an AWS expert.
What's included
18 videos10 readings1 assignment1 app item
Dive deeper into the stages of the data engineering lifecycle and its key undercurrents. Build an end-to-end data pipeline on AWS that encompasses all the stages of the data engineering lifecycle.
What's included
20 videos3 readings1 assignment1 programming assignment
Define data architecture and how it fits within the larger enterprise architecture. Examine the principles of good data architecture and how these principles inform tools and technology choices. Evaluate and optimize the security, performance, reliability, cost-efficiency, and scalability of a web application hosted on AWS.
What's included
22 videos2 readings1 assignment1 programming assignment
Practice gathering stakeholder needs and translating them into system requirements. Choose the appropriate tools and technologies based on the system requirements, then build an end-to-end data system that includes a batch and a streaming component to train a product recommendation system and serves product recommendations to a sales platform.
What's included
17 videos5 readings3 assignments1 programming assignment
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Why people choose Coursera for their career




Learner reviews
434 reviews
- 5 stars84.10% 
- 4 stars11.52% 
- 3 stars2.30% 
- 2 stars1.15% 
- 1 star0.92% 
Showing 3 of 434
Reviewed on Oct 18, 2024
I appreciate the addition of the DE mental framework as it gives a whole lot of context to the more practical knowledge that will be learned in the next courses.
Reviewed on Nov 25, 2024
Easy to follow and lots of good information. Not too technical, but it is the intro :)
Reviewed on Oct 27, 2024
One of the best courses in the platform, it will guide you to the fundamental principles of data engineering and you'll build your first pipeline for stream and batch data







