Back to Advanced Data Engineering
Duke University

Advanced Data Engineering

In this advanced course, you will gain practical expertise in scaling data engineering systems using cutting-edge tools and techniques. This course is designed for data scientists, data engineers, and anyone with a foundational understanding of data handling who desires to escalate their skills to handle larger, more complex datasets efficiently. Throughout the course, you'll master the application of technologies such as Celery with RabbitMQ for scalable data consumption, Apache Airflow for optimized workflow management, and Vector and Graph databases for robust data management at scale. The course will culminate with hands-on projects that offer real-world experience, where you'll put your acquired skills to test in solving data engineering challenges. You will not only learn to create scalable data systems but also to analyze their performance and make necessary adjustments for optimum results. This invaluable experience in advanced data engineering techniques will prepare you for the demanding tasks of handling massive datasets, streamlining complex workflows, and optimizing data operations for businesses of any scale.

Status: Data Pipelines
Status: Scalability
IntermediateCourse23 hours

Featured reviews

ND

5.0Reviewed Aug 21, 2024

Great learning resources that will be useful long after completing the course, concise presentations, and clear explanations of all topics

KA

4.0Reviewed Sep 10, 2024

Having taken this course, added to my data engineering skills additional tools such as RabbitMQ, VectorDB and AWS DynamoDB.

All reviews

Showing: 4 of 4

Nicole D
5.0
Reviewed Aug 22, 2024
PANAGIOTIS MITSIOS
5.0
Reviewed Dec 26, 2025
Kanatbek Abdurasulov
4.0
Reviewed Sep 11, 2024
Heino H. Gehlsen
1.0
Reviewed Jan 10, 2026