In this advanced course, you will gain practical expertise in scaling data engineering systems using cutting-edge tools and techniques. This course is designed for data scientists, data engineers, and anyone with a foundational understanding of data handling who desires to escalate their skills to handle larger, more complex datasets efficiently.



Advanced Data Engineering
This course is part of Large Language Model Operations (LLMOps) Specialization


Instructors: Noah Gift
Access provided by Stanford University
4,504 already enrolled
(16 reviews)
Recommended experience
What you'll learn
Create and manage data pipelines and their lifecycle
Connect and work with message queues to manage data processing
Use vector, graph, and key/value databases for data storage at scale
Skills you'll gain
- Data Infrastructure
- Real Time Data
- Data Architecture
- Database Management
- Performance Analysis
- Middleware
- Data Import/Export
- Database Systems
- Performance Tuning
- Dataflow
- Apache Airflow
- Data Transformation
- Operational Databases
- Database Management Systems
- Data Warehousing
- Workflow Management
- MySQL
- Scalability
- Data Pipelines
Details to know

Add to your LinkedIn profile
14 assignments
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 4 modules in this course
In this module, you will learn about databases and queues. You will find out the purpose and components of RabbitMQ including its use of queues and integration with Celery. Through hands-on exercises, they will gain experience connecting Celery to RabbitMQ within a Flask application and implementing task patterns like fire and forget and result retrieval. The course also covers core MySQL skills like interacting via the command line interface, manipulating databases, and integrating with Python web apps. By the end, students will have a foundational understanding of RabbitMQ, Celery, and MySQL that allows them to start building modern, asynchronous applications backed by a database.
What's included
22 videos15 readings4 assignments1 discussion prompt1 ungraded lab
What's included
17 videos13 readings4 assignments
In this module, we explore vector and graph databases, powerful tools for managing and extracting insights from large, complex datasets. As data volumes continue to grow, scalability is crucial. We'll learn how vector and graph databases can efficiently store data while maintaining relationships, enabling more advanced analytics. Through real-world examples, you'll see how these databases unlock scalability for machine learning, fraud detection, social networks, and more.
What's included
14 videos11 readings3 assignments1 ungraded lab
In this final module, you will work on advanced real-world data engineering projects, applying everything you've learned. You'll encounter complex data challenges and devise solutions using the latest tools and techniques. This is an opportunity to bring together data engineering concepts covered throughout the course and implement them holistically to deliver impactful outcomes.
What's included
13 videos10 readings3 assignments2 ungraded labs
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Offered by
Why people choose Coursera for their career




Learner reviews
16 reviews
- 5 stars
62.50%
- 4 stars
31.25%
- 3 stars
0%
- 2 stars
0%
- 1 star
6.25%
Showing 3 of 16
Reviewed on Sep 10, 2024
Having taken this course, added to my data engineering skills additional tools such as RabbitMQ, VectorDB and AWS DynamoDB.
Reviewed on Aug 21, 2024
Great learning resources that will be useful long after completing the course, concise presentations, and clear explanations of all topics
Explore more from Computer Science
DeepLearning.AI
Coursera Instructor Network