Showcase your skills in this Data Engineering project! In this course you will apply a variety of data engineering skills and techniques you have learned as part of the previous courses in the IBM Data Engineering Professional Certificate.



Data Engineering Capstone Project
This course is part of IBM Data Engineering Professional Certificate


Instructors: Rav Ahuja
Access provided by Kalinga Institute of Industrial Technology
17,630 already enrolled
(130 reviews)
Recommended experience
What you'll learn
Demonstrate proficiency in skills required for an entry-level data engineering role.
Design and implement various concepts and components in the data engineering lifecycle such as data repositories.
Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.
Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.
Skills you'll gain
Details to know

Add to your LinkedIn profile
See how employees at top companies are mastering in-demand skills

Build your Data Management expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from IBM

There are 7 modules in this course
In this module, you will design a data platform that uses MySQL as an OLTP database. You will be using MySQL to store the OLTP data.
What's included
2 videos2 assignments1 app item2 plugins
In this module, you will design a data platform that uses MongoDB as a NoSQL database. You will use MongoDB to store the e-commerce catalog data.
What's included
1 video2 assignments1 app item
In this module you will design and implement a data warehouse and you will then generate reports from the data in the data warehouse.
What's included
2 videos1 reading3 assignments3 app items1 plugin
In this module, you will assume the role of a data engineer at an e-commerce company. Your company has finished setting up a data warehouse. Now you are assigned the responsibility to design a reporting dashboard that reflects the key metrics of the business.
What's included
1 video5 readings2 assignments5 plugins
In this module, you will use the given python script to perform various ETL operations that move data from RDBMS to NoSQL, NoSQL to RDBMS, and from RDBMS, NoSQL to the data warehouse. You will write a pipeline that analyzes the web server log file, extracts the required lines and fields, transforms and loads data.
What's included
2 videos3 assignments2 app items
In this module, you will use the data from a webserver to analyse search terms. You will then load a pretrained sales forecasting model and predict the sales forecast for a future year.
What's included
1 video2 assignments2 app items
In this final module you will complete your submission of screenshots from the hands-on labs for your peers to review. Once you have completed your submission you will then review the submission of one of your peers and grade their submission.
What's included
2 readings1 peer review
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors


Offered by
Why people choose Coursera for their career




Learner reviews
130 reviews
- 5 stars
83.07%
- 4 stars
10.76%
- 3 stars
2.30%
- 2 stars
1.53%
- 1 star
2.30%
Showing 3 of 130
Reviewed on Mar 9, 2024
The Capstone was a bit of an anticlimax. I was expecting a very challenging Capstone, but found a "follow the instructions" approach which made it seem too simple. I'm not complaining ;-)
Reviewed on Aug 13, 2023
Great course to learn the fundamentals to become a very good Data Engineer !
Reviewed on Feb 8, 2025
good course for who want to become a Data Engineer
Explore more from Information Technology
Âą Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.