IBM

Introduction to Data Engineering

Rav Ahuja
Priya Kapoor

Instructors: Rav Ahuja

Access provided by Justice Through Code at Columbia University

228,195 already enrolled

Gain insight into a topic and learn the fundamentals.
4.7

(3,432 reviews)

Beginner level

Recommended experience

Flexible schedule
1 week at 10 hours a week
Learn at your own pace
95%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.
4.7

(3,432 reviews)

Beginner level

Recommended experience

Flexible schedule
1 week at 10 hours a week
Learn at your own pace
95%
Most learners liked this course

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

21 assignmentsÂą

AI Graded see disclaimer
Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 4 modules in this course

In this module, you will learn about the different entities that come together to form a modern data ecosystem and the role Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts play in this ecosystem. You will learn what data engineering is and the key tasks in a data engineering lifecycle. You will also gain an understanding of the responsibilities of a data engineer, the skillsets they need in order to be successful, and what a typical day in the life of a data engineer looks like.

What's included

10 videos2 readings4 assignments

In this module, you will learn about the data engineering ecosystem, the different types of data structures, file formats, sources of data, and the languages data professionals use in their day-to-day tasks. You will gain an understanding of several different types of data repositories such as relational and non-relational databases, data warehouses, data marts, and data lakes. You will learn about ETL and ELT processes, data pipelines, and data integration platforms. You will also gain an understanding of what big data is, and the tools used for processing and storing big data. At the end of this module, you will be guided to create an IBM Cloud account, and provision an instance of IBM Db2.

What's included

18 videos4 readings6 assignments1 app item3 plugins

In this module, we will walk you through the data engineering lifecycle. You will learn about the architecture of a data platform, factors for selecting and designing data stores, and the different facets of security as it applies to data platforms and data lifecycle management. You will also learn about the process, steps, and tools used for gathering, importing, wrangling, and querying data. You will gain an understanding of performance monitoring and the steps you can take to troubleshoot performance issues. We will also talk about governance regulations, why we need them, and how technology enables compliance to regulations. During the course of this module, you will be guided to load data from a CSV file into the IBM Db2 instance you created in the previous module. You will also be guided to explore your dataset using some basic SQL queries that will be provided to you.

What's included

10 videos5 readings8 assignments2 app items2 plugins

In this module, you will learn about career opportunities in the field of Data Engineering and the different paths that you can take for getting skilled as a Data Engineer. At the end of the module, you will be presented with the final graded assignment which is divided into two parts. The first part of the final assignment includes a couple of quiz questions and the second part includes open-ended questions that will be reviewed and graded by a peer.

What's included

6 videos2 readings3 assignments1 peer review2 plugins

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.7 (1,153 ratings)
Rav Ahuja
IBM
56 Courses4,371,838 learners

Offered by

IBM

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.7

3,432 reviews

  • 5 stars

    79.54%

  • 4 stars

    16.70%

  • 3 stars

    2.32%

  • 2 stars

    0.43%

  • 1 star

    0.98%

Showing 3 of 3432

PT
5

Reviewed on Apr 7, 2024

AR
5

Reviewed on Sep 25, 2021

MF
5

Reviewed on Mar 25, 2022

Explore more from Information Technology

Âą Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.