Data Engineering: Pipelines, ETL, Hadoop
Completed by Nancy Giang
February 17, 2025
3 hours (approximately)
Nancy Giang's account is verified. Coursera certifies their successful completion of Data Engineering: Pipelines, ETL, Hadoop
What you will learn
Analyse the architecture and components of data pipelines to understand their impact on data flow and processing efficiency.
Implement robust ETL processes, for scalability and maintainability.
Analyze big data challenges and introduce Hadoop ecosystem tools (HDFS, MapReduce, Hive, Pig, and Spark) for data processing tasks.
Skills you will gain
- Category: Data Architecture
- Category: Data Pipelines
- Category: Data Management
- Category: Data Warehousing
- Category: Data Analysis
- Category: Data Transformation
- Category: Extract, Transform, Load
- Category: Data Strategy
- Category: Data-Driven Decision-Making
- Category: Data Import/Export
- Category: Data Processing
- Category: Apache Spark

