Data Engineering: Pipelines, ETL, Hadoop
Completed by Nancy Giang
February 17, 2025
3 hours (approximately)
Nancy Giang's account is verified. Coursera certifies their successful completion of Data Engineering: Pipelines, ETL, Hadoop
What you will learn
Analyze the architecture and components of data pipelines to understand their impact on data flow and processing efficiency.
Implement robust ETL processes designed for scalability and maintainability.
Analyze big data challenges and introduce Hadoop ecosystem tools (HDFS, MapReduce, Hive, Pig, and Spark) for data processing tasks.
Skills you will gain
- Data Pipelines
- Dataflow
- Data Import/Export
- Apache Hive
- Data Strategy
- Big Data
- Extract, Transform, Load
- Data-Driven Decision-Making
- Data Integration
- Data Architecture
- Data Management
- Data Processing

