This course is for aspiring data engineers and analysts. It covers the key skills needed to manage, process, and analyze large datasets using common industry tools. You will learn how to import data into Hadoop HDFS and Hive tables, and how to use Spark to import data directly into HDFS. You will also handle streaming data with Apache Flume and connect relational databases to Hadoop with Apache Sqoop. You will use Apache Zeppelin to develop Spark applications and learn how to install, monitor, and manage Hadoop clusters with Ambari. Finally, the course introduces advanced HDFS features for data management, such as snapshots and NFS mounts. By the end, you will be able to use Hadoop and Spark in practical settings.

Hadoop and Spark Fundamentals: Unit 3

This course is part of the Hadoop and Spark Fundamentals Specialization.


Instructors: Pearson
Gain insight into a topic and learn the fundamentals.
Intermediate level
8 hours to complete
Flexible schedule
Learn at your own pace
What you'll learn
Master advanced techniques for ingesting data into Hadoop HDFS using Hive, Spark, Flume, and Sqoop.
Develop and run interactive Spark applications using the Apache Zeppelin web interface.
Install, monitor, and administer Hadoop clusters with Ambari and essential command-line tools.
Utilize advanced HDFS features such as snapshots and NFS mounts for enhanced data management.
Details to know

Shareable certificate: add to your LinkedIn profile
Assessments: 4 assignments
Taught in English

Build your subject-matter expertise
When you enroll in this course, you'll also be enrolled in the Hadoop and Spark Fundamentals Specialization.
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
