The course “HDFS Architecture and Programming” offers a comprehensive understanding of the Hadoop Distributed File System (HDFS) architecture, components, and advanced programming techniques. You will gain practical experience in setting up and configuring Hadoop for Java development, while mastering key concepts such as file and directory CRUD operations, data compression, and serialization. By the end of the course, you will be proficient in using HDFS to handle large-scale data processing, enabling you to build scalable, high-availability solutions.



HDFS Architecture and Programming
This course is part of Big Data Processing Using Hadoop Specialization

Instructor: Karthik Shyamsunder
Access provided by PwC India
Recommended experience
What you'll learn
- Understand HDFS architecture, components, and how it ensures scalability and availability for big data processing. 
- Learn to configure Hadoop for Java programming and perform file CRUD operations using HDFS APIs. 
- Master advanced HDFS programming concepts like compression, serialization, and working with specialized file structures like Sequence and Map files. 
Skills you'll gain
Details to know

Add to your LinkedIn profile
9 assignments
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 4 modules in this course
This course provides a comprehensive understanding of Hadoop Distributed File System (HDFS) architecture and its key components. Students will gain hands-on experience with HDFS, learning how to set up Java programming environments and configure Hadoop. The course covers essential topics such as the HDFS programming model, file and directory CRUD operations, and compression techniques. You will also explore serialization, deserialization, and specialized file structures like Sequence and Map Files. By the end of the course, You will be equipped to leverage HDFS for scalable, highly available big data solutions.
What's included
2 readings
In this module, we will cover the working model and architecture behind Hadoop Distributed File System (HDFS) 1.0 and the capabilities and deficiencies of HDFS 1.0 architecture.
What's included
6 videos4 readings3 assignments
In this module, we will cover HDFS programming concepts, HDFS API, and steps to write an HDFS client program for CRUD (Create, Read, Update and Delete) on files.
What's included
6 videos5 readings3 assignments
In this module, we will cover HDFS advanced programming concepts, such as CRUD on directories, compression, serialization and deserialization, and file-based data structures like sequence files.
What's included
6 videos5 readings3 assignments
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Why people choose Coursera for their career




Explore more from Information Technology
 - University of California San Diego 
 - Johns Hopkins University 
 - Johns Hopkins University 


