University of California San Diego
Hadoop Platform and Application Framework
University of California San Diego

Hadoop Platform and Application Framework

Natasha Balac, Ph.D.
Paul Rodriguez
Andrea Zonca

Instructors: Natasha Balac, Ph.D.

Access provided by Goldman Sachs

150,778 already enrolled

Gain insight into a topic and learn the fundamentals.
4.0

(3,325 reviews)

3 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
83%
Most learners liked this course
Gain insight into a topic and learn the fundamentals.
4.0

(3,325 reviews)

3 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace
83%
Most learners liked this course

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

11 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 5 modules in this course

Welcome to the first module of the Big Data Platform course. This first module will provide insight into Big Data Hype, its technologies opportunities and challenges. We will take a deeper look into the Hadoop stack and tool and technologies associated with Big Data solutions.

What's included

7 videos4 readings1 assignment

In this module we will take a detailed look at the Hadoop stack ranging from the basic HDFS components, to application execution frameworks, and languages, services.

What's included

10 videos6 readings3 assignments

In this module we will take a detailed look at the Hadoop Distributed File System (HDFS). We will cover the main design goals of HDFS, understand the read/write process to HDFS, the main configuration parameters that can be tuned to control HDFS performance and robustness, and get an overview of the different ways you can access data on HDFS.

What's included

9 videos5 readings3 assignments

This module will introduce Map/Reduce concepts and practice. You will learn about the big idea of Map/Reduce and you will learn how to design, implement, and execute tasks in the map/reduce framework. You will also learn the trade-offs in map/reduce and how that motivates other tools.

What's included

9 videos3 readings1 assignment2 programming assignments

Welcome to module 5, Introduction to Spark, this week we will focus on the Apache Spark cluster computing framework, an important contender of Hadoop MapReduce in the Big Data Arena. Spark provides great performance advantages over Hadoop MapReduce,especially for iterative algorithms, thanks to in-memory caching. Also, gives Data Scientists an easier way to write their analysis pipeline in Python and Scala,even providing interactive shells to play live with data.

What's included

10 videos4 readings3 assignments2 programming assignments

Instructors

Instructor ratings
3.8 (93 ratings)
Natasha Balac, Ph.D.
University of California San Diego
4 Courses219,622 learners
Paul Rodriguez
University of California San Diego
3 Courses183,863 learners

Offered by

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

4.0

3,325 reviews

  • 5 stars

    45.33%

  • 4 stars

    28.08%

  • 3 stars

    12.37%

  • 2 stars

    6.77%

  • 1 star

    7.43%

Showing 3 of 3325

DZ
4

Reviewed on Dec 21, 2015

T
5

Reviewed on Jun 26, 2016

MT
4

Reviewed on Oct 5, 2016

Explore more from Data Science