About this Course

106,909 recent views

Learner Career Outcomes

50%

started a new career after completing these courses

25%

got a tangible career benefit from this course

14%

got a pay increase or promotion
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Flexible deadlines
Reset deadlines in accordance to your schedule.
Intermediate Level
Approx. 43 hours to complete
English

Skills you will gain

Python ProgrammingApache HadoopMapreduceApache Spark

Learner Career Outcomes

50%

started a new career after completing these courses

25%

got a tangible career benefit from this course

14%

got a pay increase or promotion
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn at your own schedule.
Flexible deadlines
Reset deadlines in accordance to your schedule.
Intermediate Level
Approx. 43 hours to complete
English

Offered by

Placeholder

Yandex

Syllabus - What you will learn from this course

Content RatingThumbs Up84%(5,098 ratings)Info
Week
1

Week 1

24 minutes to complete

Welcome

24 minutes to complete
8 videos (Total 14 min), 1 reading
8 videos
Issues BigData can solve1m
BigData Applications1m
What is BigData Essentials?2m
Course Structure2m
Meet Emeli1m
Meet Alexey2m
Meet Ivan1m
1 reading
Slack Channel is the quickest way to get answers to your questions10m
9 hours to complete

What are BigData and distributed file systems (e.g. HDFS)?

9 hours to complete
18 videos (Total 136 min), 10 readings, 4 quizzes
18 videos
File system managing6m
File content exploration 15m
File content exploration 213m
Processes4m
Scaling Distributed File System9m
Block and Replica States, Recovery Process 16m
Block and Replica States, Recovery Process 27m
HDFS Client9m
Web UI, REST API4m
Namenode Architecture8m
Introduction10m
Text formats9m
Binary formats 18m
Binary formats 28m
Compression7m
How to submit your first assignment3m
How to Install Docker on Windows 7, 8, 104m
10 readings
Basic Bash Commands10m
HDFS Lesson Introduction10m
Gentle Introduction into "curl"10m
File formats extra (optional)10m
Grading System: Instructions and Common Problems10m
Docker Installation Guide10m
HDFS CLI Playground30m
Programming Assignment: Instructions and Common Problems10m
FAQ How to show your code to teaching staff10m
Slack channel "Bigdata-coursera" - the quickest to solve technical problems.10m
2 practice exercises
Distributed File Systems30m
Big Data and Distributed File Systems25m
Week
2

Week 2

3 hours to complete

Solving Problems with MapReduce

3 hours to complete
17 videos (Total 94 min), 1 reading, 3 quizzes
17 videos
Unreliable Components 28m
MapReduce4m
Distributed Shell8m
Fault Tolerance7m
Fault Tolerance. Live Demo3m
Streaming7m
Streaming in Python3m
WordCount in Python5m
Distributed Cache4m
Environment, Counters4m
Testing5m
Combiner5m
Partitioner7m
Comparator1m
Speculative Execution / Backup Tasks3m
Compression4m
1 reading
Hadoop Streaming Assignments: Intro and Code Samples10m
3 practice exercises
Hadoop MapReduce Intro30m
MapReduce Streaming30m
Hadoop Streaming Final30m
Week
3

Week 3

5 hours to complete

Solving Problems with MapReduce (practice week)

5 hours to complete
1 video (Total 3 min), 5 readings, 5 quizzes
5 readings
Hadoop Streaming Assignments: Intro and Code Samples10m
Hints to Debug Hadoop Streaming Applications10m
Grading System and Grading System Sandbox User Guide10m
Hadoop Streaming Assignments: Instructions10m
Hint to the "Stop words" programming assignment10m
Week
4

Week 4

3 hours to complete

Introduction to Apache Spark

3 hours to complete
16 videos (Total 95 min), 2 readings, 2 quizzes
16 videos
RDDs8m
Transformations 16m
Transformations 27m
Actions5m
Resiliency6m
Execution & Scheduling6m
Caching & Persistence5m
Broadcast variables5m
Accumulator variables5m
Getting started with Spark & Python6m
Working with text files6m
Joins4m
Broadcast & Accumulator variables5m
Spark UI4m
Cluster mode3m
2 readings
Spark Assignments Intro10m
Instructions for Spark programming assignment10m
2 practice exercises
Lesson 1 Quiz30m
Lesson 2 Quiz30m

Reviews

TOP REVIEWS FROM BIG DATA ESSENTIALS: HDFS, MAPREDUCE AND SPARK RDD

View all reviews

Frequently Asked Questions

More questions? Visit the Learner Help Center.