Chevron Left
Back to Big Data Essentials: HDFS, MapReduce and Spark RDD

Learner Reviews & Feedback for Big Data Essentials: HDFS, MapReduce and Spark RDD by Yandex

553 ratings
149 reviews

About the Course

Have you ever heard about such technologies as HDFS, MapReduce, Spark? Always wanted to learn these new tools but missed concise starting material? Don’t miss this course either! In this 6-week course you will: - learn some basic technologies of the modern Big Data landscape, namely: HDFS, MapReduce and Spark; - be guided both through systems internals and their applications; - learn about distributed file systems, why they exist and what function they serve; - grasp the MapReduce framework, a workhorse for many modern Big Data applications; - apply the framework to process texts and solve sample business cases; - learn about Spark, the next-generation computational framework; - build a strong understanding of Spark basic concepts; - develop skills to apply these tools to creating solutions in finance, social networks, telecommunications and many other fields. Your learning experience will be as close to real life as possible with the chance to evaluate your practical assignments on a real cluster. No mocking, a friendly considerate atmosphere to make the process of your learning smooth and enjoyable. Get ready to work with real datasets alongside with real masters! Special thanks to: - Prof. Mikhail Roytberg, APT dept., MIPT, who was the initial reviewer of the project, the supervisor and mentor of half of the BigData team. He was the one, who helped to get this show on the road. - Oleg Sukhoroslov (PhD, Senior Researcher at IITP RAS), who has been teaching MapReduce, Hadoop and friends since 2008. Now he is leading the infrastructure team. - Oleg Ivchenko (PhD student APT dept., MIPT), Pavel Akhtyamov (MSc. student at APT dept., MIPT) and Vladimir Kuznetsov (Assistant at P.G. Demidov Yaroslavl State University), superbrains who have developed and now maintain the infrastructure used for practical assignments in this course. - Asya Roitberg, Eugene Baulin, Marina Sudarikova. These people never sleep to babysit this course day and night, to make your learning experience productive, smooth and exciting....

Top reviews

Nov 21, 2018

Everything in this course is new to me, but it provides me with many practice so I can gradually get familiar with all these new stuff. I find it a bit challenging, but overall it's quite good.

May 9, 2019

The course takes you from basic level , step level .But It is quite fast for beginners , you may need pause video in between and try to understand the concept.

Filter by:

51 - 75 of 147 Reviews for Big Data Essentials: HDFS, MapReduce and Spark RDD

By Juan L

Apr 22, 2018

This course really takes you deep into Hadoop with the technical stuff.

By Sivakumar P

Jun 26, 2020

Its very useful course, good trainers are explained very detail.

By Aliaksei T

May 15, 2018

This course very nice and cool, sometimes I want just stop it =)

By amanpreet k

May 23, 2018

This week was pretty good and insightful around Map Reduce

By Hasmik D

Apr 13, 2021

It was very interesting and useful course. Thank you.

By Álvaro L

Mar 6, 2019

Awesome course, goes very deep into the details.

By Irfan S

Nov 1, 2017

Excellent course content covering in depth.

By Garvish

Dec 26, 2017

This is really Intermediate level course.

By Navneet n

Nov 28, 2018

Awesome content...great learning ...:)

By antul k

May 26, 2019

Great Content if you are a beginner.

By shubham m

Aug 20, 2019

Very Nice..............Intraction

By kebize m

Mar 21, 2019

<Good learning >

By Shaik M

Jan 28, 2020

good knowledge

By Aman A

Jul 30, 2019

Great Course.


Jul 19, 2021

it is good

By Rodrigo S

Feb 4, 2018


By Alok K

Sep 16, 2019

Very Good

By Anshika M

Jun 19, 2019


By Marwen B A

Nov 4, 2019


By Minh T

Aug 24, 2019


By Lemohang

Jul 5, 2020


By swapnil c

Dec 29, 2018


By Konstantin S

Mar 16, 2020

The course undoubtedly provides high value and does a good job of introduction of a learner to the basics of big data technologies. I would highly recommend to do the Honors tasks along with the required assignments. There is still a room for improvement, though. The English level does not obstruct the understanding of the material but could be better. Also accurate subtitles, better graphic material, clearer task requirements and more stable grader system would be welcome. The last point might not be a problem anymore, as the grader system seem to have been overhauled in the last few weeks and have been working fine for me since. The slack support channel is active. Kudos to the teachers.

By Mohamed H

Dec 29, 2019

The course is very useful and gives you the basics you need about HDFS, Map-Reduce in python (there is no java in this course) and pyspark. The assignments are straightforward, however you may face issues in the docker and in the grader system. The cons of this course is that sometimes information is given in a fast pace which somehow can make you get confused and unable to fully digest the material. Also there is no interaction at all from the instructors, it'll be nice if they can keep up with students' questions and issues in the future!

By Dorofei

Feb 14, 2020

Потратил больше времени на то, чтобы Grader правильно принял решения, чем времени на решение задачи. Потратить 3-4 часа на решение исходной задачи и потратить 10 часов (включая форум и Slack) на то чтобы ответ правильно принять. Особенно на задаче с Твиттер датасетом, ругается на количество редусеров, но оказывается, надо было логи yarn тоже выводить.

Хорошо было бы добавить еще одну проверку, которая проверяет выводятся ли логи yarn и сообщать, что его нет.

Если бы не эта проблема, поставил бы 5 звезд.