Chevron Left
Back to Data Engineering and Machine Learning using Spark

Learner Reviews & Feedback for Data Engineering and Machine Learning using Spark by IBM

23 ratings
11 reviews

About the Course

Organizations need skilled, forward-thinking Big Data practitioners who can apply their business and technical skills to unstructured data such as tweets, posts, pictures, audio files, videos, sensor data, and satellite imagery and more to identify behaviors and preferences of prospects, clients, competitors, and others. In this short course you'll gain practical skills when you learn how to work with Apache Spark for Data Engineering and Machine Learning (ML) applications. You will work hands-on with Spark MLlib, Spark Structured Streaming, and more to perform extract, transform and load (ETL) tasks as well as Regression, Classification, and Clustering. The course culminates in a project where you will apply your Spark skills to an ETL for ML workflow use-case. NOTE: This course requires that you have foundational skills for working with Apache Spark and Jupyter Notebooks. The Introduction to Big Data with Spark and Hadoop course from IBM will equip you with these skills and it is recommended that you have completed that course or similar prior to starting this one....
Filter by:

1 - 11 of 11 Reviews for Data Engineering and Machine Learning using Spark

By Minh Q N

Sep 22, 2021

Great Course!!!


Nov 19, 2021


By Tatiana P

Jan 10, 2022

T​his course seemed very detached from the rest of the Data Engineering courses.

V​ery advanced info on a very advanced topic presented in a superficial and rushed manner.

F​inal project with many technical issues in the necessary Jupyter Labs, which I don't see reseaonably debugged by the person taking the course (also, why should they?).

V​ery happy with the rest of the Data Engineering offering so far (I completed 11 out of 13).

V​ery disappointed with this one.

By David S S

Nov 15, 2021

I can't rate higher this course due to the problems with the final project... I hope all the errors could be fixed for future students because the course is excellent and the exercise is great to practice all the knowledge acquire but it has a lot of bugs.

By Natale F

Nov 25, 2021

The Data Engineer part is too fast. The final assessment focuses on the implementation of Machine Learning algorithms with Spark, there is no Data Engineer code production required.

By Sheraz M

Sep 18, 2021

T​he final assignmnet instructions are not very clear and also there are some coding msiatkes that lead you to unexpected results.

By Pawel D

Jan 14, 2022

This course is misunderstanding. The lab environment is not working since months. Running lab notebooks locally require a lot of hacking to make it work. The course is assuming knowledge re/ Machine Learning and data wrangling, The spark is explained superficially and not much use. Free online tutorials are better and clearer.

By Dmitry K

Sep 14, 2021

Peer project has tasks which has never been though or referenced. Part of the labs are failng with lack of resources and git has some obsolete code.

By Cristina M M

Nov 9, 2021

The theory and practice of this course are not at the same level. Yo need to learn some statistics and ML theorical concepts previously.

Labs cannot be do it only with the explanations of the videos.... The final project shouldn't be the place where you see a decision tree.

Also, there is a some commands that work in a bad way in the labs. I think the course need a complete revision, keeping in mind that a lot of learners do the course as part of a certification and had no experience with ML and a only a little with spark.

By Omar H

Dec 5, 2021

It offers very little information, The labs are not well explained, this course doesn't add any value for the specialization.

By James N

Nov 8, 2021

Assignments remain offline for more than a week. No refunds offered, no staff responses