Learner Reviews & Feedback for Big Data Integration and Processing by University of California San Diego

2,343 ratings

About the Course

At the end of the course, you will be able to:

* Retrieve data from example database and big data management systems

* Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications

* Identify when a big data problem needs data integration

* Execute simple big data integration and processing on Hadoop and Spark platforms

This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications.

Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high speed internet connection because you will be downloading files up to 4 GB in size.

Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+; VirtualBox 5+....

Top reviews


Oct 21, 2020

Hello Gentlemen,

This course was very helpful for me. It enhanced my knowledge of Big Data Integration. Thank you so much for providing me with such important knowledge. Thank you once again.


Mar 5, 2018

It was a good course, but it could have been better if some Spark examples were also provided in other languages such as Java; people without a Python background may find it difficult.

351 - 375 of 497 Reviews for Big Data Integration and Processing

By Giovanny F F F

Aug 3, 2020

Need to explain more about the syntax in Spark.

By Puneeth K R

Nov 1, 2020

It's very interesting to learn this course

By Chika E

Aug 2, 2020

Had some analysis issues at the end.

By Kajal N

Mar 2, 2019

Great experience with this course

By Guillem C M

Nov 1, 2018

The final week is quite difficult


May 2, 2022

quite interesting and futuristic

By Soham G

Mar 1, 2020

A little bit drastic and lengthy.

By Deleted A

Dec 25, 2020

Spark is not an easy language

By Mehul P

Dec 30, 2017

Nice overview to get into it.

By prasanth

Oct 28, 2021

Good content on the basics

By Hector G R

Jan 10, 2019

Pretty good course

By Muhammad N S

Apr 8, 2021

Thank you, Coursera


Jul 10, 2020

Good Experience

By Vidit K

Mar 20, 2021

I learnt a lot

By Jürgen B

Oct 31, 2018

Good overview.

By Alejandro S M

Apr 23, 2020

Great for db

By Mario L

Aug 6, 2017

has bugs

By Rohit K S

Oct 12, 2020



Oct 18, 2017


By Johan A P O

Nov 10, 2019

Last week was a disaster in terms of giving the necessary educational resources. I found it extremely hard to finish the assignment because I couldn't understand the knowledge set required to do it.

I think you must work on making sure students are prepared for the functions that you will ask them to use at the end. It was tremendously underwhelming to me to find such interesting tasks and then find myself unable to see any clear path to perform even the first actions.

I had to research a lot outside the platform and dig up old replies in the forum just to get hints about what I had to do to find the answers you were requesting. If you consider what you explained to be sufficient, you're applying an unfair filter to students.

If you didn't mean that, please either adjust this whole module to focus on

* PySpark syntax

* clear use cases in data retrieval and analysis

* evaluating the syntax of each function that you will request later

Or just change the last module to match what you've taught. Thanks; even though I had these struggles, I was able to learn.

By Sarwar A

Oct 7, 2020

I am writing a review not only for this course but for the previous two courses as well.

The points that I want to make:

The first two courses were okay as far as the theory is concerned, but I am very much disappointed with this course for the following reasons:

1. Not enough exercises for MongoDB

2. That means we have to go further to learn more about MongoDB

3. Too many tools are outlined in this course, but in return there are only a few quizzes, each with hardly more than six questions.

4. The instructors could have opted for more quizzes on Apache Spark, SparkSQL, MongoDB, Spark Streaming.

5. The creator of this specialization should add two more courses down the line, namely "Querying Databases using SparkSQL and MongoDB", and another course could be on "Spark streaming and Splunk"

Overall I didn't like this course at all.

I would like to tell future learners: don't register for this course if you want to take lessons on MongoDB, Spark SQL, Spark Streaming, and Splunk. Look for courses on Coursera if you want to take lessons on the above frameworks.

By Dana B

Jul 14, 2021

I really enjoy working on the topic of Big Data. I also think that the course structure and theoretical content as such are very useful and logical. Hence the 3 stars. However, the hands-on assignments and packages provided are outdated, and getting the environment to run properly takes a lot of time and programming knowledge that I, for one, do not have. Also, data in the hands-on assignments have changed, hence it is not always possible to reproduce results from assignments, which is really annoying if these results are part of a quiz. Generally, I do not think that solutions to circumvent errors due to outdated packages and data should be sourced and applied by the student through the forum. It should be in the interest of Coursera and / or the instructors to test the environment and provide updates where necessary. I really have to consider whether I want to continue with the next modules and Coursera in general, given that most of my time is spent on getting the environment to run the hands-on assignments.

By Tina L

Jan 16, 2018

The explanations in the video lectures are sometimes too complicated to understand. The course should take into account that students come from different industries. For example, the disease/gene relationships could be replaced by GeneA, DiseaseA, etc. Also, the slides are not clear enough for students to capture the key points. That makes them poor for review, since the relationships between the list items are truly vague. Overall, the lectures are just difficult to understand, even causing confusion sometimes.


May 7, 2017

The course content is critical, as it appears in many interviews, and the fundamental understanding is important for beginners learning this new area. However, I think the software (Spark or MongoDB) could be taught in a more systematic way (or at least point out some resources that can help people learn them based on individual needs). I understand this course is for beginners and people are supposed to go deeper on their own, but a road map would be helpful and reduce the pain of finishing the tests.

By Lomiarz

Feb 4, 2017

The course was good enough...but the exercises were very simple. Only the final course was a little bit challenging. For a guy who has been in the IT business for a while, it's rather too simple. Besides, I've learned Spark basics, which is super; thanks for that.

Maybe you could consider building a Docker image instead of using virtual machines. A VM is OK, but I think that Docker can simplify all the stuff without the need for downloads, installations, etc.

Looking forward to the next Spark challenges :)