Chevron Left
Back to Big Data Integration and Processing

Learner Reviews & Feedback for Big Data Integration and Processing by University of California San Diego

2,304 ratings
499 reviews

About the Course

At the end of the course, you will be able to: *Retrieve data from example database and big data management systems *Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications *Identify when a big data problem needs data integration *Execute simple big data integration and processing on Hadoop and Spark platforms This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications. Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements.You will need a high speed internet connection because you will be downloading files up to 4 Gb in size. Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....

Top reviews

Oct 21, 2020

Hello Gentlemen,\n\nThis course was very helpful foe me. It enhanced my knowledge about Big Data Integration. Thank you so much for providing me such important knowledge. Thank you once again.

Sep 24, 2016

Best course taking into account the first three. Good material, more in depth than the other ones. Very well explained. Useful to get a sense of various interesting topics and orientative.

Filter by:

351 - 375 of 488 Reviews for Big Data Integration and Processing

By Mehul P

Dec 30, 2017

Nice overview to get into it.

By prasanth

Oct 28, 2021

Good content on the basics

By Hector G R

Jan 10, 2019

Pretty well course

By Muhammad N S

Apr 8, 2021

Thankyu coursera


Jul 10, 2020

Good Experience

By Vidit K

Mar 20, 2021

I learnt a lot

By Jürgen B

Oct 31, 2018

Good overview.

By Alejandro S M

Apr 23, 2020

Great for db

By Mario L

Aug 6, 2017

has bugs

By Rohit K S

Oct 12, 2020



Oct 18, 2017


By Johan A P O

Nov 10, 2019

Last week was a disaster in terms of giving the necessary educational resources. I found it extremely hard to finish the assignment because I couldn't understand the knowledge set required to do it.

I think you must work on making sure students are getting tailored to the functions that you will request them at the end. It was tremendously underwhelming to me to find such interesting tasks and finding myself unable to understand any clear path to perform even the first actions.

I had to research a lot out of the platform and dig up old replies in the forum just to have hints about what I had to do to find the answers you were requesting. If you consider that it's sufficient with what you explained, you're applying an unfair filter to students.

If you didn't mean that, please adjust either this whole module to focus on

* pyspark syntaxis

* clear use cases in Data retrieval and analysis

* evaluating the syntaxis of each function that you will request later

Or just change the last module to make it according to what you've taught. Thanks, even though I found these struggles, I was able to learn.

By Sarwar A

Oct 7, 2020

I am writing a review for not only for this course but for the previous two courses as well.

The points that I want to make:

The first two courses were okay as far as the theory is concerned but I am very much disappoint with this course because of the following reasons:

1.Not enough exercises for MongoDB

2.That means we have to go further to learn more about MongoDB

3. Too many tools outlined in this course but in return, only a few quizzes comprise hardly more than six questions each.

4.The instructors could have opted for more quizzes on Apache Spark, SparkSQL, MongoDB, Spark Streaming.

5.The creator of this specialization should add two more courses down the line namely " Querying Databases using SparkSQL and MongoDB" and another course could be on "Spark streaming and Splunk"

Overall I didn't like this course at all.

I would like to tell the future learners don't register for this course if you want to take lessons on MongoDB, spark SQL, spark streaming, and Splunk. Look for the courses on COURSERA if you want to take lessons on the above frameworks.

By Dana B

Jul 14, 2021

I really enjoy working on the topic of Big Data. I also think that the course structure and theoretical content as such is very useful and logical. Hence the 3 stars. However, hands on assignments and packages provided are outdated and getting the environment to run properly takes a lot of time and programming knowledge that I, for one, do not have. Also, data in the hands on assignements have changed, hence it is not always possible to reproduce results from assignments, which is really annoying if these results are part of a quiz. Generally, I do not think that solutions to circumvent errors due to outdated packages and data should be sourced and applied by the student through the forum. It should be in the interest of Coursera and / or the instructors to test the environment and provide updates where necessary. I really have to consider whether I want to continue with the next modules and Coursera in general given that most of my time is spent on getting the environment to run the hands on assignments running.

By Tong L

Jan 16, 2018

The elaborations in video lecture sometimes are too complicated to understand. It should consider all students comes from different industry. For example, the disease/gene relationships, actually it can replaced by GeneA, DiseaseA, etc. Also, the slides are not clear enough for students to capture the outstanding points. It's not good for students to review since it's truly vague of the relationships between the list items. Overall, the lecture is just different to understand, even causing confusion sometimes.


May 7, 2017

the course content is critical and as it appears in many interviews, and the fundamental understanding is important for beginners to learn this new area. however I think the software (spark or mongoDB) can be taught in a more systematic way (at least point out some resources that can help people learn them based on individual needs). I understand this course is for beginners and people supposed to learn deeper on themselves. but a road map will be helpful and reduce the pain finishing the tests.

By Lomiarz

Feb 4, 2017

The course was good enough...but exercises were very simple. Only the final course was little bit challenging. For a guy that sits in IT business for a while it's rather too simple. Besides, I've learned spark basics which is super thanks for that

Maybe you could consider to build docker image instead of using virtual machines. VM is ok, but I think that docker can simplify all the stuff without necessary downloading, installations etc.

Looking forward to the next spark challenges :)

By To P H

Dec 24, 2018

Too many software issues/installation bugs hampering the learning process. The setup procedures for every quiz takes up around 80% of the time and only 20% actually answering the quiz. Please reduce the number of quiz or consolidate them for learners do that we only need to do setup once. Mentor/Instructor presence in various discussions in which students encounter setup/installation issues are next to full absence and many sudents are left figuring out the problems themselves

By Gustavo V

Oct 12, 2020

This course gives an introductory overview in Bigdata processing and explain a variety of tools with little depth, concepts are well explained but the workshops take extra effort to complete due to the fact that the tools versions are outdated, some questionnaires don’t match python workbooks and some assignments for the final project don’t have practical examples in the lessons, so you have to use other learning resources.

By Bojan N

May 11, 2020

Good content, good instructors - they have a nice way of conveying a message, making it easy to follow. I'm rating this course as 3 stars as the content is not kept up to date at all: materials, files, technical dependencies, versioning of the tools - it consumes MUCH, MUCH more time to get the tools setup in place correctly (so that you are able to run the hands-on exercises) compared to the actual time spent studying

By Tomas M

Jul 27, 2017

While the contents are very interesting and the lectures very thorough the practical side has many draw backs. For instance: Connections to PostgresSql did not work even reading the FAQs, same with streaming data in spark. There are not enough examples on syntax and coding to correctly do the assignments. Overall I am happy with the course but it needs some improvements.

By Mauricio H

Sep 8, 2019

So, in general, the course provides you with significant knowledge about big data integration processing, however there were simple exercises that could be done faster if there were no problems executing the commands. This problem leads students to quit the course.

I request the staff correct those errors in order to increase the approval rate.

By David T

Oct 23, 2016

Good experience of using the big data tools but a total lack of engagement in the forums by the instructors and community mentors make it hard going if anything goes wrong. The final quiz took me over 8 hours mostly because there was no one to ask for hints when I was totally stuck and confused!

By Pranav K

May 23, 2021

The hands on exercises were very helpful and the course content was great. However, there were many issues while downloading datasets and configuring other applications. This should be updated so that students don't have to go to the forums every single time.

By Joren Z

Jul 5, 2017

The course covers interesting materials and seems thorough. It's mostly lectures and reading, and not so much actually working with the technology. Since the latter tends to be the hardest part, the overall difficulty remains on the low end of the scale.