Chevron Left
Back to Big Data Integration and Processing

Learner Reviews & Feedback for Big Data Integration and Processing by University of California San Diego

2,343 ratings

About the Course

At the end of the course, you will be able to: *Retrieve data from example database and big data management systems *Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications *Identify when a big data problem needs data integration *Execute simple big data integration and processing on Hadoop and Spark platforms This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications. Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements.You will need a high speed internet connection because you will be downloading files up to 4 Gb in size. Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....

Top reviews


Oct 21, 2020

Hello Gentlemen,

This course was very helpful foe me. It enhanced my knowledge about Big Data Integration. Thank you so much for providing me such important knowledge. Thank you once again.


Sep 24, 2016

Best course taking into account the first three. Good material, more in depth than the other ones. Very well explained. Useful to get a sense of various interesting topics and orientative.

Filter by:

26 - 50 of 497 Reviews for Big Data Integration and Processing

By Ying H

Oct 18, 2020

great course material but poor and outdated hands-on material

By Charles G

Dec 14, 2017

It was a tough last project and I had to use outside resources with limited examples and that is why I could not finish on time. I don't like being late on anything but this was by far my worst time management due to vague instructions and lack of supp

By Dhananjay W

Oct 8, 2020

Overall course is awesome but lots of tries does not get to install spark in system. some problem with anaconda package is there.

By Mouâd B

Dec 6, 2021

U​nmaintained resources.

By Allyson D d L

Nov 19, 2021

The content is superficial but interesting. I liked the classes about MongoDB and Spark. The bad thing is that I couldn't use any of tools in Cloudera VM. I coudn't use Spark to answer the last quiz. I tried to install Spark in my personal computer but it didn't work too. I used PostgreSQL and MongoDB in my PC too. I was very disappointed because of that. I couldn't practice Spark as I wanted.

By Ana L

Sep 3, 2018

A lot of the instructions for the assignments were incomplete or just wrong. The installation for Splunk never worked as written, forums were of no help; I ended up downloading an older version of Splunk just to complete that week's assignment. This course was extremely frustrating to complete.

By Martin D

Aug 15, 2020

To avoid. For 4 years, instructors / Coursera / UCSD have never updated the course. In this course, when you follow the instructions "localhost: 8889" it doesn't work, you have to go through the terminal to start pyspark. In addition, the connection ports have been obsolete for over 4 years, therefore the function "", 12028) (or 12024, 12020) does not work. Students are left to fend for themselves, wasting their time finding solutions in forums where unanswered questions to the same issues were asked 4 years ago. Even their VM is obsolete and no longer supported by Coursera. Find other Big Data training if you want to have a higher level training. If I could put a negative star on this course, I would.

By Yuri C

Mar 3, 2022

I really liked the instructors. They have a really good way of showing the content and are very helpful. However, none of the examples in this whole course will work. The images used in the virtual machines are so old that they cannot be even updated anymore, because the servers changed and/or the linux distribution is not even developed anymore. Coursera has to immediately update this course and all the courses in this specialization! This is not fair with the student. Please, Coursera! I have always supported the work on the the platform. This was a great disappointment.

By Marshall

Sep 29, 2021

The content of the course is good. However, the self-driven mini projects are so out-of-date they cannot be completed without hours of wasteful troubleshooting. Coursera should seriously remove this course in it's current state.

By Sarsiz C

Jun 28, 2017

Phew! This course was really tough(the last week's last quiz) You need to do extensive research on the subject for solving further problems which I really liked but it was quite difficult to understand and implement the same after reading from documentation

By Kenat T F G

Apr 21, 2017

We need more exercises to use phyton and spark because with only the examples the hands on quizes are very complicated and I feel that I'm not understanding very well how to use the bigdata tools, is more like error and proof than real learning

thanks a lot!!!

By Dhawal K

Feb 15, 2017

Big data integration and processing is a very nice theoretical course on big data integrations systems and also includes a bunch of practical exercises with big data integration products. The assignments including mongodb and apache spark are worth doing.

By Suresh V

May 29, 2017

What a great relief, after passing that last quiz, it took my entire Memorial Day weekend, after the Microsoft Windows 10 Creators upgrade totally ruined my Virtualbox installation for the Cloudera QuickStart VM. Looks like that is a very very good quiz.

By Nwogbo b C

May 27, 2020

The entire course was quite thrilling filled with new information at each turn, the final hands-on assignment was an eye-opener to the possibilities and importance of using the various big data ecosystem to solve the world problems in a global scale

By Misty S

Aug 28, 2019

This course is very good, however not for beginners because one needs some background base for traditional databases and some math skills. Otherwise the course content is good and the assignment really makes the concepts understandable and clear.

By Sivakumar A

Feb 22, 2018

This course covered a lot of topics at a decent level, and provides a good platform for those interested in taking a deeper dive into specialization courses. I enjoyed the course content, as well as the hands on exercises/quizzes.

Thank you!

By Andres H

May 11, 2019

That's an excellent course I've Learned a lot about not just the Platforms Basics but also how to perform basic operations in Mongo, the tests and the practical exercises also were well planned to ensure that you know what are you doing.

By Helder V

Aug 12, 2017

The content of the course is good, in some situations the explanation is not so complete, but with some research it is possible to conclude. In addition to the videos could be offered some extra material with the content of the module.

By Jorge V

Dec 23, 2018

This has been one of the most exciting courses I've done. The final project makes a good job on making you apply a Big Data Processing Pipeline to solve a common task these days with SparkSQL: analyzing data on social media.

By Wania K

Jul 30, 2019

I found this quite beneficial for me, as it provide all the relevant knowledge that is required to know all about Big Data Integration and Processing. Thanks coursera for providing such a platform to everyone.


Mar 12, 2018

Course should be designed to be able to install trail/free software for hands on experience like spunk. Overall nice concepts and hands on with Spark and MongoDB. Excluding spunk area everything is good.

By Andrea C

Oct 15, 2016

Course is well balanced between theory and practical exercises. Assignments in week 6 for me was challenging but very instructive. I learned many interesting, and useful in real world, things.


Oct 22, 2020

Hello Gentlemen,

This course was very helpful foe me. It enhanced my knowledge about Big Data Integration. Thank you so much for providing me such important knowledge. Thank you once again.

By Federico A G C

Sep 25, 2016

Best course taking into account the first three. Good material, more in depth than the other ones. Very well explained. Useful to get a sense of various interesting topics and orientative.

By Hans E

Nov 3, 2017

Only the last test was a little too difficult (needed some more information to solve it). Have spend 3 days searching on the internet (:-(

Nice mix of theory and lots of nice hands -on