Chevron Left
Back to Big Data Integration and Processing

Learner Reviews & Feedback for Big Data Integration and Processing by University of California San Diego

2,319 ratings
501 reviews

About the Course

At the end of the course, you will be able to: *Retrieve data from example database and big data management systems *Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications *Identify when a big data problem needs data integration *Execute simple big data integration and processing on Hadoop and Spark platforms This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications. Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free. How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements.You will need a high speed internet connection because you will be downloading files up to 4 Gb in size. Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+ VirtualBox 5+....

Top reviews

Oct 21, 2020

Hello Gentlemen,\n\nThis course was very helpful foe me. It enhanced my knowledge about Big Data Integration. Thank you so much for providing me such important knowledge. Thank you once again.

Sep 24, 2016

Best course taking into account the first three. Good material, more in depth than the other ones. Very well explained. Useful to get a sense of various interesting topics and orientative.

Filter by:

426 - 450 of 490 Reviews for Big Data Integration and Processing

By Ярослав Ф

May 4, 2020

В курсе не рассматриваются базовые операторы MongoDB, например, обращение к данным в подструктуре. Некорректная информация по поводу установки нужных программ для работы в 6 неделе. Никак не мог подключиться к Jupyter и использовать PySpark. Преподаватели не общаются со студентами и отказываются обьяснить. Лекции интересные, но расплывчатые. Так же есть реклама приложения, что само по себе уже не красиво.

By Allyson D d L

Nov 19, 2021

The content is superficial but interesting. I liked the classes about MongoDB and Spark. The bad thing is that I couldn't use any of tools in Cloudera VM. I coudn't use Spark to answer the last quiz. I tried to install Spark in my personal computer but it didn't work too. I used PostgreSQL and MongoDB in my PC too. I was very disappointed because of that. I couldn't practice Spark as I wanted.

By MartinsT

Oct 29, 2020

I'm very satisfied with the knowledge I have gained by taking this course, but I'm VERY DISAPPOINTED in the practical tasks (Hands on tasks), because the tools used in analysis are not up to date - they are not working when you follow the hands on task guide. You can get them to work through researching forums for help, but still it's unacceptable because you have to pay for this course.

By Marek K

Jun 10, 2021

The course content is good - however the VM provided needs alot of work to get it working - i have spent weeks over this specialisation just trying to get software installed and working - thankfully i know Linux and work in IT . I think if the software was refreshed to something in support and not behind a paywall on the most part this would be great.

By Karen H

May 31, 2021

The course and specialization is overall good, however I encountered lots of technical challenges during the completion of the tasks.

First, Centos 6 is not supported and lots of difficulties do upgrade some staff. Couldn't connect pyspark with Jupyter Notebook.

Guys, you need either upgrade the materials or remove it from Coursera .

By Stephen L

Jul 23, 2017

Good material and challenging assignments, but too many technical issues with setup instructions and spark context. You have to be a cloudera expert to solve the issues. There does not seem to be any support from the course instructors or assistants. The files are out of date with the spark updates.

By Chew S J (

Jul 17, 2018

The contents seem to be fine at the beginning but the assignment on Week 6 was just way too much for me. The assignment lacks clear guideline and perhaps, lecture contents need to be updated, or the assignment task needs to be revised.

It took me two weeks to complete the last week of the course itself.

By Swetha K

May 25, 2020

Too much of theory, and very little practical knowledge is taught. But at the end of the course, the quiz requires hands-on knowledge on Python & Spark. How is it expected that we can solve those questions without prior knowledge, I do not understand. Disappointed with the course content.

By Polla T

Sep 16, 2017

The final pyspark project was too hard for me and I don't exactly understand without massive python knowledge how can this be solvable, while the weeks lessons were way too easy compare to this final project.

This whole course was a little bit too superficial, without comprehensive tasks.

By Christopher R

Jun 27, 2017

Interesting material and good lectures however some of the hands-on work was difficult / impossible to complete due to issues with SparkSQL, which have gone unanswered in the discussion forums, so had to export the data and use other tools to perform those analyses.

By Ferran G F

Jan 9, 2020

Low score because professor team/staff seemed to completely ignore discussion forums. A lot of participants have had problems running shell scripts and other setup instructions that are necessary to perform some tasks, and their posts have been ignored.

By Robert H

Sep 8, 2017

Tedious exercises through VM where instructions oftentimes do not work out of the box. It is a hassle to download the slides in small sets and their design awful. Definitely one of the worse courses I have taken.

By Klaas v S

May 19, 2020

The supplied tools are broken in hard to debug ways, as is evident from the discussion forums, where literally thousands of questions are raised. Somebody should do sentiment analysis on these forums, I suppose.

By Sergey K

Oct 8, 2016

Quizes are inconsistent with given dataset. Questions are bad: i.e. what is the difference between "number of distinct countries mentioned" and "number of countries mentioned in tweets".

By Enrique C M d C

Nov 19, 2017

La virtual box de cloudera va muy lenta y es imposible hacer los ejercicios de manera fluida y satisfactoria.

El usuario y la contraseña de Splunk que se proporcionan estan caducados

By Erwin v R

Dec 24, 2016

I miss a proper buildup from the theory to the practical exercises. Especially the last quiz I found very difficult based on the limited number of exercises presented upfront.

By Brian S

May 23, 2019

Practice activities files are outdated and a lot of the installation of downloaded tools requires manual fixing, there is no support at all from the course publishers.

By Luis E M

Apr 9, 2020

Links do not work and you must spent a lot of time in the Discussion Forum to know how to run the programs.

Instructions for the quiz are not clear and confused

By Matt M

Nov 17, 2016

I had many problems with the final two programming assignments with running Spark in the VM and there isn't a lot of help available online.

By Richard E A M

Apr 21, 2017

The materials explaining the course are too poor. This is a core course in the specialization.

The quizzes are out of the material's league

By Stephen B

Mar 29, 2020

Too many different software packages, not enough depth, and no support. Good high level overview.

By Ruijia W

Nov 17, 2017

1. the whole week 1 and week 2 are useless and not helpful.

2. Plz add more handson sections

By Mayank R

Apr 8, 2019

This course focuses entirely on theory and there are very few hands on exercises .

By M B A

Mar 10, 2021

On week 6 Pyspark hands-on, no guidance, un-understandable and not working

By Reinaldo L N

Sep 5, 2019

I had lots of problems with postgresql, could not run the hands-on for it