University of California San Diego

Introduction to Big Data

Interested in increasing your knowledge of the Big Data landscape? This course is for those new to data science and interested in understanding why the Big Data Era has come to be. It is for those who want to become conversant with the terminology and the core concepts behind big data problems, applications, and systems. It is for those who want to start thinking about how Big Data might be useful in their business or career. It provides an introduction to one of the most common frameworks, Hadoop, that has made big data analysis easier and more accessible, increasing the potential for data to transform our world!

At the end of this course, you will be able to:
* Describe the Big Data landscape, including examples of real-world big data problems and the three key sources of Big Data: people, organizations, and sensors.
* Explain the V’s of Big Data (volume, velocity, variety, veracity, valence, and value) and why each impacts data collection, monitoring, storage, analysis and reporting.
* Get value out of Big Data by using a 5-step process to structure your analysis.
* Identify what are and what are not big data problems and be able to recast big data problems as data science questions.
* Provide an explanation of the architectural components and programming models used for scalable big data analysis.
* Summarize the features and value of core Hadoop stack components including the YARN resource and job management system, the HDFS file system and the MapReduce programming model.
* Install and run a program using Hadoop!

This course is for those new to data science. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments.

Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free.

How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high-speed internet connection because you will be downloading files up to 4 GB in size.

Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge. Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+, or CentOS 6+; VirtualBox 5+.

Course: 18 hours

Featured reviews

JM

5.0, Reviewed Jun 28, 2021

Great Introduction to Big Data. I did not realize how important it is for different expertise to come together in order to draw meaningful value from the data. It is not only a programmer's world.

KN

5.0, Reviewed Jun 1, 2020

This is a great course. I learnt a lot from it. What I like about this course is the hands-on experience with Hadoop. Such a good addition to our skills, instead of just theoretical learning.

AR

5.0, Reviewed Mar 30, 2020

One of the best courses to start learning new cutting-edge technology and to get deeper insights into Big Data. Thanks to the great instructors for amazing explanations of each module and e-materials.

HM

5.0, Reviewed Sep 8, 2019

I love the course. It goes deep into the foundations, and then finishes up with an actual lab where you learn by practice. I greatly benefited from it and feel I have achieved a milestone in big data.

SS

5.0, Reviewed Sep 14, 2019

It is a comprehensive introduction to big data which covers significant components with enough content that can be absorbed at this stage. A very good kick-start, and I am excited for the next course ahead.

AS

5.0, Reviewed Oct 14, 2018

It was an extremely helpful course to start learning Big Data! I learnt lots of things and how to use Cloudera to work on datasets. This course contains theoretical and practical examples, thanks a lot!

RG

5.0, Reviewed Jul 13, 2017

First of all, I would like to take this opportunity to thank the instructors. The course is well structured and explains the foundations with real-world problems, making the concepts easy to understand.

JT

5.0, Reviewed Aug 30, 2016

This is a great introduction to Big Data. It helped me revisit what I learned from the meetups and webinars, then put the fundamental knowledge and information on a solid foundation. Thank you.

VG

5.0, Reviewed Mar 25, 2018

Excellent opportunity to learn the concepts of Big Data and the Hadoop ecosystem. Overall, a wonderful learning experience with hands-on work to gain practical knowledge of the concepts learnt.

AK

5.0, Reviewed Aug 8, 2021

This course was really helpful for a basic understanding of Big Data, and the instructor's appearance on screen made it feel like a classroom session, which made the training more interesting.

MZ

5.0, Reviewed Aug 23, 2017

A great course. I had tried many resources on big data before finding this one, but couldn't quite understand the subject. Now I'm comfortable with my knowledge of big data.

VJ

4.0, Reviewed Oct 26, 2020

The Hadoop commands were from an old version, whereas commands from newer versions are also available. However, the content of the course was very interactive and interesting, and it made the learning easy.

All reviews


Deleted Account, 5.0, Reviewed May 10, 2022
Isara Anantavrasilp, 3.0, Reviewed Oct 4, 2018
Abdul Kittana, 1.0, Reviewed Sep 26, 2016
Rakesh Gopidi, 5.0, Reviewed Jul 14, 2017
Prabir Bhattacharyya, 5.0, Reviewed May 25, 2018
Patricia peñalosa, 2.0, Reviewed Nov 7, 2018
Catherine Boothman, 5.0, Reviewed Nov 8, 2018
Raivis Joksts, 3.0, Reviewed Feb 5, 2019
hatem murad, 5.0, Reviewed Sep 9, 2019
Hendrik Bruns, 3.0, Reviewed Dec 1, 2017
Alexandra Hazard Kampmann, 1.0, Reviewed Apr 9, 2017
T Bizreh, 4.0, Reviewed Feb 23, 2020
Guy Dupenloup, 1.0, Reviewed Feb 6, 2021
Sean Green, 1.0, Reviewed Sep 24, 2016
Pranav Vyas, 1.0, Reviewed Mar 7, 2019
Mayank Raj, 1.0, Reviewed Oct 3, 2019
Rongon Chatterjee, 4.0, Reviewed May 14, 2019
Ahmed Khalifa Ahmed Alhammadi, 5.0, Reviewed Jun 9, 2019
Azar Rzayev, 5.0, Reviewed Mar 31, 2020
Brian Song, 4.0, Reviewed May 9, 2022