About this Course
4.5
4,303 ratings
1,106 reviews
Specialization

Course 1 of 6 in the

100% online

100% online

Start instantly and learn at your own schedule.
Flexible deadlines

Flexible deadlines

Reset deadlines in accordance to your schedule.
Hours to complete

Approx. 16 hours to complete

Suggested: 3 weeks of study, 5-6 hours/week...
Available languages

English

Subtitles: English, Korean, Hindi, Persian...

Skills you will gain

Big DataApache HadoopMapreduceCloudera
Specialization

Course 1 of 6 in the

100% online

100% online

Start instantly and learn at your own schedule.
Flexible deadlines

Flexible deadlines

Reset deadlines in accordance to your schedule.
Hours to complete

Approx. 16 hours to complete

Suggested: 3 weeks of study, 5-6 hours/week...
Available languages

English

Subtitles: English, Korean, Hindi, Persian...

Syllabus - What you will learn from this course

Week
1
Hours to complete
20 minutes to complete

Welcome

Welcome to the Big Data Specialization! We're excited for you to get to know us and we're looking forward to learning about you! ...
Reading
2 videos (Total 3 min), 2 readings
Video2 videos
Tell us about yourself and learn about your classmatesm
Reading2 readings
By the end of this course you will be able to...2m
Optional: Watch this fun video about the San Diego Supercomputer Center!10m
Hours to complete
4 hours to complete

Big Data: Why and Where

Data -- it's been around (even digitally) for a while. What makes data "big" and where does this big data come from?...
Reading
13 videos (Total 77 min), 13 readings, 1 quiz
Video13 videos
Applications: What makes big data valuable11m
Example: Saving lives with Big Data6m
Example: Using Big Data to Help Patients10m
A Sentiment Analysis Success Story: Meltwater helping Danone1m
Getting Started: Where Does Big Data Come From?2m
Machine-Generated Data: It's Everywhere and There's a Lot!3m
Machine-Generated Data: Advantages4m
Big Data Generated By People: The Unstructured Challenge5m
Big Data Generated By People: How Is It Being Used?10m
Organization-Generated Data: Structured but often siloed7m
Organization-Generated Data: Benefits Come From Combining With Other Data Types4m
The Key: Integrating Diverse Data5m
Reading13 readings
Did you know?: 25 facts about big data10m
Slides: What Launched the Big Data Era?10m
Slides: Applications: What Makes Big Data Valuable?10m
Slides: Saving Lives With Big Data10m
Slides: Using Big Data to Help Patients10m
Extra Resources10m
Slides: Machine-Generated Data: It's Everywhere and There's a Lot!10m
Slides: Machine-Generated Data: Advantages10m
Slides: Big Data Generated By People: The Unstructured Challenge10m
Slides: Big Data Generated By People: How is it Being Used?10m
Slides: Organization-Generated Big Data: Structured But Often Siloed10m
Slides: Organizaton-Generated Big Data: Benefits10m
Slides: The Key - Integrating Diverse Data10m
Quiz1 practice exercise
Why Big Data and Where Did it Come From?38m
Week
2
Hours to complete
2 hours to complete

Characteristics of Big Data and Dimensions of Scalability

You may have heard of the "Big Vs". We'll give examples and descriptions of the commonly discussed 5. But, we want to propose a 6th V and we'll ask you to practice writing Big Data questions targeting this V -- value....
Reading
7 videos (Total 34 min), 9 readings, 1 quiz
Video7 videos
Characteristics of Big Data - Volume5m
Characteristics of Big Data - Variety5m
Characteristics of Big Data - Velocity6m
Characteristics of Big Data - Veracity6m
Characteristics of Big Data - Valence2m
The Sixth V: Value4m
Reading9 readings
What does astronomical scale mean?10m
A Small Definition of Big Data10m
Slides: Getting Started - Characteristics of Big Data10m
Slides: Characteristics of Big Data - Volume10m
Slides: Characteristics of Big Data - Variety10m
Slides: Characteristics of Big Data - Velocity10m
Slides: Characteristics of Big Data - Veracity10m
Slides: Characteristics of Big Data - Value10m
Slides: Characteristics of Big Data - Valence10m
Quiz1 practice exercise
V for the V's of Big Data14m
Hours to complete
4 hours to complete

Data Science: Getting Value out of Big Data

We love science and we love computing, don't get us wrong. But the reality is we care about Big Data because it can bring value to our companies, our lives, and the world. In this module we'll introduce a 5 step process for approaching data science problems....
Reading
11 videos (Total 66 min), 12 readings, 1 quiz
Video11 videos
Building a Big Data Strategy9m
How does big data science happen?: Five Components of Data Science9m
Asking the Right Questions3m
Steps in the Data Science Process3m
Step 1: Acquiring Data6m
Step 2-A: Exploring Data4m
Step 2-B: Pre-Processing Data8m
Step 3: Analyzing Data8m
Step 4: Communicating Results4m
Step 5: Turning Insights into Action2m
Reading12 readings
Five P's of Data Science10m
Slides: Getting Value Out of Big Data10m
Slides: Building a Big Data Strategy10m
Slides: The Five P's of Data Science10m
Slides: Asking the Right Questions10m
Slides: Steps in the Data Science Process10m
Slides: Step 1 - Acquiring Data10m
Slides: Step 2A-Exploring Data10m
Slides: Step 2B-Preprocessing Data10m
Slides: Step 3-Data Analysis10m
Slides: Step 4-Communicating Results10m
Slides: Step 5-Turning Insights Into Action10m
Quiz1 practice exercise
Data Science 10120m
Week
3
Hours to complete
1 hour to complete

Foundations for Big Data Systems and Programming

Big Data requires new programming frameworks and systems. For this course, we don't programming knowledge or experience -- but we do want to give you a grounding in some of the key concepts....
Reading
4 videos (Total 19 min), 4 readings, 1 quiz
Video4 videos
What is a Distributed File System?6m
Scalable Computing over the Internet4m
Programming Models for Big Data6m
Reading4 readings
Slides: Getting Started-Why Worry About Foundations?10m
Slides: What is a Distributed File System?10m
Slides: Scalable Computing Over the Internet10m
Slides: Programming Models for Big Data10m
Quiz1 practice exercise
Foundations for Big Data20m
Hours to complete
5 hours to complete

Systems: Getting Started with Hadoop

Let's look at some details of Hadoop and MapReduce. Then we'll go "hands on" and actually perform a simple MapReduce task in the Cloudera VM. Pay attention - as we'll guide you in "learning by doing" in diagramming a MapReduce task as a Peer Review....
Reading
11 videos (Total 66 min), 8 readings, 3 quizzes
Video11 videos
The Hadoop Ecosystem: Welcome to the zoo!7m
The Hadoop Distributed File System: A Storage System for Big Data7m
YARN: A Resource Manager for Hadoop5m
MapReduce: Simple Programming for Big Results12m
When to Reconsider Hadoop?4m
Cloud Computing: An Important Big Data Enabler6m
Cloud Service Models: An Exploration of Choices4m
Value From Hadoop and Pre-built Hadoop Images3m
Copy your data into the Hadoop Distributed File System (HDFS)4m
Run the WordCount program5m
Reading8 readings
MapReduce in the Pasta Sauce Example10m
Slides for Getting Started With Hadoop10m
Downloading and Installing the Cloudera VM Instructions (Mac)10m
Downloading and Installing the Cloudera VM Instructions (Windows)10m
FAQ10m
Copy your data into the Hadoop Distributed File System (HDFS) Instructions10m
Run the WordCount program Instructions10m
How do I figure out how to run Hadoop MapReduce programs?10m
Quiz2 practice exercises
Intro to Hadoop26m
Running Hadoop MapReduce Programs Quiz4m
4.5
Career direction

47%

started a new career after completing these courses
Career Benefit

83%

got a tangible career benefit from this course
Career promotion

11%

got a pay increase or promotion

Top Reviews

By PBMay 25th 2018

A step by step approach stating from basic big data concept extending to Hadoop framework and hands on mapping and simple MapReduce application development effort.\n\nVery smooth learning experience.

By RGJul 14th 2017

First of all i would like to take this opportunity to thanks the instructors the course is well structured and explained the foundations with real world problems with easy to understand the concepts.

Instructors

Avatar

Ilkay Altintas

Chief Data Science Officer
San Diego Supercomputer Center
Avatar

Amarnath Gupta

Director, Advanced Query Processing Lab
San Diego Supercomputer Center (SDSC)

About University of California San Diego

UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory....

About the Big Data Specialization

Drive better business decisions with an overview of how big data is organized, analyzed, and interpreted. Apply your insights to real-world problems and questions. ********* Do you need to understand big data and how it will impact your business? This Specialization is for you. You will gain an understanding of what insights big data can provide through hands-on experience with the tools and systems used by big data scientists and engineers. Previous programming experience is not required! You will be guided through the basics of using Hadoop with MapReduce, Spark, Pig and Hive. By following along with provided code, you will experience how one can perform predictive modeling and leverage graph analytics to model problems. This specialization will prepare you to ask the right questions about data, communicate effectively with data scientists, and do basic exploration of large, complex datasets. In the final Capstone Project, developed in partnership with data software company Splunk, you’ll apply the skills you learned to do basic analyses of big data....
Big Data

Frequently Asked Questions

  • Once you enroll for a Certificate, you’ll have access to all videos, quizzes, and programming assignments (if applicable). Peer review assignments can only be submitted and reviewed once your session has begun. If you choose to explore the course without purchasing, you may not be able to access certain assignments.

  • When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.

More questions? Visit the Learner Help Center.