Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods from the earlier courses in this specialization. You will analyze a data set simulating big data generated by a large number of users playing our imaginary game, "Catch the Pink Flamingo". During the five-week Capstone Project, you will walk through the typical big data science steps: acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move on to more challenging big data problems that require the more advanced tools you have learned, including KNIME, Spark's MLlib, and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focused on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.
This course is part of the Big Data Specialization
15,035 already enrolled
About this Course
9,874 recent views
Flexible deadlines
Reset deadlines in accordance with your schedule.
Shareable Certificate
Earn a Certificate upon completion
100% online
Start instantly and learn on your own schedule.
Coursera Labs
Includes hands-on learning projects.
Course 6 of 6 in the Big Data Specialization
Approx. 20 hours to complete
English
Skills you will gain
- Big Data
- Neo4j
- KNIME
- Splunk
Syllabus - What you will learn from this course
Simulating Big Data for an Online Game
1 hour to complete
4 videos (Total 18 min), 4 readings
Acquiring, Exploring, and Preparing the Data
4 hours to complete
6 readings
Data Classification with KNIME
5 hours to complete
4 readings
Clustering with Spark
5 hours to complete
2 readings
Graph Analytics of Simulated Chat Data With Neo4j
3 hours to complete
2 readings
Reviews
- 5 stars: 66.06%
- 4 stars: 21.85%
- 3 stars: 5.91%
- 2 stars: 1.79%
- 1 star: 4.37%
TOP REVIEWS FROM BIG DATA - CAPSTONE PROJECT
by JA, Nov 23, 2018
Wow, it's incredible. I strongly recommend this Capstone Project. Be sure to put in an honest effort.
THANK YOU SO MUCH
by NA, Apr 1, 2018
Absolutely recommended
by ST, Jul 11, 2020
Good exercise to cover the whole essence of many weeks of other courses of the Big Data Specialization.
by ND, Nov 25, 2018
All the sessions were very informative and provided the required knowledge from basics.