Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game "Catch the Pink Flamingo". During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark's MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.

Big Data - Capstone Project

Big Data - Capstone Project
This course is part of Big Data Specialization


Instructors: Ilkay Altintas
Access provided by TVET Authority Google Scholarships
18,779 already enrolled
400 reviews
Skills you'll gain
Tools you'll learn
Details to know

Add to your LinkedIn profile
See how employees at top companies are mastering in-demand skills

Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate

There are 7 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors


Offered by
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Learner reviews
- 5 stars
66.25%
- 4 stars
21.50%
- 3 stars
5.75%
- 2 stars
1.75%
- 1 star
4.75%
Showing 3 of 400
Reviewed on Apr 25, 2022
This course was good to provide more practical lessons using Splunk, KNIME, Spark MLlib, and Neo4j.
Reviewed on Apr 13, 2018
What a challenge, I came into this course as a London Black Cab Taxi Driver, I thought the knowledge was hard but this capstone was a challenge more intense than the Knowledge of London!!!
Reviewed on Jan 6, 2021
A lot more work and time than expected. Some issues with software tools as per expected.
Explore more from Data Science

University of California San Diego

O.P. Jindal Global University
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.



