Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods form the earlier courses in this specialization. You will analyze a data set simulating big data generated from a large number of users who are playing our imaginary game "Catch the Pink Flamingo". During the five week Capstone Project, you will walk through the typical big data science steps for acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and Open Office. Then we will move into more challenging big data problems requiring the more advanced tools you have learned including KNIME, Spark's MLLib and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focus on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.
About this Course
Learner Career Outcomes
Learner Career Outcomes
University of California San Diego
UC San Diego is an academic powerhouse and economic engine, recognized as one of the top 10 public universities by U.S. News and World Report. Innovation is central to who we are and what we do. Here, students learn that knowledge isn't just acquired in the classroom—life is their laboratory.
- 5 stars
- 4 stars
- 3 stars
- 2 stars
- 1 star
TOP REVIEWS FROM BIG DATA - CAPSTONE PROJECT
Overall a good capstone project. However, the guidance is inconsistent some times. Also the setup for PySpark is not working for a lot of students, for at least 3 years, and they did not uptdate it.
Really interesting insights into the general overview of the big data specialization with brain-teasing hands-on exercises and a look to hoe reporting various big data analytics should be undertaken
This is very helpful project where i have applied all learning through ouot journey of this course.Though it was time consuming but worth to invest time, which benefits to upskill my knowledge
What a challenge, I came into this course as a London Black Cab Taxi Driver, I thought the knowledge was hard but this capstone was a challenge more intense than the Knowledge of London!!!
About the Big Data Specialization
Drive better business decisions with an overview of how big data is organized, analyzed, and interpreted. Apply your insights to real-world problems and questions.
Frequently Asked Questions
When will I have access to the lectures and assignments?
What will I get if I subscribe to this Specialization?
Is financial aid available?
More questions? Visit the Learner Help Center.