Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform, Google Cloud

1,923 ratings
239 reviews

About this Course

This one-week accelerated course builds on previous courses in the Data Engineering on Google Cloud Platform specialization. Through a combination of video lectures, demonstrations, and hands-on labs, you'll learn how to create and manage computing clusters to run Hadoop, Spark, Pig, and/or Hive jobs on Google Cloud Platform. You will also learn how to access various cloud storage options from your compute clusters and integrate Google’s machine learning capabilities into your analytics programs. In the hands-on labs, you will create and manage Dataproc clusters using the Web Console and the CLI, and use a cluster to run Spark and Pig jobs. You will then create iPython notebooks that integrate with BigQuery and storage and utilize Spark. Finally, you will integrate the machine learning APIs into your data analysis.

Pre-requisites
• Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
• Some knowledge of Python...

Top reviews


Dec 29, 2017

Really enjoyed it, would have liked to spend more time with the APIs and integrate with real-time web downloads. There are a few bugs and misprints, but it wasn't too hard to find them.


Aug 08, 2018

The course was really helpful for understanding how to use Google's big data offering, Dataproc, for creating and managing Hadoop, Hive, Spark, Pig, and many other open-source big data products.

242 Reviews

By Chris

Dec 18, 2018

good course

By Rai Shahnawaz

Dec 18, 2018

I would like to have more extensive labs for Dataproc. The selection of PySpark for most of the course was quite useful. I would also like a little more on Dataproc use cases and a comparison of Dataproc modules, including Spark and Hive, with the relevant proprietary options available from Google.

By Paul Conyngham

Dec 18, 2018

Didn't feel that I learnt anything in this course.

By Wesley Shimabukuro

Dec 18, 2018

Everything was really great. I have learnt a lot.

By Muthu Mariappan H

Dec 17, 2018

Excellent course for Beginners with more lab experience.

By sugimiyanto

Dec 16, 2018

Some practicals did not work as expected, such as accessing the Hadoop administration web UI from my local browser.

By Roberto Fonseca

Dec 13, 2018

I learned a lot about the platform for ML. I want to begin the next course.

By Peter Schimpl

Dec 12, 2018

Good course. Very interesting!!

By Bethany Baker

Dec 08, 2018

Dataproc is a very impressive piece of software and the labs gave me a great introduction.

By Patrick Baker

Dec 06, 2018

Had trouble with the labs, as they did not score correctly even though all steps were taken and led to a successful outcome.

This led to repeated execution of the labs, with support staff copying and posting the instructions from the lab as steps to rectify ... not helpful