Leveraging Unstructured Data with Cloud Dataproc on Google Cloud Platform, Google Cloud

2,495 ratings
314 reviews

About this Course

***NEW! Specialization Completion Challenge, receive Qwiklabs credits valued up to $150! See below for details.***

This 1-week, accelerated course builds upon previous courses in the Data Engineering on Google Cloud Platform specialization. Through a combination of video lectures, demonstrations, and hands-on labs, you'll learn how to create and manage computing clusters to run Hadoop, Spark, Pig and/or Hive jobs on Google Cloud Platform. You will also learn how to access various cloud storage options from your compute clusters and integrate Google’s machine learning capabilities into your analytics programs.

In the hands-on labs, you will create and manage Dataproc clusters using the Web Console and the CLI, and use the clusters to run Spark and Pig jobs. You will then create iPython notebooks that integrate with BigQuery and storage and utilize Spark. Finally, you will integrate the machine learning APIs into your data analysis.

Prerequisites
• Google Cloud Platform Big Data & Machine Learning Fundamentals (or equivalent experience)
• Some knowledge of Python

SPECIALIZATION COMPLETION CHALLENGE

As if learning new skills wasn’t enough of an incentive, we're excited to announce a special completion challenge for the ‘Data Engineering on Google Cloud Platform’ specialization. Here’s how it works: our completion challenge runs through 11:59pm PT May 5, 2019. Complete any course in this Specialization, including this one, anytime in this period and we'll send you 30 Qwiklabs credits for each course completed (up to $150 in value, given there are 5 courses in the specialization). You can use these credits to take additional labs and earn badges, which you can then add to your resume and social profiles. Your challenge awaits. Begin learning on Coursera today!...

Top reviews


Mar 01, 2019

This is a very handy course compared with other cloud platforms: a customized environment was provided, so I didn't have to worry about setting it up on my own. This is very thoughtful and I really appreciate it.


Dec 29, 2017

Really enjoyed it, would have liked to spend more time with the APIs and integrate with real time web downloads. There are a few bugs and misprints, but it wasn't too hard to find them.

315 Reviews

By Ahmed Tealeb

Mar 19, 2019

Excellent :)

By Jordan Cheah

Mar 16, 2019

Love the labs and the demo after the lab. Excellent instructions and great lectures!!! Very happy with the course and I learned A LOT *****

By Mangesh Kumar Soni

Mar 16, 2019

I am grateful to Coursera for making such a syllabus and an environment where one can work hands-on on the problem.

By Gregory Dillon

Mar 15, 2019

Well designed course. Personally, I need(ed) some outside resources to reach a comfort level with this material

By Jesse Scott

Mar 15, 2019

Super clear and focused.

By Robin Ahmed

Mar 11, 2019

awesome one

By Varun Sharma

Mar 09, 2019

Very good and concise course that explains how to use Dataproc to process unstructured data and the various customizations available on Dataproc.

By Sainath Revankar

Mar 09, 2019

The training material is good but some of the labs can be combined to have connectivity between lectures.

By John Drinane

Mar 06, 2019

I'm learning stuff, but I feel like until I take on an ambiguous project it is basic learning. Some of the questions are really vague... the last one was what is not good for MP API and the answer was identify pictures where the product is upside down. I suppose the linear off the shelf selection does not exist for that, but if there is an API to translate Chinese to blah with code written in English... flipping a picture upside down seems like something that must be accessible with a combination of scripts. Long story short I think some of the questions suck balls.

By Dmitry Lukashonok

Mar 05, 2019

I got tired trying to get it done due to some platform issue. It was necessary to review everything several times to get the green sign.