Building Realtime Pipelines in Cloud Data Fusion

In this Project, you will:
1 hour 30 minutes
No download needed
Shareable certificate
Desktop only

This is a self-paced lab that takes place in the Google Cloud console. In addition to batch pipelines, Data Fusion also allows you to create real-time pipelines, that can process events as they are generated. Currently, realtime pipelines execute using Apache Spark Streaming on Cloud Dataproc clusters. In this lab, you will learn how to build a streaming pipeline using Data Fusion.

Skills you will develop

  • Data Analysis

  • Data Management

  • Data Visualization (DataViz)

How Projects work

Learn a new tool or skill in an interactive, hands-on environment

You'll gain access to software and tools in a cloud workspace - no download required

Offered by


Google Cloud

Frequently Asked Questions