menu
arrow_back

ETL Processing on Google Cloud Using Dataflow and BigQuery

ETL Processing on Google Cloud Using Dataflow and BigQuery

1 个小时 7 个积分

GSP290

Google Cloud Self-Paced Labs

Overview

In this lab you build several Data Pipelines that ingest data from a publicly available dataset into BigQuery, using these Google Cloud services:

  • Cloud Storage
  • Dataflow
  • BigQuery

You will create your own Data Pipeline, including the design considerations, as well as implementation details, to ensure that your prototype meets the requirements. Be sure to open the python files and read the comments when instructed to.

加入 Qwiklabs 即可阅读本实验的剩余内容…以及更多精彩内容!

  • 获取对“Google Cloud Console”的临时访问权限。
  • 200 多项实验,从入门级实验到高级实验,应有尽有。
  • 内容短小精悍,便于您按照自己的节奏进行学习。
加入以开始此实验
分数

—/100

Create a Cloud Storage Bucket

运行步骤

/ 20

Copy Files to Your Bucket

运行步骤

/ 10

Create the BigQuery Dataset (name: lake)

运行步骤

/ 20

Build a Data Ingestion Dataflow Pipeline

运行步骤

/ 10

Build a Data Transformation Dataflow Pipeline

运行步骤

/ 10

Build a Data Enrichment Dataflow Pipeline

运行步骤

/ 10

Build a Data lake to Mart Dataflow Pipeline

运行步骤

/ 20