Ingesting Data Into The Cloud Using Google Cloud Functions




Create a new Cloud Storage bucket

Ingest data files to the storage bucket

Deploy a Cloud Function

Use Cloud Scheduler to ingest data with a Cloud Function

This lab demonstrates how to use local Python scripts to retrieve data from the US Bureau of Transportation Statistics website, and then how to modify those scripts so that they can run as Google Cloud Functions.
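
As a rough illustration of the local-script step, a download helper might look like the sketch below. It is not the lab's actual script: the function name `download` and its parameters are hypothetical, no real URL on the bureau's website is assumed, and only the Python standard library is used.

```python
import shutil
import urllib.request
from pathlib import Path


def download(url: str, dest_dir: str, filename: str) -> Path:
    """Stream the resource at `url` into dest_dir/filename and return the path.

    Hypothetical helper for illustration; the caller supplies the URL.
    """
    dest = Path(dest_dir) / filename
    dest.parent.mkdir(parents=True, exist_ok=True)
    # urlopen returns a file-like response; copy it in chunks instead of
    # reading the whole file into memory.
    with urllib.request.urlopen(url, timeout=60) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest
```

Keeping the download logic in a plain function with no command-line or file-system assumptions beyond its arguments is one way to make the later move into a Cloud Function easier, since the same function can be called from a different entry point.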

You will configure and then schedule these Cloud Functions to fetch new data periodically using Google Cloud Scheduler. Running a Cloud Function on a schedule lets you import and sanitize data that is updated periodically, so that the data set you analyze stays up to date. This data set can be used to demonstrate a wide range of data science concepts and techniques, and it is used in all of the other labs in the Data Science on Google Cloud Platform quest.
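
To make the scheduled-ingest idea more concrete, the sketch below shows the general shape an HTTP-triggered Python Cloud Function could take. It is a minimal sketch under stated assumptions, not the lab's code: the entry-point name `ingest_flights`, the `year` and `month` payload fields, and the `flights/raw/` object prefix are all hypothetical. The `request` argument is assumed to behave like the Flask request object that HTTP-triggered Python functions receive, and the actual download and upload to the bucket are left as a placeholder.

```python
import json


def ingest_flights(request):
    """Hypothetical HTTP entry point that a scheduler job could POST to.

    `request` is assumed to be Flask-like, exposing get_json().
    Returns a (body, status_code) tuple.
    """
    # silent=True yields None instead of raising on a missing or invalid body.
    payload = request.get_json(silent=True) or {}
    try:
        year, month = int(payload["year"]), int(payload["month"])
    except (KeyError, TypeError, ValueError):
        return ("expected a JSON body with integer 'year' and 'month'", 400)
    if not 1 <= month <= 12:
        return ("'month' must be between 1 and 12", 400)

    # Placeholder: a real function would download the month's file and
    # write it to the Cloud Storage bucket here. The object name below
    # is an illustrative assumption only.
    object_name = f"flights/raw/{year}{month:02d}.csv"
    return (json.dumps({"would_write": object_name}), 200)
```

Validating the payload before doing any work means a misconfigured scheduler job fails fast with a 400 response rather than writing a badly named object to the bucket.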

