From charlesreid1

Notes for Google Cloud Data Engineer (GCDE) certification. See GCDE.

Case Study

The GCDEC page gives an example case study that shows how different parts of the Google Cloud platform come together in the kind of scenario a real company might face. The case study focuses on a logistics company that delivers packages and tracks the deliveries using servers, software, and other infrastructure it already has in place. The company's goals are to improve its computational infrastructure by moving parts of it to the cloud and to add the ability to predict late shipments.

Google Cloud/Case Study

Google Cloud Services

Notes on all of the various parts of the Google Cloud platform and the services available on it.

Introduction

Google Cloud for Big Data

  • MapReduce
  • Spark
  • BigQuery

Usage scenarios

Foundations

Compute and Storage

Data ingestion

Data storage

Federated analysis

Compute Engine

Cloud Storage

Data Analytics

Cloud SQL - relational database

Dataproc for machine learning

  • Bigtop ecosystem:
      • Pig
      • Spark
      • Hive
      • Hadoop

Scaling Data Analysis

Datalab

Datastore

Bigtable

BigQuery (query petabytes in seconds)

TensorFlow

Demand forecasting with machine learning

Data Processing Architectures

Pub/Sub

Dataflow