From charlesreid1

GCDEC: Google Cloud Data Engineer Certification

Basic Info

Certification overview: https://cloud.google.com/certification/data-engineer

Sample case study: https://cloud.google.com/certification/guides/data-engineer/casestudy-flowlogistic

What this certification "certifies" you can do:

  • Build and maintain data structures and databases
  • Design data processing systems
  • Analyze data and enable machine learning
  • Model business processes for analysis and optimization
  • Design for reliability
  • Visualize data and advocate policy
  • Design for security and compliance

Underlying goal: building data-handling capabilities (pipelines to ingest, process, and analyze data, and build models)

  • Data engineers enable better decision-making
  • Cloud services let you accomplish more with less specialized knowledge and effort: more infrastructure and better models, without getting bogged down in rote DevOps tasks or low-level statistics
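
The ingest, process, and analyze stages mentioned above can be sketched in a few lines of plain Python. This is only a minimal illustration, not a Google Cloud implementation: the CSV data, the column names, and the function names ingest, process, and analyze are all hypothetical, and a real pipeline would replace each stage with a managed service.

```python
import csv
import io
from statistics import mean

# Hypothetical raw input; in practice this might arrive from files, a queue, or an API.
RAW_CSV = """shipment_id,region,delivery_hours
1,west,30
2,east,42
3,west,
4,east,38
"""

def ingest(text):
    """Ingest: parse raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))

def process(rows):
    """Process: drop rows with a missing value and convert types."""
    return [
        {"region": r["region"], "delivery_hours": float(r["delivery_hours"])}
        for r in rows
        if r["delivery_hours"]
    ]

def analyze(rows):
    """Analyze: compute the mean delivery time per region."""
    by_region = {}
    for r in rows:
        by_region.setdefault(r["region"], []).append(r["delivery_hours"])
    return {region: mean(hours) for region, hours in by_region.items()}

result = analyze(process(ingest(RAW_CSV)))
print(result)
```

Keeping each stage as a separate function mirrors how the stages are usually split across separate services, so one stage can be swapped out without touching the others.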

What is a Data Engineer

Data engineers do any number of things:

  • Design, build, and maintain data structures, databases, data processing systems, data pipelines
  • Move data from one place to another
  • Do data science
  • Enable machine learning, or do machine learning themselves
  • Model business processes for analysis
  • Enable data-driven decision making in a company

Google Cloud Services

See Google Cloud#Google Cloud Services

Technology Stack

Case study with an example of the kind of technology stack that you might see in use at a company:

List of all pages related to Google Cloud platform:

Training Resources

Coursera

Course 1 - Google Cloud Platform Big Data and Machine Learning Fundamentals

Course 2 - Leveraging Unstructured Data with Cloud Dataproc

Course 3a - Serverless Data Analysis with BigQuery

Course 3b - Serverless Data Analysis with Dataflow

Course 4a - Building Machine Learning Models with TensorFlow (not necessarily in the cloud...)

Course 4b - Deploying Machine Learning Models with TensorFlow (in the cloud)

Course 4c - Engineering Machine Learning Models with TensorFlow (feature engineering and pipelining in the cloud)

Course 5 - Building Resilient Streaming Systems

Udemy

GCDE practice exams

GCDE/Practice Exam 1 - Taken 9/29/17. Most of the questions I missed covered Dataproc.

Labs

Code Labs, GCP Essentials Quest: https://google.qwiklabs.com/quests/23?locale=en

Code Labs, Data Engineering Quest: https://google.qwiklabs.com/quests/25?locale=en

Code Labs, Scientific Data Quest: https://google.qwiklabs.com/quests/28?locale=en

Certification Outline

Sample case study link: https://cloud.google.com/certification/guides/data-engineer/casestudy-flowlogistic

Outline of topics covered by the exam: https://cloud.google.com/certification/guides/data-engineer/#certificate-exam-guide

Analysis of outline of topics: GCDE/Outline of Topics
