From charlesreid1

==Procedure==


Review case study: [[Google Cloud/Case Study]]
Software tools list, (abstract) example for each: [[Google Cloud]]
 
Software tools: [[Google Cloud]]
* Storage/database/computation/GPUs vs CPUs/containerization


Software quality assurance:
* GitHub page - 10 things
* Apply style of later points to earlier points
* Clear out lorem ipsum (7-10)


 
Notes review: GCDEC
===Review Process===
* Case study - [[Google Cloud/Case Study]]
 
 
Software Quality Assurance
* GitHub Pages/10 things list (time machine)
* Needs some dusting off, shortening, apply style of later points to earlier points
* Points 7-10 need to be finished, still lorem ipsum
 
GCDEC Review:
* 1 - https://charlesreid1.com/wiki/GCDEC/Fundamentals/Notes
* 2 - https://charlesreid1.com/wiki/GCDEC/Unstructured_Data/Notes
* 4c - https://charlesreid1.com/wiki/GCDEC/Engineering_Tensorflow/Notes
* 5 - https://charlesreid1.com/w/index.php?title=GCDEC/Streaming/Notes&action=edit&redlink=1
===Examples===


Google Codelabs:
* Scientific data processing - https://google.qwiklabs.com/quests/28?locale=en
* Data engineering - https://google.qwiklabs.com/quests/25?locale=en




[[Category:Google Cloud]]
[[Category:Data Engineering]]

Revision as of 22:07, 8 January 2018

Review of Google Cloud and Data Engineering

Review in preparation for interview:

  • Components of workflow in cloud, analogies
  • Open source tools used at each "step"
  • Highlighting different workflows using repositories
  • Quick/easy example: why so many database solutions? How to do basics?
  • Specific challenges, software, workflow for genomics research

Procedure

Software tools list, (abstract) example for each: Google Cloud

  • Storage/database/computation/GPUs vs CPUs/containerization

Software quality assurance:

  • GitHub page - 10 things
  • Apply style of later points to earlier points
  • Clear out lorem ipsum (7-10)

Notes review: GCDEC

Google Codelabs:

Google Qwiklabs: