Learn how to build a data science technology stack and perform good data science with repeatable methods. You will learn how to turn data lakes into business assets.

The data science technology stack demonstrated in Practical Data Science is built from components in general use in the industry. Data scientist Andreas Vermeulen demonstrates in detail how to build and provision a technology stack to yield repeatable results. He shows you how to apply practical methods to extract actionable business knowledge from data lakes consisting of data from a polyglot of data types and dimensions.

What You'll Learn
Become fluent in the essential concepts and terminology of data science and data engineering
Build and use a technology stack that meets industry criteria
Master the methods for retrieving actionable business knowledge
Coordinate the handling of polyglot data types in a data lake for repeatable results

Who This Book Is For
Data scientists and data engineers who are required to convert data from a data lake into actionable knowledge for their business, and students who aspire to be data scientists and data engineers.
Chapter 1: Data Science Technology Stack
Chapter 2: Vermeulen - Krennwallner - Hillman - Clark
Chapter 3: Layered Framework
Chapter 4: Business Layer
Chapter 5: Utility Layer
Chapter 6: Three Management Layers
Chapter 7: Retrieve Super Step
Chapter 8: Assess Super Step
Chapter 9: Process Super Step
Chapter 10: Transform Super Step
Chapter 11: Organize and Report Super Step
Provides the essential concepts and terminology to gain fluency in data science and data engineering
Walks through the steps of building a technology stack on a layered framework to retrieve actionable business knowledge
Teaches how to synthesize the polyglot data types in a data lake with repeatable results
Product details
ISBN: 9781484230534
Published: 2018-02-22
Publisher: Apress
Height: 254 mm
Width: 178 mm
Audience level: Professional/practitioner, P, UP, UU, 06, 05
Language: English
Format: Paperback
Author