Dekorationsartikel gehören nicht zum Leistungsumfang.
Practical Enterprise Data Lake Insights
Handle Data-Driven Challenges in an Enterprise Big Data Lake
Taschenbuch von Venkata Giri (u. a.)
Sprache: Englisch

47,10 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

Lieferzeit 4-7 Werktage

Kategorien:
Beschreibung
Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.
When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data. Starting from sourcing data into the Hadoop ecosystem, you will go through stages that can bring up tough questions such as data processing, data querying, and security. Concepts such as change data capture and data streaming are covered. The book takes an end-to-end solution approach in a data lake environment that includes data security, high availability, data processing, data streaming, and more.
Each chapter includes application of a concept, code snippets, and use case demonstrations to provide you with a practical approach. You will learn the concept, scope, application, and starting point.
What You'll Learn
Get to know data lake architecture and design principles
Implement data capture and streaming strategies

Implement data processing strategies in Hadoop
Understand the data lake security framework and availability model
Who This Book Is For
Big data architects and solution architects
Use this practical guide to successfully handle the challenges encountered when designing an enterprise data lake and learn industry best practices to resolve issues.
When designing an enterprise data lake you often hit a roadblock when you must leave the comfort of the relational world and learn the nuances of handling non-relational data. Starting from sourcing data into the Hadoop ecosystem, you will go through stages that can bring up tough questions such as data processing, data querying, and security. Concepts such as change data capture and data streaming are covered. The book takes an end-to-end solution approach in a data lake environment that includes data security, high availability, data processing, data streaming, and more.
Each chapter includes application of a concept, code snippets, and use case demonstrations to provide you with a practical approach. You will learn the concept, scope, application, and starting point.
What You'll Learn
Get to know data lake architecture and design principles
Implement data capture and streaming strategies

Implement data processing strategies in Hadoop
Understand the data lake security framework and availability model
Who This Book Is For
Big data architects and solution architects
Über den Autor
Saurabh K. Gupta is a technology leader, published author, and database enthusiast with more than 11 years of industry experience in data architecture, engineering, development, and administration. Working as a Manager, Data & Analytics at GE Transportation, his focus lies with data lake analytics programs that build a digital solution for business stakeholders. In the past, he has worked extensively with Oracle database design and development, PaaS and IaaS cloud service models, consolidation, and in-memory technologies. He has authored two books on advanced PL/SQL for Oracle versions 11g and 12c. He is a frequent speaker at numerous conferences organized by the user community and technical institutions. He tweets at [...] and blogs at sbhoracle.[...].
Venkata Giri currently works with GE Digital and has been involved with building resilient distributed services at a massive scale. He has worked on big data tech stack, relational databases, high availability, and performance tuning. With over 20 years of experience in data technologies, he has in-depth knowledge of big data ecosystems, complex data ingestion pipelines, data engineering, data processing, and operations. Prior to working at GE, he worked with the data teams at Linkedin and Yahoo.
Zusammenfassung

First book to provide an end-to-end solution approach

Includes data capture strategies for time series and relational data

Covers data processing using Hive and Spark

Inhaltsverzeichnis
Chapter 1: Introduction to Enterprise Data Lakes.- Chapter 2: Data Lake Ingestion Strategies.- Chapter - 3: Capture Streaming Data with Change-Data-Capture.- Chapter 4: Data Processing Strategies in Data Lakes.- Chapter 5: Data Archiving Strategies in Data Lakes.- Chapter 6: Data Security in Data Lakes.- Chapter 7: Ensuring High-Availability of Data Lakes.- Chapter 8: Managing Data Lake Operations.
Details
Erscheinungsjahr: 2018
Fachbereich: Datenkommunikation, Netze & Mailboxen
Genre: Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Seiten: 348
Inhalt: xviii
327 S.
90 s/w Illustr.
327 p. 90 illus.
ISBN-13: 9781484235218
ISBN-10: 1484235215
Sprache: Englisch
Herstellernummer: 978-1-4842-3521-8
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: Giri, Venkata
Gupta, Saurabh
Auflage: 1st ed.
Hersteller: APRESS
Maße: 235 x 155 x 19 mm
Von/Mit: Venkata Giri (u. a.)
Erscheinungsdatum: 28.06.2018
Gewicht: 0,528 kg
preigu-id: 111194651
Über den Autor
Saurabh K. Gupta is a technology leader, published author, and database enthusiast with more than 11 years of industry experience in data architecture, engineering, development, and administration. Working as a Manager, Data & Analytics at GE Transportation, his focus lies with data lake analytics programs that build a digital solution for business stakeholders. In the past, he has worked extensively with Oracle database design and development, PaaS and IaaS cloud service models, consolidation, and in-memory technologies. He has authored two books on advanced PL/SQL for Oracle versions 11g and 12c. He is a frequent speaker at numerous conferences organized by the user community and technical institutions. He tweets at [...] and blogs at sbhoracle.[...].
Venkata Giri currently works with GE Digital and has been involved with building resilient distributed services at a massive scale. He has worked on big data tech stack, relational databases, high availability, and performance tuning. With over 20 years of experience in data technologies, he has in-depth knowledge of big data ecosystems, complex data ingestion pipelines, data engineering, data processing, and operations. Prior to working at GE, he worked with the data teams at Linkedin and Yahoo.
Zusammenfassung

First book to provide an end-to-end solution approach

Includes data capture strategies for time series and relational data

Covers data processing using Hive and Spark

Inhaltsverzeichnis
Chapter 1: Introduction to Enterprise Data Lakes.- Chapter 2: Data Lake Ingestion Strategies.- Chapter - 3: Capture Streaming Data with Change-Data-Capture.- Chapter 4: Data Processing Strategies in Data Lakes.- Chapter 5: Data Archiving Strategies in Data Lakes.- Chapter 6: Data Security in Data Lakes.- Chapter 7: Ensuring High-Availability of Data Lakes.- Chapter 8: Managing Data Lake Operations.
Details
Erscheinungsjahr: 2018
Fachbereich: Datenkommunikation, Netze & Mailboxen
Genre: Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Seiten: 348
Inhalt: xviii
327 S.
90 s/w Illustr.
327 p. 90 illus.
ISBN-13: 9781484235218
ISBN-10: 1484235215
Sprache: Englisch
Herstellernummer: 978-1-4842-3521-8
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: Giri, Venkata
Gupta, Saurabh
Auflage: 1st ed.
Hersteller: APRESS
Maße: 235 x 155 x 19 mm
Von/Mit: Venkata Giri (u. a.)
Erscheinungsdatum: 28.06.2018
Gewicht: 0,528 kg
preigu-id: 111194651
Warnhinweis

Ähnliche Produkte

Ähnliche Produkte