Zum Hauptinhalt springen
Dekorationsartikel gehören nicht zum Leistungsumfang.
Kafka Troubleshooting in Production
Stabilizing Kafka Clusters in the Cloud and On-premises
Taschenbuch von Elad Eldor
Sprache: Englisch

44,15 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

Lieferzeit 1-2 Wochen

Kategorien:
Beschreibung
This book provides Kafka administrators, site reliability engineers, and DataOps and DevOps practitioners with a list of real production issues that can occur in Kafka clusters and how to solve them. The production issues covered are assembled into a comprehensive troubleshooting guide for those engineers who are responsible for the stability and performance of Kafka clusters in production, whether those clusters are deployed in the cloud or on-premises. This book teaches you how to detect and troubleshoot the issues, and eventually how to prevent them.
Kafka stability is hard to achieve, especially in high throughput environments, and the purpose of this book is not only to make troubleshooting easier, but also to prevent production issues from occurring in the first place. The guidance in this book is drawn from the author's years of experience in helping clients and internal customers diagnose and resolve knotty production problems and stabilize their Kafka environments. The book is organized into recipe-style troubleshooting checklists that field engineers can easily follow when under pressure to fix an unstable cluster. This is the book you will want by your side when the stakes are high, and your job is on the line.
What You Will Learn
Monitor and resolve production issues in your Kafka clusters
Provision Kafka clusters with the lowest costs and still handle the required loads
Perform root cause analyses of issues affecting your Kafka clusters
Know the ways in which your Kafka cluster can affect its consumers and producers
Prevent or minimize data loss and delays in data streaming
Forestall production issues through an understanding of common failure points
Create checklists for troubleshooting your Kafka clusters when problems occur
Who This Book Is For
Site reliability engineers tasked with maintaining stability of Kafka clusters, Kafka administrators who troubleshoot production issues around Kafka, DevOps and DataOps experts who are involved with provisioning Kafka (whether on-premises or in the cloud), developers of Kafka consumers and producers who wish to learn more about Kafka
This book provides Kafka administrators, site reliability engineers, and DataOps and DevOps practitioners with a list of real production issues that can occur in Kafka clusters and how to solve them. The production issues covered are assembled into a comprehensive troubleshooting guide for those engineers who are responsible for the stability and performance of Kafka clusters in production, whether those clusters are deployed in the cloud or on-premises. This book teaches you how to detect and troubleshoot the issues, and eventually how to prevent them.
Kafka stability is hard to achieve, especially in high throughput environments, and the purpose of this book is not only to make troubleshooting easier, but also to prevent production issues from occurring in the first place. The guidance in this book is drawn from the author's years of experience in helping clients and internal customers diagnose and resolve knotty production problems and stabilize their Kafka environments. The book is organized into recipe-style troubleshooting checklists that field engineers can easily follow when under pressure to fix an unstable cluster. This is the book you will want by your side when the stakes are high, and your job is on the line.
What You Will Learn
Monitor and resolve production issues in your Kafka clusters
Provision Kafka clusters with the lowest costs and still handle the required loads
Perform root cause analyses of issues affecting your Kafka clusters
Know the ways in which your Kafka cluster can affect its consumers and producers
Prevent or minimize data loss and delays in data streaming
Forestall production issues through an understanding of common failure points
Create checklists for troubleshooting your Kafka clusters when problems occur
Who This Book Is For
Site reliability engineers tasked with maintaining stability of Kafka clusters, Kafka administrators who troubleshoot production issues around Kafka, DevOps and DataOps experts who are involved with provisioning Kafka (whether on-premises or in the cloud), developers of Kafka consumers and producers who wish to learn more about Kafka
Über den Autor
Elad Eldor is a DataOps team leader in the Grow division of Unity (formerly ironSource), working on handling stability issues, improving performance, and reducing the cost of high-scale Kafka, Druid, Presto, and Spark clusters on AWS. He has 12 years of experience as a backend software engineer and six years handling DataOps of big data Linux-based clusters.

Prior to working at Unity, Elad was a Site Reliability Engineer (SRE) at Cognyte, where he developed big data applications and handled the reliability and scalability of Spark and Kafka clusters in production. His main interests are performance tuning and cost reduction of big data clusters.

Zusammenfassung

Helps you position yourself as a focal point of Kafka expertise in your organization

Provides guidance to implement fast resolutions to Kafka production problems

Shows you how to reduce data delays and minimize data loss when production issues occur

Inhaltsverzeichnis

1. Storage Usage in Kafka: Challenges, Strategies and Best Practices.- 2. Strategies for Aggregation, Data Cardinality and Batching.- 3. Understanding and Addressing Partition Skew in Kafka.- 4. Dealing with Skewed and Lost Leaders.- 5. CPU Saturation in Kafka: Causes, Consequences and Solutions.- 6. RAM Allocation in Kafka Clusters: Performance, Stability and Optimization Strategies.- 7. Disk IO Overload in Kafka: Diagnosing and Overcoming Challenges.- 8. Disk Configuration: RAID10 vs. JBOD.- 9 A Deep Dive into Producer Monitoring.- 10. A Deep Dive into Consumer Monitoring.- 11. Stability issues in On-premises Kafka Data Centers.- 12. Cost Reduction of Kafka Clusters.

Details
Erscheinungsjahr: 2023
Genre: Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: xx
216 S.
17 s/w Illustr.
74 farbige Illustr.
216 p. 91 illus.
74 illus. in color.
ISBN-13: 9781484294895
ISBN-10: 1484294890
Sprache: Englisch
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: Eldor, Elad
Auflage: 1st ed.
Hersteller: Apress
Apress L.P.
Maße: 254 x 178 x 13 mm
Von/Mit: Elad Eldor
Erscheinungsdatum: 30.11.2023
Gewicht: 0,453 kg
Artikel-ID: 126734075
Über den Autor
Elad Eldor is a DataOps team leader in the Grow division of Unity (formerly ironSource), working on handling stability issues, improving performance, and reducing the cost of high-scale Kafka, Druid, Presto, and Spark clusters on AWS. He has 12 years of experience as a backend software engineer and six years handling DataOps of big data Linux-based clusters.

Prior to working at Unity, Elad was a Site Reliability Engineer (SRE) at Cognyte, where he developed big data applications and handled the reliability and scalability of Spark and Kafka clusters in production. His main interests are performance tuning and cost reduction of big data clusters.

Zusammenfassung

Helps you position yourself as a focal point of Kafka expertise in your organization

Provides guidance to implement fast resolutions to Kafka production problems

Shows you how to reduce data delays and minimize data loss when production issues occur

Inhaltsverzeichnis

1. Storage Usage in Kafka: Challenges, Strategies and Best Practices.- 2. Strategies for Aggregation, Data Cardinality and Batching.- 3. Understanding and Addressing Partition Skew in Kafka.- 4. Dealing with Skewed and Lost Leaders.- 5. CPU Saturation in Kafka: Causes, Consequences and Solutions.- 6. RAM Allocation in Kafka Clusters: Performance, Stability and Optimization Strategies.- 7. Disk IO Overload in Kafka: Diagnosing and Overcoming Challenges.- 8. Disk Configuration: RAID10 vs. JBOD.- 9 A Deep Dive into Producer Monitoring.- 10. A Deep Dive into Consumer Monitoring.- 11. Stability issues in On-premises Kafka Data Centers.- 12. Cost Reduction of Kafka Clusters.

Details
Erscheinungsjahr: 2023
Genre: Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: xx
216 S.
17 s/w Illustr.
74 farbige Illustr.
216 p. 91 illus.
74 illus. in color.
ISBN-13: 9781484294895
ISBN-10: 1484294890
Sprache: Englisch
Ausstattung / Beilage: Paperback
Einband: Kartoniert / Broschiert
Autor: Eldor, Elad
Auflage: 1st ed.
Hersteller: Apress
Apress L.P.
Maße: 254 x 178 x 13 mm
Von/Mit: Elad Eldor
Erscheinungsdatum: 30.11.2023
Gewicht: 0,453 kg
Artikel-ID: 126734075
Warnhinweis