Zum Hauptinhalt springen
Dekorationsartikel gehören nicht zum Leistungsumfang.
Machine Learning Upgrade
A Data Scientist's Guide to Mlops, Llms, and ML Infrastructure
Taschenbuch von Kristen Kehrer (u. a.)
Sprache: Englisch

40,40 €*

inkl. MwSt.

Versandkostenfrei per Post / DHL

auf Lager, Lieferzeit 1-2 Werktage

Kategorien:
Beschreibung

An end-to-end framework for developing Large Language Model (LLM)-based applications

Traditionally, there has been a divide between data scientists and software engineers. With the advent of LLMs, however, this has changed. Machine learning is no longer primarily a tool for data analysis, but is now a fundamental feature of modern software applications. In Machine Learning Upgrade, data scientists are given a comprehensive framework not just for understanding LLMs, but for building efficient, reproducible, and scalable LLM applications.

Written by leading data scientists, this book brings you up to date on the current state of LLM technology and offers both a conceptual and hands-on overview of how it can be most responsibly integrated into business. Readers will follow along as the authors build an LLM-powered application, providing a concrete example of their framework in action. Best practices for data versioning, experiment tracking, model monitoring, and ethical considerations are also central.

Data professionals of all levels looking for a holistic understanding of LLM aplications using the latest technologies and practices will benefit from this book. By adopting a data-centric view, we can identify opportunities to integrate LLMs and drive business success.

An end-to-end framework for developing Large Language Model (LLM)-based applications

Traditionally, there has been a divide between data scientists and software engineers. With the advent of LLMs, however, this has changed. Machine learning is no longer primarily a tool for data analysis, but is now a fundamental feature of modern software applications. In Machine Learning Upgrade, data scientists are given a comprehensive framework not just for understanding LLMs, but for building efficient, reproducible, and scalable LLM applications.

Written by leading data scientists, this book brings you up to date on the current state of LLM technology and offers both a conceptual and hands-on overview of how it can be most responsibly integrated into business. Readers will follow along as the authors build an LLM-powered application, providing a concrete example of their framework in action. Best practices for data versioning, experiment tracking, model monitoring, and ethical considerations are also central.

Data professionals of all levels looking for a holistic understanding of LLM aplications using the latest technologies and practices will benefit from this book. By adopting a data-centric view, we can identify opportunities to integrate LLMs and drive business success.

Über den Autor

Kristen Kehrer has been providing innovative and practical statistical modeling solutions since 2010. In 2018, she achieved recognition as a LinkedIn Top Voice in Data Science & Analytics. Kristen is also the founder of Data Moves Me, LLC.

Caleb Kaiser is a Full Stack Engineer at Comet. Caleb was previously on the Founding Team at Cortex Labs. Caleb also worked at Scribe Media on the Author Platform Team.

Inhaltsverzeichnis

Introduction ix

1 A Gentle Introduction to Modern Machine Learning 1

Data Science Is Diverging from Business Intelligence 3

From CRISP-DM to Modern, Multicomponent ml Systems 4

The Emergence of LLMs Has Increased ML's Power and Complexity 7

What You Can Expect from This Book 9

2 An End-to-End Approach 11

Components of a YouTube Search Agent 13

Principles of a Production Machine Learning System 16

Observability 19

Reproducibility 19

Interoperability 20

Scalability 21

Improvability 22

A Note on Tools 23

3 A Data-Centric View 25

The Emergence of Foundation Models 25

The Role of Off-the-Shelf Components 27

The Data-Driven Approach 28

A Note on Data Ethics 28

Building the Dataset 30

Working with Vector Databases 34

Data Versioning and Management 50

Getting Started with Data Versioning 53

Knowing "Just Enough" Engineering 57

4 Standing Up Your LLM 61

Selecting Your LLM 61

What Type of Inference Do I Need to Perform? 65

How Open-Ended Is This Task? 66

What Are the Privacy Concerns for This Data? 66

How Much Will This Model Cost? 67

Experiment Management with LLMs 68

LLM Inference 74

Basics of Prompt Engineering 74

In-Context Learning 77

Intermediary Computation 85

Augmented Generation 89

Agentic Techniques 94

Optimizing LLM Inference with Experiment Management 102

Fine-Tuning LLMs 111

When to Fine-Tune an LLM 112

Quantization, QLOrA, and Parameter Efficient Fine-Tuning 113

Wrapping Things Up 121

5 Putting Together an Application 123

Prototyping with Gradio 125

Creating Graphics with Plotnine 128

Adding the Author Selector 137

Adding a Logo 138

Adding a Tab 139

Adding a Title and Subtitle 140

Changing the Color of the Buttons 140

Click to Download Button 141

Putting It All Together 141

Deploying Models as APIs 144

Implementing an API with FastAPI 146

Implementing Uvicorn 148

Monitoring an LLM 149

Dockerizing Your Service 151

Deploying Your Own LLM 154

Wrapping Things Up 159

6 Rounding Out the ML Life Cycle 161

Deploying a Simple Random Forest Model 161

An Introduction to Model Monitoring 167

Model Monitoring with Evidently AI 175

Building a Model Monitoring System 176

Final Thoughts on Monitoring 187

7 Review of Best Practices 189

Step 1: Understand the Problem 189

Step 2: Model Selection and Training 190

Step 3: Deploy and Maintain 192

Step 4: Collaborate and Communicate 196

Emerging Trends in LLMs 197

Next Steps in Learning 199

Appendix: Additional LLM Example 201

Index 209

Details
Erscheinungsjahr: 2024
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: Einband - flex.(Paperback)
ISBN-13: 9781394249633
ISBN-10: 1394249632
Sprache: Englisch
Einband: Kartoniert / Broschiert
Autor: Kehrer, Kristen
Kaiser, Caleb
Hersteller: Wiley
Verantwortliche Person für die EU: Produktsicherheitsverantwortliche/r, Europaallee 1, D-36244 Bad Hersfeld, gpsr@libri.de
Maße: 225 x 151 x 14 mm
Von/Mit: Kristen Kehrer (u. a.)
Erscheinungsdatum: 20.08.2024
Gewicht: 0,284 kg
Artikel-ID: 128085065
Über den Autor

Kristen Kehrer has been providing innovative and practical statistical modeling solutions since 2010. In 2018, she achieved recognition as a LinkedIn Top Voice in Data Science & Analytics. Kristen is also the founder of Data Moves Me, LLC.

Caleb Kaiser is a Full Stack Engineer at Comet. Caleb was previously on the Founding Team at Cortex Labs. Caleb also worked at Scribe Media on the Author Platform Team.

Inhaltsverzeichnis

Introduction ix

1 A Gentle Introduction to Modern Machine Learning 1

Data Science Is Diverging from Business Intelligence 3

From CRISP-DM to Modern, Multicomponent ml Systems 4

The Emergence of LLMs Has Increased ML's Power and Complexity 7

What You Can Expect from This Book 9

2 An End-to-End Approach 11

Components of a YouTube Search Agent 13

Principles of a Production Machine Learning System 16

Observability 19

Reproducibility 19

Interoperability 20

Scalability 21

Improvability 22

A Note on Tools 23

3 A Data-Centric View 25

The Emergence of Foundation Models 25

The Role of Off-the-Shelf Components 27

The Data-Driven Approach 28

A Note on Data Ethics 28

Building the Dataset 30

Working with Vector Databases 34

Data Versioning and Management 50

Getting Started with Data Versioning 53

Knowing "Just Enough" Engineering 57

4 Standing Up Your LLM 61

Selecting Your LLM 61

What Type of Inference Do I Need to Perform? 65

How Open-Ended Is This Task? 66

What Are the Privacy Concerns for This Data? 66

How Much Will This Model Cost? 67

Experiment Management with LLMs 68

LLM Inference 74

Basics of Prompt Engineering 74

In-Context Learning 77

Intermediary Computation 85

Augmented Generation 89

Agentic Techniques 94

Optimizing LLM Inference with Experiment Management 102

Fine-Tuning LLMs 111

When to Fine-Tune an LLM 112

Quantization, QLOrA, and Parameter Efficient Fine-Tuning 113

Wrapping Things Up 121

5 Putting Together an Application 123

Prototyping with Gradio 125

Creating Graphics with Plotnine 128

Adding the Author Selector 137

Adding a Logo 138

Adding a Tab 139

Adding a Title and Subtitle 140

Changing the Color of the Buttons 140

Click to Download Button 141

Putting It All Together 141

Deploying Models as APIs 144

Implementing an API with FastAPI 146

Implementing Uvicorn 148

Monitoring an LLM 149

Dockerizing Your Service 151

Deploying Your Own LLM 154

Wrapping Things Up 159

6 Rounding Out the ML Life Cycle 161

Deploying a Simple Random Forest Model 161

An Introduction to Model Monitoring 167

Model Monitoring with Evidently AI 175

Building a Model Monitoring System 176

Final Thoughts on Monitoring 187

7 Review of Best Practices 189

Step 1: Understand the Problem 189

Step 2: Model Selection and Training 190

Step 3: Deploy and Maintain 192

Step 4: Collaborate and Communicate 196

Emerging Trends in LLMs 197

Next Steps in Learning 199

Appendix: Additional LLM Example 201

Index 209

Details
Erscheinungsjahr: 2024
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: Einband - flex.(Paperback)
ISBN-13: 9781394249633
ISBN-10: 1394249632
Sprache: Englisch
Einband: Kartoniert / Broschiert
Autor: Kehrer, Kristen
Kaiser, Caleb
Hersteller: Wiley
Verantwortliche Person für die EU: Produktsicherheitsverantwortliche/r, Europaallee 1, D-36244 Bad Hersfeld, gpsr@libri.de
Maße: 225 x 151 x 14 mm
Von/Mit: Kristen Kehrer (u. a.)
Erscheinungsdatum: 20.08.2024
Gewicht: 0,284 kg
Artikel-ID: 128085065
Sicherheitshinweis