Zum Hauptinhalt springen Zur Suche springen Zur Hauptnavigation springen
Beschreibung
This groundbreaking book transcends traditional machine learning approaches by introducing information measurement methodologies that revolutionize the field.

Stemming from a UC Berkeley seminar on experimental design for machine learning tasks, these techniques aim to overcome the 'black box' approach of machine learning by reducing conjectures such as magic numbers (hyper-parameters) or model-type bias. Information-based machine learning enables data quality measurements, a priori task complexity estimations, and reproducible design of data science experiments. The benefits include significant size reduction, increased explainability, and enhanced resilience of models, all contributing to advancing the discipline's robustness and credibility.

While bridging the gap between machine learning and disciplines such as physics, information theory, and computer engineering, this textbook maintains an accessible and comprehensive style, making complex topics digestible fora broad readership. Information-Driven Machine Learning explores the synergistic harmony among these disciplines to enhance our understanding of data science modeling. Instead of solely focusing on the "how," this text provides answers to the "why" questions that permeate the field, shedding light on the underlying principles of machine learning processes and their practical implications. By advocating for systematic methodologies grounded in fundamental principles, this book challenges industry practices that have often evolved from ideologic or profit-driven motivations. It addresses a range of topics, including deep learning, data drift, and MLOps, using fundamental principles such as entropy, capacity, and high dimensionality.

Ideal for both academia and industry professionals, this textbook serves as a valuable tool for those seeking to deepen their understanding of data science as an engineering discipline. Its thought-provoking content stimulates intellectual curiosity and caters to readers who desire more than just code or ready-made formulas. The text invites readers to explore beyond conventional viewpoints, offering an alternative perspective that promotes a big-picture view for integrating theory with practice. Suitable for upper undergraduate or graduate-level courses, this book can also benefit practicing engineers and scientists in various disciplines by enhancing their understanding of modeling and improving data measurement effectively.
This groundbreaking book transcends traditional machine learning approaches by introducing information measurement methodologies that revolutionize the field.

Stemming from a UC Berkeley seminar on experimental design for machine learning tasks, these techniques aim to overcome the 'black box' approach of machine learning by reducing conjectures such as magic numbers (hyper-parameters) or model-type bias. Information-based machine learning enables data quality measurements, a priori task complexity estimations, and reproducible design of data science experiments. The benefits include significant size reduction, increased explainability, and enhanced resilience of models, all contributing to advancing the discipline's robustness and credibility.

While bridging the gap between machine learning and disciplines such as physics, information theory, and computer engineering, this textbook maintains an accessible and comprehensive style, making complex topics digestible fora broad readership. Information-Driven Machine Learning explores the synergistic harmony among these disciplines to enhance our understanding of data science modeling. Instead of solely focusing on the "how," this text provides answers to the "why" questions that permeate the field, shedding light on the underlying principles of machine learning processes and their practical implications. By advocating for systematic methodologies grounded in fundamental principles, this book challenges industry practices that have often evolved from ideologic or profit-driven motivations. It addresses a range of topics, including deep learning, data drift, and MLOps, using fundamental principles such as entropy, capacity, and high dimensionality.

Ideal for both academia and industry professionals, this textbook serves as a valuable tool for those seeking to deepen their understanding of data science as an engineering discipline. Its thought-provoking content stimulates intellectual curiosity and caters to readers who desire more than just code or ready-made formulas. The text invites readers to explore beyond conventional viewpoints, offering an alternative perspective that promotes a big-picture view for integrating theory with practice. Suitable for upper undergraduate or graduate-level courses, this book can also benefit practicing engineers and scientists in various disciplines by enhancing their understanding of modeling and improving data measurement effectively.
Über den Autor
Gerald Friedland: Dr. Friedlands Beiträge auf dem Gebiet des maschinellen Lernens, die in der "AI2000 Most Influential Scholar"-Liste als meistzitierte KI-Werke des letzten Jahrzehnts aufgeführt sind, waren sowohl substanziell als auch nachhaltig, seit er 2001 auf diesem Gebiet zu arbeiten begann. Sein "Simple Interactive Object Extraction"-Algorithmus ist seit 2005 Teil von Open-Source-Bildbearbeitungs- und Erstellungsanwendungen (wie z.B. GIMP oder Blender) und sein cloudloses MOVI Speech Recognition Board wird seit 2015 von Herstellern verwendet. Derzeit ist er außerordentlicher Dozent an der University of California, Berkeley, Fakultätsmitglied des Berkeley Institute of Data Science und leitender Wissenschaftler im Sagemaker-Team bei Amazon AWS.
Nach seiner Promotion an der Freien Universität Berlin unter Prof. Raul Rojas im Jahr 2006, leitete Dr. Friedland als Direktor für Audio- und Multimedia-Forschung am International Computer Science Institute in Berkeley ein Forscherteam im Bereich Sprach- und Multimedia-Inhaltsanalyse. Anschließend war er von 2016 bis 2019 Principal Data Scientist am Lawrence Livermore National Lab. In diesem Jahr war er Mitbegründer von Brainome, Inc., wo er sein technisches Fachwissen nutzte, um ein automatisches maschinelles Lerntool zu entwickeln, das auf Techniken zur Informationsmessung basiert, welche auch im Mittelpunkt dieses Buches stehen. Seine Reise führte ihn dann im Jahr 2022 als Principal Scientist, AutoML, zu Amazon AWS.
Über seine Branchen- und akademischen Tätigkeiten hinaus ist Gerald ein erfahrener Autor. Seine Literaturbeiträge reichen von den Lehrbüchern Multimedia Computing (Cambridge University Press) und Multimodal Location Estimation of Videos and Images (Springer) bis zu einem von Apress veröffentlichten Programmierbuch für Kinder.
Inhaltsverzeichnis
Preface.- 1 Introduction.- 2 The Automated Scientific Process.- 3 The (Black Box) Machine Learning Process.- 4 Information Theory.- 5 Capacity.- 6 The Mechanics of Generalization.- 7 Meta-Math: Exploring the Limits of Modeling.- 8 Capacity of Neural Networks.- 8 Capacity of Neural Networks.- 10 Capacities of some other Machine Learning Methods.- 11 Data Collection and Preparation.- 12 Measuring Data Sufficiency.- 13 Machine Learning Operations.- 14 Explainability.- 15 Repeatability and Reproducibility.- 16 The Curse of Training and the Blessing of High Dimensionality.- 16 The Curse of Training and the Blessing of High Dimensionality.- Appendix A Recap: The Logarithm.- Appendix B More on Complexity.- Appendix C Concepts Cheat Sheet.- Appendix D A Review Form that Promotes Reproducibility.- List of Illustrations.- Bibliography.
Details
Erscheinungsjahr: 2024
Genre: Informatik, Mathematik, Medizin, Naturwissenschaften, Technik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: xxii
267 S.
17 s/w Illustr.
33 farbige Illustr.
267 p. 50 illus.
33 illus. in color.
ISBN-13: 9783031394799
ISBN-10: 3031394798
Sprache: Englisch
Einband: Kartoniert / Broschiert
Autor: Friedland, Gerald
Hersteller: Springer
Springer International Publishing AG
Verantwortliche Person für die EU: Springer Verlag GmbH, Tiergartenstr. 17, D-69121 Heidelberg, juergen.hartmann@springer.com
Maße: 235 x 155 x 16 mm
Von/Mit: Gerald Friedland
Erscheinungsdatum: 02.12.2024
Gewicht: 0,446 kg
Artikel-ID: 130630695