Zum Hauptinhalt springen Zur Suche springen Zur Hauptnavigation springen
Beschreibung

This is the first hands-on guide that takes you from a simple “Hello, LLM” to production-ready microservices, all within the JVM. You’ll integrate hosted models such as OpenAI’s GPT-4o, run alternatives with Ollama or Jlama, and embed them in Spring Boot or Quarkus apps for cloud or on-pre deployment.

You’ll learn how prompt-engineering patterns, Retrieval-Augmented Generation (RAG), vector stores such as Pinecone and Milvus, and agentic workflows come together to solve real business problems. Robust test suites, CI/CD pipelines, and security guardrails ensure your AI features reach production safely, while detailed observability playbooks help you catch hallucinations before your users do. You’ll also explore DJL, the future of machine learning in Java.

This book delivers runnable examples, clean architectural diagrams, and a GitHub repo you can clone on day one. Whether you’re modernizing a legacy platform or launching a green-field service, you’ll have a roadmap for adding state-of-the-art generative AI without abandoning the language—and ecosystem—you rely on.

What You Will Learn

  • Establish generative AI and LLM foundations
  • Integrate hosted or local models using Spring Boot, Quarkus, LangChain4j, Spring AI, OpenAI, Ollama, and Jlama
  • Craft effective prompts and implement RAG with Pinecone or Milvus for context-rich answers
  • Build secure, observable, scalable AI microservices for cloud or on-prem deployment
  • Test outputs, add guardrails, and monitor performance of LLMs and applications
  • Explore advanced patterns, such as agentic workflows, multimodal LLMs, and practical image-processing use cases

Who This Book Is For

Java developers, architects, DevOps engineers, and technical leads who need to add AI features to new or existing enterprise systems. Data scientists and educators will also appreciate the code-first, Java-centric approach.

This is the first hands-on guide that takes you from a simple “Hello, LLM” to production-ready microservices, all within the JVM. You’ll integrate hosted models such as OpenAI’s GPT-4o, run alternatives with Ollama or Jlama, and embed them in Spring Boot or Quarkus apps for cloud or on-pre deployment.

You’ll learn how prompt-engineering patterns, Retrieval-Augmented Generation (RAG), vector stores such as Pinecone and Milvus, and agentic workflows come together to solve real business problems. Robust test suites, CI/CD pipelines, and security guardrails ensure your AI features reach production safely, while detailed observability playbooks help you catch hallucinations before your users do. You’ll also explore DJL, the future of machine learning in Java.

This book delivers runnable examples, clean architectural diagrams, and a GitHub repo you can clone on day one. Whether you’re modernizing a legacy platform or launching a green-field service, you’ll have a roadmap for adding state-of-the-art generative AI without abandoning the language—and ecosystem—you rely on.

What You Will Learn

  • Establish generative AI and LLM foundations
  • Integrate hosted or local models using Spring Boot, Quarkus, LangChain4j, Spring AI, OpenAI, Ollama, and Jlama
  • Craft effective prompts and implement RAG with Pinecone or Milvus for context-rich answers
  • Build secure, observable, scalable AI microservices for cloud or on-prem deployment
  • Test outputs, add guardrails, and monitor performance of LLMs and applications
  • Explore advanced patterns, such as agentic workflows, multimodal LLMs, and practical image-processing use cases

Who This Book Is For

Java developers, architects, DevOps engineers, and technical leads who need to add AI features to new or existing enterprise systems. Data scientists and educators will also appreciate the code-first, Java-centric approach.

Über den Autor
Satej Kumar Sahu is a Principal Engineer at Zalando SE with 15 years of hands-on experience designing large-scale, data-intensive systems for global brands including Boeing, Adidas, and Honeywell. A specialist in software architecture, big-data pipelines, and applied machine learning, he has shepherded multiple projects from whiteboard sketches to production deployments serving millions of users.

Satej has been working with Large Language Models since their earliest open-source releases, piloting Retrieval-Augmented Generation (RAG) and agentic patterns long before they became industry buzzwords. He is the author of two previous programming books—Building Secure PHP Applications and PHP 8 Basics—and is a frequent speaker at developer conferences and meet-ups across the world.

When he isn’t translating cutting-edge AI research into practical code, you’ll find him mentoring engineering teams, contributing to open-source projects, or tinkering with the newest transformer models in his home lab.
Inhaltsverzeichnis

1: Megabrains 101: Generative AI & LLMs Unboxed.- 2: First Contact: “Hello, LLM” with Spring Boot.- 3: The Transformer Saga—From Attention to Fine-Tuning- 4: Bring Your Own Model: Self-Hosting with Ollama.- 5: Power Tools: LangChain4j Quick-Start.- 6: Integrating LLMs with Java Applications.- 7: From Chatty to Clever: Retrieval-Augmented Generation.- 8: Spring AI Ninja Moves.- 9: Prompt Alchemy: Patterns that Make Models Look Smarter.- 10: Swiss-Army LLMs: Tool Calls in Spring AI.- 11: Agents Assemble! Building Autonomous Workflows.- 12: Quarkus + LangChain4j: Lightning-Fast Gen AI.- 13: Jlama & Friends: Hosting Models the Java Way.- 14: Seeing Is Believing: Multimodal LLMs & Image Hacking.- 15: Does It Even Work? Testing & Evaluating LLM Apps.- 16: Cloud Power-Ups—Bedrock, Vertex & Azure OpenAI.- 17: Talking in Protocols: The MCP Revolution.- 18: Can You See Me Now? Observability for LLM Pipelines.- 19: Native-Speed Machine Learning in Java: DJL, ONNX & JNI.- 20: Architectures of Tomorrow: From Monoliths to Modular Minds.

Details
Erscheinungsjahr: 2026
Genre: Importe, Informatik
Rubrik: Naturwissenschaften & Technik
Medium: Taschenbuch
Inhalt: xxv
698 S.
11 s/w Illustr.
203 farbige Illustr.
698 p. 214 illus.
203 illus. in color.
ISBN-13: 9798868816086
Sprache: Englisch
Einband: Kartoniert / Broschiert
Autor: Sahu, Satej Kumar
Auflage: First Edition
Hersteller: Apress
Apress L.P.
Verantwortliche Person für die EU: APress in Springer Science + Business Media, Heidelberger Platz 3, D-14197 Berlin, juergen.hartmann@springer.com
Maße: 254 x 178 x 39 mm
Von/Mit: Satej Kumar Sahu
Erscheinungsdatum: 03.01.2026
Gewicht: 1,337 kg
Artikel-ID: 134424719

Ähnliche Produkte