Generative Ai-Driven Application Development with Java: Leveraging Large Language Models in Modern Java Applications
暫譯: 使用 Java 的生成式 AI 驅動應用程式開發:在現代 Java 應用程式中利用大型語言模型

Sahu, Satej Kumar

  • 出版商: Apress
  • 出版日期: 2025-12-22
  • 售價: $1,710
  • 貴賓價: 9.5$1,625
  • 語言: 英文
  • 頁數: 698
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 9798868816086
  • ISBN-13: 9798868816086
  • 相關分類: Large language modelJava 相關技術CI/CD
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

This is the first hands-on guide that takes you from a simple "Hello, LLM" to production-ready microservices, all within the JVM. You'll integrate hosted models such as OpenAI's GPT-4o, run alternatives with Ollama or Jlama, and embed them in Spring Boot or Quarkus apps for cloud or on-pre deployment.

You'll learn how prompt-engineering patterns, Retrieval-Augmented Generation (RAG), vector stores such as Pinecone and Milvus, and agentic workflows come together to solve real business problems. Robust test suites, CI/CD pipelines, and security guardrails ensure your AI features reach production safely, while detailed observability playbooks help you catch hallucinations before your users do. You'll also explore DJL, the future of machine learning in Java.

This book delivers runnable examples, clean architectural diagrams, and a GitHub repo you can clone on day one. Whether you're modernizing a legacy platform or launching a green-field service, you'll have a roadmap for adding state-of-the-art generative AI without abandoning the language--and ecosystem--you rely on.

What You Will Learn

  • Establish generative AI and LLM foundations
  • Integrate hosted or local models using Spring Boot, Quarkus, LangChain4j, Spring AI, OpenAI, Ollama, and Jlama
  • Craft effective prompts and implement RAG with Pinecone or Milvus for context-rich answers
  • Build secure, observable, scalable AI microservices for cloud or on-prem deployment
  • Test outputs, add guardrails, and monitor performance of LLMs and applications
  • Explore advanced patterns, such as agentic workflows, multimodal LLMs, and practical image-processing use cases

Who This Book Is For

Java developers, architects, DevOps engineers, and technical leads who need to add AI features to new or existing enterprise systems. Data scientists and educators will also appreciate the code-first, Java-centric approach.

商品描述(中文翻譯)

這是第一本實作指南,帶您從簡單的「Hello, LLM」到生產就緒的微服務,全部在JVM內完成。您將整合如OpenAI的GPT-4o等託管模型,運行Ollama或Jlama等替代方案,並將它們嵌入Spring Boot或Quarkus應用程式中,以便於雲端或本地部署。您將學習如何將提示工程模式、檢索增強生成(Retrieval-Augmented Generation, RAG)、如Pinecone和Milvus的向量存儲,以及代理工作流程結合起來,以解決實際的商業問題。穩健的測試套件、CI/CD管道和安全防護措施確保您的AI功能安全地達到生產環境,而詳細的可觀察性手冊幫助您在用戶發現之前捕捉到幻覺。您還將探索DJL,Java中機器學習的未來。

本書提供可執行的範例、清晰的架構圖,以及您可以在第一天就克隆的GitHub倉庫。無論您是在現代化遺留平台還是啟動全新服務,您都將擁有一條添加最先進生成AI的路線圖,而不必放棄您所依賴的語言和生態系統。

您將學到的內容:
- 建立生成AI和LLM的基礎
- 使用Spring Boot、Quarkus、LangChain4j、Spring AI、OpenAI、Ollama和Jlama整合託管或本地模型
- 編寫有效的提示並使用Pinecone或Milvus實現RAG,以獲得豐富上下文的答案
- 建立安全、可觀察、可擴展的AI微服務,以便於雲端或本地部署
- 測試輸出、添加防護措施並監控LLM和應用程式的性能
- 探索進階模式,如代理工作流程、多模態LLM和實用的影像處理案例

本書適合對象:
Java開發人員、架構師、DevOps工程師和技術負責人,他們需要為新或現有的企業系統添加AI功能。數據科學家和教育工作者也會欣賞這種以代碼為先、以Java為中心的方法。

作者簡介

Satej Kumar Sahu is a Principal Engineer at Zalando SE with 15 years of hands-on experience designing large-scale, data-intensive systems for global brands including Boeing, Adidas, and Honeywell. A specialist in software architecture, big-data pipelines, and applied machine learning, he has shepherded multiple projects from whiteboard sketches to production deployments serving millions of users.


Satej has been working with Large Language Models since their earliest open-source releases, piloting Retrieval-Augmented Generation (RAG) and agentic patterns long before they became industry buzzwords. He is the author of two previous programming books--Building Secure PHP Applications and PHP 8 Basics--and is a frequent speaker at developer conferences and meet-ups across the world.


When he isn't translating cutting-edge AI research into practical code, you'll find him mentoring engineering teams, contributing to open-source projects, or tinkering with the newest transformer models in his home lab.

作者簡介(中文翻譯)

Satej Kumar Sahu 是 Zalando SE 的首席工程師,擁有 15 年設計大型數據密集型系統的實務經驗,曾為包括波音、阿迪達斯和霍尼韌等全球品牌服務。他專注於軟體架構、大數據管道和應用機器學習,成功將多個專案從白板草圖推進到生產部署,服務數百萬用戶。

Satej 自最早的開源版本以來便開始研究大型語言模型,早在檢索增強生成 (Retrieval-Augmented Generation, RAG) 和代理模式成為業界流行詞之前,他就已經在這方面進行探索。他是兩本先前程式設計書籍的作者——《建立安全的 PHP 應用程式》和《PHP 8 基礎》,並且經常在全球的開發者會議和聚會上發表演講。

當他不在將尖端 AI 研究轉化為實用程式碼時,你會發現他在指導工程團隊、貢獻於開源專案,或在家中的實驗室中研究最新的變壓器模型。