Architecting Generative AI Applications: Build, deploy, and scale production-ready GenAI systems with LLMOps best practices
暫譯: 架構生成式 AI 應用程式:使用 LLMOps 最佳實踐建構、部署及擴展生產就緒的 GenAI 系統
Kuligin, Leonid
- 出版商: Packt Publishing
- 出版日期: 2026-03-30
- 售價: $1,840
- 貴賓價: 9.5 折 $1,748
- 語言: 英文
- 頁數: 278
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1806678659
- ISBN-13: 9781806678655
-
相關分類:
Large language model
海外代購書籍(需單獨結帳)
相關主題
商品描述
Take generative AI applications from prototype to production by mastering LLM architectures, evaluation strategies, LLMOps workflows, and deployment pipelines, using proven approaches to build reliable, secure, and scalable systems
Free with your book: DRM-free PDF version + access to Packt's next-gen Reader*
Key Features:
- Learn how to take generative AI apps from prototype to production
- Apply evaluation, LLMOps, and SRE practices for reliable systems
- Design scalable architectures using modern AI engineering patterns
Book Description:
Build production-ready generative AI applications by moving beyond prototypes and applying proven engineering principles. This book shows you how to design, evaluate, deploy, and scale AI systems that remain reliable, secure, and maintainable in real-world environments.
Vibe-coding tools and coding assistants make it easy to create prototypes, but taking them into production is where most teams struggle. Written by a Staff AI Engineer at Google, this book guides you through scoping use cases, aligning them with business goals, and scaling generative AI adoption. You'll learn how to evaluate LLMs using offline metrics, human-in-the-loop approaches, and statistical testing, as well as how to design architectures such as RAG, vector databases, agents, and memory systems.
You'll also understand how to operationalize these systems with production-grade code, testing practices, and DevOps, MLOps, and LLMOps workflows. The book covers deployment, scaling, and key considerations for security, Responsible AI, observability, and reliability.
By the end of this book, you will be able to design, deploy, and maintain scalable generative AI applications, run A/B tests to measure impact, and apply durable engineering principles so your systems succeed beyond the prototype stage.
*Email sign-up and proof of purchase required
What You Will Learn:
- Design end-to-end generative AI product workflows
- Build and evaluate AI systems with robust metrics
- Implement production-ready code and testing practices
- Apply LLMOps and automation for AI deployments
- Architect scalable systems using modern AI patterns
- Improve reliability with observability and SRE practices
- Run A/B tests to measure product impact effectively
Who this book is for:
Technical leaders, AI engineers, data scientists, software engineers, and architects building generative AI applications. Engineering managers, product leaders, and decision-makers seeking to deploy, scale, and maintain production-grade AI systems will also benefit.
Table of Contents
- Building a Prototype
- Evaluation
- Key Architectures
- From Prototype to Production
- Moving from DevOps and MLOps to LLMOps
- Deploying Your Application
- Ethics and Security
- Observability and Reliability
- Maintaining Your Application
- A/B Testing and Online Experiments