Building and Training Generative AI Models: A Practical Guide to Generative AI Development and Scaling
暫譯: 建立與訓練生成式 AI 模型:生成式 AI 開發與擴展的實用指南
Cronin, Irena
- 出版商: Apress
- 出版日期: 2026-04-02
- 售價: $2,440
- 貴賓價: 9.5 折 $2,318
- 語言: 英文
- 頁數: 625
- 裝訂: Quality Paper - also called trade paper
- ISBN: 9798868823312
- ISBN-13: 9798868823312
-
相關分類:
GAN 生成對抗網絡
海外代購書籍(需單獨結帳)
商品描述
This book is a hands-on, technical guide to building and deploying generative AI models using advanced deep learning architectures like transformers, GANs, VAEs, and diffusion models. Designed for AI engineers, data scientists, and ML practitioners, it offers a practical roadmap from data ingestion to real-world deployment and evaluation.
The book starts by guiding readers on selecting the right model architecture for their application, be it text generation, image synthesis, or multimodal tasks. It then walks through essential components of model training, including dataset handling, self-supervised learning, and core optimisation techniques such as backpropagation, gradient descent, and learning rate scheduling. It also delves into large-scale training infrastructure, covering GPU/TPU usage, distributed computing frameworks, and system-level strategies for scaling performance. Practical guidance is provided on fine-tuning models with domain-specific data and applying reinforcement learning from human feedback (RLHF), model quantisation, and pruning to improve efficiency. Key challenges in generative AI--such as overfitting, bias, hallucination, and data efficiency--are addressed through proven techniques and emerging best practices. Readers will also gain insight into model interpretability and generalisation, ensuring robust and trustworthy outputs. The book demonstrates how to build scalable, production-ready generative systems across domains like media, healthcare, scientific simulation, and design through real-world examples and applied case studies.
By the end, readers will gain an understanding of how to architect, optimise, and apply generative models across diverse domains such as media creation, healthcare, design, scientific simulation, and beyond.
What you will learn;
- Learn how to choose and implement generative models--VAEs, GANs, transformers, and diffusion models--for specific use cases.
- Master training optimization techniques such as backpropagation, gradient descent, adaptive learning rates, and regularization.
- Apply best practices for large-scale training using GPUs, TPUs, and distributed computing frameworks for performance scaling.
- Boost model efficiency through quantization, pruning, fine-tuning, and RLHF to enhance output quality and reduce overhead.
Who this book is for:
AI Engineers and Machine Learning Practitioners looking to build and deploy generative models in real-world applications. Data Scientists working on deep learning projects involving text, vision, audio, or multimodal generation.
商品描述(中文翻譯)
這本書是一本實用的技術指南,專注於使用先進的深度學習架構(如 transformers、GANs、VAEs 和擴散模型)來構建和部署生成式 AI 模型。該書旨在為 AI 工程師、數據科學家和機器學習從業者提供一個從數據攝取到實際部署和評估的實用路線圖。
本書首先指導讀者如何為其應用選擇合適的模型架構,無論是文本生成、圖像合成還是多模態任務。接著,書中詳細介紹了模型訓練的基本組件,包括數據集處理、自我監督學習,以及核心優化技術,如反向傳播、梯度下降和學習率調度。書中還深入探討了大規模訓練基礎設施,涵蓋 GPU/TPU 的使用、分佈式計算框架以及系統級的性能擴展策略。提供了針對特定領域數據的模型微調和應用人類反饋強化學習(RLHF)、模型量化和剪枝以提高效率的實用指導。書中針對生成式 AI 中的主要挑戰,如過擬合、偏見、幻覺和數據效率,通過經驗證的技術和新興的最佳實踐進行了探討。讀者還將深入了解模型的可解釋性和泛化能力,確保輸出結果的穩健性和可信度。這本書展示了如何在媒體、醫療保健、科學模擬和設計等領域構建可擴展的、準備投入生產的生成系統,並通過實際案例和應用案例研究進行說明。
到最後,讀者將了解如何在媒體創作、醫療保健、設計、科學模擬等多個領域架構、優化和應用生成模型。
你將學到的內容:
- 學習如何為特定用例選擇和實施生成模型——VAEs、GANs、transformers 和擴散模型。
- 精通訓練優化技術,如反向傳播、梯度下降、自適應學習率和正則化。
- 應用最佳實踐進行大規模訓練,使用 GPU、TPU 和分佈式計算框架來擴展性能。
- 通過量化、剪枝、微調和 RLHF 提高模型效率,以增強輸出質量並減少開銷。
本書適合的讀者:
希望在實際應用中構建和部署生成模型的 AI 工程師和機器學習從業者。從事涉及文本、視覺、音頻或多模態生成的深度學習項目的數據科學家。
作者簡介
Irena Cronin is SVP of Product for DADOS Technology, which is making an app for the Apple Vision Pro that does data analytics and visualization. She is also the CEO of Infinite Retina, which provides research to help companies develop and implement AI, AR and other new technologies for their businesses. Prior to this, she worked for several years as an equity research analyst and gained extensive experience in evaluating both public and private companies.
Irena has a joint MBA/MA from the University of Southern California and an MS with Distinction in Management and Systems from New York University. She graduated with a BA from the University of Pennsylvania with a major in Economics (summa cum laude). Irena Cronin is the author of the book "Understanding Generative AI Business Applications' published by Apress.
作者簡介(中文翻譯)
Irena Cronin 是 DADOS Technology 的產品高級副總裁,該公司正在為 Apple Vision Pro 開發一款進行數據分析和可視化的應用程式。她同時也是 Infinite Retina 的執行長,該公司提供研究以幫助企業開發和實施人工智慧(AI)、擴增實境(AR)及其他新技術。此之前,她曾擔任幾年的股票研究分析師,並在評估公共和私人公司方面積累了豐富的經驗。
Irena 擁有南加州大學的聯合 MBA/MA 學位,以及紐約大學的管理與系統碩士學位(優異成績)。她以優等榮譽(summa cum laude)從賓夕法尼亞大學畢業,主修經濟學。Irena Cronin 是 Apress 出版的《Understanding Generative AI Business Applications》一書的作者。