Domain-Specific Small Language Models: Efficient AI for Local Deployment
暫譯: 特定領域小型語言模型:本地部署的高效 AI

Iozzia, Guglielmo

  • 出版商: Manning
  • 出版日期: 2026-05-26
  • 售價: $2,360
  • 貴賓價: 9.8$2,312
  • 語言: 英文
  • 頁數: 376
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1633436705
  • ISBN-13: 9781633436701
  • 相關分類: Large language model
  • 海外代購書籍(需單獨結帳)

商品描述

Bigger isn't always better. Train and tune highly focused language models optimized for domain specific tasks.

When you need a language model to respond accurately and quickly about a specific field of knowledge, the sprawling capacity of a LLM may hurt more than it helps. Domain-Specific Small Language Models teaches you to build generative AI models optimized for specific fields.

In Domain-Specific Small Language Models you'll discover:

- Model sizing best practices
- Open source libraries, frameworks, utilities and runtimes
- Fine-tuning techniques for custom datasets
- Hugging Face's libraries for SLMs
- Running SLMs on commodity hardware
- Model optimization or quantization

Perfect for cost- or hardware-constrained environments, Small Language Models (SLMs) train on domain specific data for high-quality results in specific tasks. In Domain-Specific Small Language Models you'll develop SLMs that can generate everything from Python code to protein structures and antibody sequences--all on commodity hardware.

About the book

Domain-Specific Small Language Models teaches you how to create language models that deliver the power of LLMs for specific areas of knowledge. It provides a practical, application-focused counterpart to foundational texts like Sebastian Raschka's Build a Large Language Model (From Scratch), showing you how to adapt large-scale concepts for efficient, specialized use. You'll learn to minimize the computational horsepower your models require, while keeping high-quality performance times and output. You'll appreciate the clear explanations of complex technical concepts alongside working code samples you can run and replicate on your laptop. Plus, you'll learn to develop and deliver RAG systems and AI agents that rely solely on SLMs, and without the costs of foundation model access.

About the reader

For machine learning engineers familiar with Python.

About the author

Guglielmo Iozzia is a Director, ML/AI and Applied Mathematics at MSD. He studied Electronic and Biomedical Engineering at the University of Bologna, has an extensive background in Software and ML/AI Engineering applied to real-life use cases across different industries, such as Biotech Manufacturing, Healthcare, Cloud Operations, and Cyber Security.

Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.

商品描述(中文翻譯)

規模不一定代表更好。訓練和調整針對特定領域任務優化的高度專注語言模型。

當您需要一個語言模型能夠準確且快速地回應特定知識領域時,LLM 的龐大能力可能會適得其反。特定領域的小型語言模型教您如何構建針對特定領域優化的生成式 AI 模型。

特定領域的小型語言模型中,您將發現:

- 模型大小的最佳實踐
- 開源庫、框架、工具和運行時
- 自定義數據集的微調技術
- Hugging Face 的 SLM 庫
- 在普通硬體上運行 SLM
- 模型優化或量化

小型語言模型(SLMs)非常適合成本或硬體受限的環境,針對特定領域數據進行訓練,以在特定任務中獲得高質量的結果。在特定領域的小型語言模型中,您將開發能夠生成從 Python 代碼到蛋白質結構和抗體序列的 SLM,所有這些都可以在普通硬體上完成。

關於本書

特定領域的小型語言模型教您如何創建能夠為特定知識領域提供 LLM 功能的語言模型。它提供了一個實用的、以應用為重點的對應於基礎文本,如 Sebastian Raschka 的 從零開始構建大型語言模型,展示如何將大規模概念調整為高效、專業的使用。您將學會最小化模型所需的計算能力,同時保持高質量的性能時間和輸出。您會欣賞對複雜技術概念的清晰解釋,以及可以在您的筆記本電腦上運行和複製的工作代碼示例。此外,您將學會開發和交付僅依賴 SLM 的 RAG 系統和 AI 代理,而無需基礎模型訪問的成本。

關於讀者

適合熟悉 Python 的機器學習工程師。

關於作者

Guglielmo Iozzia 是 MSD 的 ML/AI 和應用數學總監。他在博洛尼亞大學學習電子和生物醫學工程,擁有廣泛的軟體和 ML/AI 工程背景,應用於生物技術製造、醫療保健、雲端運營和網路安全等不同產業的實際案例。

購買印刷書籍時,您將獲得 Manning 提供的免費電子書(PDF 或 ePub),以及訪問在線 liveBook 格式(及其 AI 助手,能夠用任何語言回答您的問題)。

作者簡介

Guglielmo Iozzia is a Director, ML/AI and Applied Mathematics at MSD. He studied Electronic and Biomedical Engineering at the University of Bologna, has an extensive background in Software and ML/AI Engineering applied to real-life use cases across different industries, such as Biotech Manufacturing, Healthcare, Cloud Operations, and Cyber Security.

作者簡介(中文翻譯)

Guglielmo Iozzia 是 MSD 的機器學習/人工智慧及應用數學總監。他在博洛尼亞大學學習電子與生物醫學工程,擁有廣泛的軟體及機器學習/人工智慧工程背景,並應用於生物科技製造、醫療保健、雲端運營和網路安全等不同產業的實際案例中。