Delta Lake: The Definitive Guide: Modern Data Lakehouse Architectures with Data Lakes (Paperback)
Denny Lee, Tristen Wentling, Scott Haines
- Publisher: O'Reilly
- Publication date: 2024-12-10
- List price: $2,700
- Sale price: 5% off, $2,565
- VIP price: 10% off, $2,430
- Language: English
- Pages: 380
- Binding: Quality Paper (also called trade paper)
- ISBN: 1098151941
- ISBN-13: 9781098151942
- Related category: Data-mining
Ships immediately (1 in stock)
Product description
Ready to simplify the process of building data lakehouses and data pipelines at scale? In this practical guide, learn how Delta Lake is helping data engineers, data scientists, and data analysts overcome key data reliability challenges with modern data engineering and management techniques.
Authors Denny Lee, Tristen Wentling, Scott Haines, and Prashanth Babu (with contributions from Delta Lake maintainer R. Tyler Croy) share expert insights on all things Delta Lake--including how to run batch and streaming jobs concurrently and accelerate the usability of your data. You'll also uncover how ACID transactions bring reliability to data lakehouses at scale.
This book helps you:
- Understand key data reliability challenges and how Delta Lake solves them
- Explain the critical role of Delta transaction logs as a single source of truth
- Learn the Delta Lake ecosystem with technologies like Apache Flink, Kafka, and Trino
- Architect data lakehouses with the medallion architecture
- Optimize Delta Lake performance with features like deletion vectors and liquid clustering