Architecting a Modern Data Warehouse for Large Enterprises: Build Multi-Cloud Modern Distributed Data Warehouses with Azure and AWS
Anjani Kumar, Abhishek Mishra, Sanjeev Kumar
- Publisher: Apress
- Publication date: 2023-12-28
- Price: $2,050
- VIP price: 5% off, $1,948
- Language: English
- Pages: 266
- Binding: Quality paper (also called trade paper)
- ISBN-13: 9798868800283
- Related categories: Amazon Web Services, Microsoft Azure
- Overseas special-order title (requires separate checkout)
Product Description
Design and architect new generation cloud-based data warehouses using Azure and AWS. This book provides an in-depth understanding of how to build modern cloud-native data warehouses, as well as their history and evolution.
The book starts by covering foundational data warehouse concepts and introduces modern features such as distributed processing, big data storage, data streaming, and processing data on the cloud. You will gain an understanding of the synergy, relevance, and usage of standard data warehousing practices in the modern world of distributed data processing. The authors walk you through the essential concepts of Data Mesh, Data Lake, Lakehouse, and Delta Lake. They also demonstrate the services and offerings available on Azure and AWS that deal with data orchestration, data democratization, data governance, data security, and business intelligence.
After completing this book, you will be ready to design and architect enterprise-grade, cloud-based modern data warehouses using industry best practices and guidelines.
What You Will Learn
- Understand the core concepts underlying modern data warehouses
- Design and build cloud-native data warehouses
- Gain a practical approach to architecting and building data warehouses on Azure and AWS
- Implement modern data warehousing components such as Data Mesh, Data Lake, Delta Lake, and Lakehouse
- Process data through pandas and evaluate your model's performance using metrics such as F1-score, precision, and recall
- Apply deep learning to supervised, semi-supervised, and unsupervised anomaly detection tasks for tabular datasets and time series applications
Who This Book Is For
Experienced developers, cloud architects, and technology enthusiasts looking to build cloud-based modern data warehouses using Azure and AWS
About the Authors
Anjani Kumar is the Managing Director and Founder of MultiCloud4u, a rapidly growing startup that helps clients and partners seamlessly implement data-driven solutions for their digital businesses. With a background in computer science, Anjani began his career researching and developing multi-lingual systems that were powered by distributed processing and data synchronization across remote regions of India. He later collaborated with companies such as Mahindra Satyam, Microsoft, RBS, and Sapient to create data warehouses and other data-based systems that could handle high-volume data processing and transformation.
Abhishek Mishra is a Cloud Architect at a leading organization and has more than a decade and a half of experience building and architecting software solutions for large and complex enterprises across the globe. He has deep expertise in enabling digital transformations for his customers using the cloud and artificial intelligence.
Sanjeev Kumar heads a global data and analytics practice at the leading and oldest multinational shoe company, headquartered in Switzerland. He has 19+ years of experience modeling modern data solutions for organizations in multiple industries. He has consulted with some of the top multinational firms and enabled digital transformation for large enterprises using modern data warehouses in the cloud. He is an expert in multiple fields of modern data management and execution, including data strategy, automation, data governance, architecture, metadata, modeling, business intelligence, data management, and analytics.