Pentaho Data Integration Cookbook, 2/e (Paperback)
Alex Meadows, Adrián Sergio Pulvirenti, María Carina Roldán
- Publisher: Packt Publishing
- Publication date: 2013-09-10
- Price: $2,170
- VIP price: 5% off, $2,062
- Language: English
- Pages: 462
- Binding: Paperback
- ISBN: 1783280670
- ISBN-13: 9781783280674
Overseas special-order title (requires separate checkout)
Product description
The premier open source ETL tool is at your command with this recipe-packed cookbook. Learn to use data sources in Kettle, avoid pitfalls, and dig out the advanced features of Pentaho Data Integration the easy way.
Overview
- Integrate Kettle with other components of the Pentaho Business Intelligence Suite to build and publish Mondrian schemas, create reports, and populate dashboards
- This book contains an organized sequence of recipes packed with screenshots, tables, and tips so you can complete the tasks as efficiently as possible
- Manipulate your data by exploring, transforming, validating, integrating, and performing data analysis
In Detail
Pentaho Data Integration is the premier open source ETL tool, providing easy, fast, and effective ways to move and transform data. While PDI is relatively easy to pick up, it can take time to learn the best practices so you can design your transformations to process data faster and more efficiently. If you are looking for clear and practical recipes that will advance your skills in Kettle, then this is the book for you.
Pentaho Data Integration Cookbook Second Edition explains the Kettle features in detail and provides easy-to-follow recipes on file management and databases, topics that can throw a curve ball to even the most experienced developers.
Pentaho Data Integration Cookbook Second Edition provides updates to the material covered in the first edition as well as new recipes that show you how to use some of the key features of PDI that have been released since the publication of the first edition. You will learn how to work with various data sources, including relational and NoSQL databases, flat files, XML files, and more. The book also covers best practices that you can take advantage of immediately within your own solutions, such as building reusable code, ensuring data quality, and using plugins that add even more functionality.
Pentaho Data Integration Cookbook Second Edition provides recipes that cover the common pitfalls that even seasoned developers can find themselves facing. You will also learn how to use various data sources in Kettle as well as its advanced features.
What you will learn from this book
- Configure Kettle to connect to relational and NoSQL databases and web applications like SalesForce, explore them, and perform CRUD operations
- Utilize plugins to get even more functionality into your Kettle jobs
- Embed Java code in your transformations to gain performance and flexibility
- Execute and reuse transformations and jobs in different ways
- Integrate Kettle with Pentaho Reporting, Pentaho Dashboards, Community Data Access, and the Pentaho BI Platform
- Interface Kettle with cloud-based applications
- Learn how to control and manipulate data flows
- Utilize Kettle to create datasets for analytics
Approach
Pentaho Data Integration Cookbook Second Edition is written in a cookbook format, presenting examples in the style of recipes. This allows you to go directly to your topic of interest, or to follow topics throughout a chapter to gain thorough, in-depth knowledge.
Who this book is written for
Pentaho Data Integration Cookbook Second Edition is designed for developers who are familiar with the basics of Kettle but who wish to move up to the next level. It is also aimed at advanced users who want to learn how to use the new features of PDI as well as best practices for working with Kettle.