Beginning Spark
暫譯: 開始使用 Spark

Madhukara Phatak

  • 出版商: Apress
  • 出版日期: 2016-05-08
  • 售價: $1,700
  • 貴賓價: 9.5$1,615
  • 語言: 英文
  • 頁數: 300
  • 裝訂: Paperback
  • ISBN: 1484213092
  • ISBN-13: 9781484213094
  • 相關分類: Spark
  • 海外代購書籍(需單獨結帳)

商品描述

Take a deep dive into Apache Spark and the big data ecosystem. You will acquire an understanding of the next generation of distribution systems, Apache Spark architecture and abstraction, and the Spark ecosystem including Spark SQL, GraphX and MLlib. Beginning Spark provides a practical guide for using Apache Spark in real-world data processing. The author discusses and illustrates how different concepts of Spark are brought together in order to solve complex issues with a data flow system.

With the rise in popularity of distributed systems like Hadoop, more and more people are working in big data processing. A growing number of companies want to build dataflow systems, which can churn huge amounts of data to gain insights for their business. Since Hadoop was a first generation, open source distributed system, there is a need for a next generation distributed system to take data processing to next level. Apache Spark is the next step in that direction. Spark brings a great flexibility and compositional system to the big data world by revolutionizing the field itself. 

商品描述(中文翻譯)

深入探討 Apache Spark 及其大數據生態系統。您將了解下一代分散式系統、Apache Spark 架構與抽象,以及 Spark 生態系統,包括 Spark SQL、GraphX 和 MLlib。《Beginning Spark》提供了一個實用指南,幫助您在現實世界的數據處理中使用 Apache Spark。作者討論並說明了如何將 Spark 的不同概念結合在一起,以解決數據流系統中的複雜問題。

隨著像 Hadoop 這樣的分散式系統日益受到歡迎,越來越多的人投身於大數據處理。越來越多的公司希望建立數據流系統,這些系統能夠處理大量數據,以獲取商業洞察。由於 Hadoop 是第一代開源分散式系統,因此需要一個下一代的分散式系統來將數據處理提升到新的水平。Apache Spark 是朝這個方向邁出的下一步。Spark 通過徹底改變這個領域,為大數據世界帶來了極大的靈活性和組合性系統。