Incremental learning for large scale data stream analytics in a complex environment
暫譯: 複雜環境中大規模資料流分析的增量學習
Za'in, Choiru
- 出版商: Routledge
- 出版日期: 2023-04-16
- 售價: $2,550
- 貴賓價: 9.5 折 $2,423
- 語言: 英文
- 頁數: 356
- 裝訂: Quality Paper - also called trade paper
- ISBN: 1805263242
- ISBN-13: 9781805263241
海外代購書籍(需單獨結帳)
相關主題
商品描述
Nowadays, in the era of the Internet of Things (IoT), data are generated from devices/sensors in the form of text, images, videos, etc. Data from sensors arrive continuously from multiple sources, different environments as data streams [1]. As a result, vast volumes of data can be generated in the cloud over time. Furthermore, data streams are also characterized by non-stationary environments that come from real-world applications [2]. Due to the high business demands, these enormous volumes of data streams need to be learned immediately as they arrive for decision-making purposes [3]. While large-scale data streams have a high potential to improve effective decision making, learning from this data is challenging. Based on [4], a general problem large-scale/big data of big data includes 5V (Variety, Velocity, Volume, Value, and Veracity) characteristics. In machine learning literature, of these five characteristics, there are at least two main issues in learning from data streams which trigger further investigation: huge volumes and velocity. Another issue of data streams is the nonstationary characteristics of data streams [5]. The vast volumes and velocity characteristics causes the generation of big data due to the high speed of arrival data streams. In contrast, the non-stationary characteristic is related to the changing of data distribution over time. These challenges provide excellent opportunities for many research directions.
商品描述(中文翻譯)
在當今物聯網(IoT)時代,數據以文本、圖像、視頻等形式從設備/傳感器中生成。來自多個來源和不同環境的傳感器數據持續不斷地以數據流的形式到達[1]。因此,隨著時間的推移,雲端中可以生成大量的數據。此外,數據流的特徵還包括來自現實世界應用的非穩定環境[2]。由於商業需求的高漲,這些龐大的數據流需要在到達時立即進行學習,以便用於決策[3]。雖然大規模數據流具有改善有效決策的高潛力,但從這些數據中學習卻是具有挑戰性的。根據[4],大數據的一般問題包括5V(多樣性、速度、體量、價值和真實性)特徵。在機器學習文獻中,這五個特徵中至少有兩個主要問題在於從數據流中學習,這引發了進一步的研究:龐大的數據量和速度。數據流的另一個問題是數據流的非穩定特徵[5]。龐大的數據量和速度特徵導致由於數據流到達速度快而產生大數據。相對而言,非穩定特徵與數據分佈隨時間變化有關。這些挑戰為許多研究方向提供了良好的機會。