Monitoring Distributed Data Streams

106
Следующее
Популярные
20 дней – 4023:15
Ludic Design for Accessibility
Опубликовано 6 сентября 2016, 16:36
Monitoring data streams in a distributed system is the focus of much research in recent years. Most of the proposed schemes, however, deal with monitoring simple aggregated values, such as the frequency of appearance of items in the streams. More involved challenges, such as the important task of feature selection (e.g., by monitoring the information gain of various features), still require very high communication overhead using naive, centralized algorithms. We present a novel geometric approach by which an arbitrary global monitoring task can be split into a set of  constraints applied locally on each of the streams. The constraints are used to locally filter out data increments that do not affect the monitoring outcome, thus avoiding unnecessary communication. As a result, our approach enables monitoring of arbitrary threshold functions over distributed data streams in an efficient manner. We present experimental results on real-world data which demonstrate that our algorithms are highly scalable, and reducing the communication load by orders of magnitude in comparison to centralized algorithms. Joint works with Tsachi Sharfman and Daniel Keren
автотехномузыкадетское