Mostrar el registro sencillo del ítem
Online clustering of distributed streaming data using belief propagation techniques
dc.creator | Halkidi, M. | en |
dc.creator | Koutsopoulos, I. | en |
dc.date.accessioned | 2015-11-23T10:29:31Z | |
dc.date.available | 2015-11-23T10:29:31Z | |
dc.date.issued | 2011 | |
dc.identifier | 10.1109/MDM.2011.63 | |
dc.identifier.isbn | 9780769544366 | |
dc.identifier.issn | 15516245 | |
dc.identifier.uri | http://hdl.handle.net/11615/28323 | |
dc.description.abstract | Extraction of patterns out of streaming data that are generated from geographically dispersed devices is a major challenge in data mining. The sequential, distributed fashion in which data become available to the decision maker, together with the fact that the decision maker needs to rely only on recently received data due to storage and communication constraints, render the objective of keeping track of data evolution a nontrivial one. We consider a set of distributed nodes that communicate directly with a central location. We address the problem of clustering distributed streaming data through a two-level clustering approach. We adopt belief propagation techniques to perform stream clustering at both levels. At the node level, a batch of data arrives at each time slot, and the goal is to maintain a set of salient data (local exemplars) at each time slot, which best represents the data received up to that slot. At each epoch, the local exemplars from distributed nodes are sent to the central location, which in turn performs a second-level clustering on them to derive a data synopsis global for the whole system. The local exemplars that emerge from the second level clustering procedure are fed back to the nodes with appropriately modified weights which reflect their importance in global clustering. As demonstrated by our experiments, the two-level belief propagation-based clustering approach together with the feedback is ideal for handling data from different nodes, as it has the same performance in terms of clustering quality with the case where the clustering is performed on the raw data sent from nodes to the central location. © 2011 IEEE. | en |
dc.source.uri | http://www.scopus.com/inward/record.url?eid=2-s2.0-82055165077&partnerID=40&md5=96d1b50f96a8708a50f9e3747e5a96dc | |
dc.subject | Belief propagation | en |
dc.subject | Clustering approach | en |
dc.subject | Clustering procedure | en |
dc.subject | Clustering quality | en |
dc.subject | Communication constraints | en |
dc.subject | Data evolution | en |
dc.subject | Data synopsis | en |
dc.subject | Decision makers | en |
dc.subject | Distributed nodes | en |
dc.subject | Distributed streaming | en |
dc.subject | Global clustering | en |
dc.subject | Online-clustering | en |
dc.subject | Second level | en |
dc.subject | Streaming data | en |
dc.subject | Time slots | en |
dc.subject | Decision making | en |
dc.subject | Information management | en |
dc.subject | Data handling | en |
dc.title | Online clustering of distributed streaming data using belief propagation techniques | en |
dc.type | conferenceItem | en |
Ficheros en el ítem
Ficheros | Tamaño | Formato | Ver |
---|---|---|---|
No hay ficheros asociados a este ítem. |