
dc.creator: Moutafis P., Mavrommatis G., Velentzas P. [en]
dc.date.accessioned: 2023-01-31T09:02:15Z
dc.date.available: 2023-01-31T09:02:15Z
dc.date.issued: 2020
dc.identifier: 10.1145/3437120.3437345
dc.identifier.isbn: 9781450388979
dc.identifier.uri: http://hdl.handle.net/11615/76819
dc.description.abstract: Given two datasets of points (called Query and Training), the Group (K) Nearest-Neighbor (GKNN) query retrieves the (K) points of the Training dataset with the smallest sum of distances to every point of the Query dataset. This spatial query has been studied in recent years, and several performance-improving techniques and pruning heuristics have been proposed. In a previous work, we presented the first MapReduce algorithm, consisting of alternating local and parallel phases, which can be used to effectively process the GKNN query when the Query dataset fits in memory while the Training dataset belongs to the Big Data category. In subsequent works, we presented several improvements on the first version of the algorithm. In this paper we present yet another improvement, which consists in prepartitioning the Training dataset. As shown in the experimentation section, this technique significantly reduces the data transfer and the total running time of the algorithm. Furthermore, the prepartitioning of the Training dataset is performed only once and can be reused with multiple Query datasets, leading to faster response times. © 2020 ACM. [en]
dc.language.iso: en [en]
dc.source: ACM International Conference Proceeding Series [en]
dc.source.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85102390004&doi=10.1145%2f3437120.3437345&partnerID=40&md5=0c5389084e02c8f8bfab4329d3dd34bf
dc.subject: Data transfer [en]
dc.subject: Data categories [en]
dc.subject: Group nearest neighbor queries [en]
dc.subject: Improving techniques [en]
dc.subject: Multiple queries [en]
dc.subject: Nearest neighbors [en]
dc.subject: Spatial queries [en]
dc.subject: Sum of distances [en]
dc.subject: Training dataset [en]
dc.subject: Nearest neighbor search [en]
dc.subject: Association for Computing Machinery [en]
dc.title: Prepartitioning in MapReduce Processing of Group Nearest-Neighbor Query [en]
dc.type: conferenceItem [en]
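
To make the query definition in the abstract concrete, here is a minimal brute-force sketch of a GKNN query in Python. It is not the MapReduce algorithm or the prepartitioning technique described in the paper. It only illustrates "the (K) Training points with the smallest sum of distances to every Query point" for small in-memory data. The function name gknn_brute_force, the use of Euclidean distance, and the sample points are all assumptions made for illustration.

```python
import heapq
import math


def gknn_brute_force(query, training, k):
    """Return the k training points with the smallest sum of
    Euclidean distances to all query points (illustrative only)."""

    def sum_of_distances(point):
        # Sum of distances from one training point to every query point.
        return sum(math.dist(point, q) for q in query)

    # nsmallest keeps the k best candidates, ordered by ascending score.
    return heapq.nsmallest(k, training, key=sum_of_distances)


# Hypothetical sample data: two query points and four training points.
query = [(0.0, 0.0), (2.0, 0.0)]
training = [(1.0, 0.0), (1.0, 1.0), (5.0, 5.0), (-3.0, 0.0)]

result = gknn_brute_force(query, training, 2)
print(result)  # [(1.0, 0.0), (1.0, 1.0)]
```

This sketch scans every training point for every query point, which is only practical for small inputs; the abstract targets the case where the Training dataset is too large for this approach.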


Files in this item

There are no files associated with this item.

This item appears in the following collection(s)
