Εμφάνιση απλής εγγραφής

dc.creatorGarcía-García F., Corral A., Iribarne L., Mavrommatis G., Vassilakopoulos M.en
dc.date.accessioned2023-01-31T07:39:42Z
dc.date.available2023-01-31T07:39:42Z
dc.date.issued2017
dc.identifier10.1007/978-3-319-66917-5_15
dc.identifier.isbn9783319669168
dc.identifier.issn03029743
dc.identifier.urihttp://hdl.handle.net/11615/71956
dc.description.abstractDue to the ubiquitous use of spatial data applications and the large amounts of spatial data that these applications generate, the processing of large-scale distance joins in distributed systems is becoming increasingly popular. Two of the most studied distance join queries are the K Closest Pair Query (KCPQ) and the ε Distance Join Query (ε DJQ). The KCPQ finds the K closest pairs of points from two datasets and the ε DJQ finds all the possible pairs of points from two datasets, that are within a distance threshold ε of each other. Distributed cluster-based computing systems can be classified in Hadoop-based and Spark-based systems. Based on this classification, in this paper, we compare two of the most current and leading distributed spatial data management systems, namely SpatialHadoop and LocationSpark, by evaluating the performance of existing and newly proposed parallel and distributed distance join query algorithms in different situations with big real-world datasets. As a general conclusion, while SpatialHadoop is more mature and robust system, LocationSpark is the winner with respect to the total execution time. © 2017, Springer International Publishing AG.en
dc.language.isoenen
dc.sourceLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85030150182&doi=10.1007%2f978-3-319-66917-5_15&partnerID=40&md5=055168a0a54b57c8773da9b7f5b88c0c
dc.subjectClassification (of information)en
dc.subjectCluster computingen
dc.subjectData handlingen
dc.subjectDatabase systemsen
dc.subjectDistributed computer systemsen
dc.subjectInformation managementen
dc.subjectInformation systemsen
dc.subjectLocationen
dc.subjectManagement information systemsen
dc.subjectQuery languagesen
dc.subjectSearch enginesen
dc.subjectComputing systemen
dc.subjectDistributed clustersen
dc.subjectDistributed systemsen
dc.subjectLocationSparken
dc.subjectReal-world datasetsen
dc.subjectSpatial data managementen
dc.subjectSpatial data processingen
dc.subjectSpatialHadoopen
dc.subjectSpatial distributionen
dc.subjectSpringer Verlagen
dc.titleA comparison of distributed spatial data management systems for processing distance join queriesen
dc.typeconferenceItemen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής