Zur Kurzanzeige

dc.creatorMeimaris M., Papastefanatos G., Vassiliadis P., Anagnostopoulos I.en
dc.date.accessioned2023-01-31T08:58:35Z
dc.date.available2023-01-31T08:58:35Z
dc.date.issued2018
dc.identifier10.1016/j.is.2018.02.010
dc.identifier.issn03064379
dc.identifier.urihttp://hdl.handle.net/11615/76489
dc.description.abstractThe increasing availability of diverse multidimensional data on the web has led to the creation and adoption of common vocabularies and practices that facilitate sharing, aggregating and reusing data from remote origins. One prominent example in the Web of Data is the RDF Data Cube vocabulary, which has recently attracted great attention from the industrial, government and academic sectors as the de facto representational model for publishing open multidimensional data. As a result, different datasets share terms from common code lists and hierarchies, this way creating an implicit relatedness between independent sources. Identifying and analyzing relationships between disparate data sources is a major prerequisite for enabling traditional business analytics at the web scale. However, discovery of instance-level relationships between datasets becomes a computationally costly procedure, as typically all pairs of records must be compared. In this paper, we define three types of relationships between multidimensional observations, namely full containment, partial containment and complementarity, and we propose four methods for efficient and scalable computation of these relationships. We conduct an extensive experimental evaluation over both real and synthetic datasets, comparing with traditional query-based and inference-based alternatives, and we show how our methods provide efficient and scalable solutions. © 2018 Elsevier Ltden
dc.language.isoenen
dc.sourceInformation Systemsen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85042927895&doi=10.1016%2fj.is.2018.02.010&partnerID=40&md5=b330371145b4dc9518ebfa90f9314e28
dc.subjectHardwareen
dc.subjectInformation systemsen
dc.subjectLatexesen
dc.subjectBusiness analyticsen
dc.subjectelsarticle.clsen
dc.subjectElsevieren
dc.subjectExperimental evaluationen
dc.subjectIndependent sourcesen
dc.subjectMultidimensional dataen
dc.subjectMultidimensional observationen
dc.subjectTemplateen
dc.subjectComputational methodsen
dc.subjectElsevier Ltden
dc.titleComputational methods and optimizations for containment and complementarity in web data cubesen
dc.typejournalArticleen


Dateien zu dieser Ressource

DateienGrößeFormatAnzeige

Zu diesem Dokument gibt es keine Dateien.

Das Dokument erscheint in:

Zur Kurzanzeige