Mostra i principali dati dell'item

dc.creatorSavva F., Anagnostopoulos C., Triantafillou P., Kolomvatsos K.en
dc.date.accessioned2023-01-31T09:54:22Z
dc.date.available2023-01-31T09:54:22Z
dc.date.issued2020
dc.identifier10.1145/3410448
dc.identifier.issn15564681
dc.identifier.urihttp://hdl.handle.net/11615/78829
dc.description.abstractAnalysts wishing to explore multivariate data spaces, typically issue queries involving selection operators, i.e., range or equality predicates, which define data subspaces of potential interest. Then, they use aggregation functions, the results of which determine a subspace's interestingness for further exploration and deeper analysis. However, Aggregate Query (AQ) results are scalars and convey limited information and explainability about the queried subspaces for enhanced exploratory analysis. Analysts have no way of identifying how these results are derived or how they change w.r.t query (input) parameter values. We address this shortcoming by aiding analysts to explore and understand data subspaces by contributing a novel explanation mechanism based on machine learning. We explain AQ results using functions obtained by a three-fold joint optimization problem which assume the form of explainable piecewise-linear regression functions. A key feature of the proposed solution is that the explanation functions are estimated using past executed queries. These queries provide a coarse grained overview of the underlying aggregate function (generating the AQ results) to be learned. Explanations for future, previously unseen AQs can be computed without accessing the underlying data and can be used to further explore the queried data subspaces, without issuing more queries to the backend analytics engine. We evaluate the explanation accuracy and efficiency through theoretically grounded metrics over real-world and synthetic datasets and query workloads. © 2020 ACM.en
dc.language.isoenen
dc.sourceACM Transactions on Knowledge Discovery from Dataen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85092744633&doi=10.1145%2f3410448&partnerID=40&md5=e07b8925576d5f10fab8ae31912f7baf
dc.subjectPiecewise linear techniquesen
dc.subjectAggregate functionen
dc.subjectAggregation functionsen
dc.subjectExploratory analysisen
dc.subjectJoint optimizationen
dc.subjectLimited informationen
dc.subjectPiecewise linear regressionen
dc.subjectRegression functionen
dc.subjectSelection operatorsen
dc.subjectAggregatesen
dc.subjectAssociation for Computing Machineryen
dc.titleLarge-scale Data Exploration Using Explanatory Regression Functionsen
dc.typejournalArticleen


Files in questo item

FilesDimensioneFormatoMostra

Nessun files in questo item.

Questo item appare nelle seguenti collezioni

Mostra i principali dati dell'item