    • Exploratory analysis of a terabyte scale web corpus 

      Kolias, V.; Anagnostopoulos, I.; Kayafas, E. (2014)
      In this paper we present a preliminary analysis over the largest publicly accessible web dataset: The Common Crawl Corpus. We measure nine web characteristics from two levels of granularity using MapReduce and we comment ...
    • Large-scale Data Exploration Using Explanatory Regression Functions 

      Savva F., Anagnostopoulos C., Triantafillou P., Kolomvatsos K. (2020)
      Analysts wishing to explore multivariate data spaces typically issue queries involving selection operators, i.e., range or equality predicates, which define data subspaces of potential interest. Then, they use aggregation ...