dc.creator | Nikitas N., Konstantinou I., Kalogeraki V., Koziris N. | en |
dc.date.accessioned | 2023-01-31T09:40:13Z | |
dc.date.available | 2023-01-31T09:40:13Z | |
dc.date.issued | 2021 | |
dc.identifier | 10.1109/BigData52589.2021.9671899 | |
dc.identifier.isbn | 9781665439022 | |
dc.identifier.uri | http://hdl.handle.net/11615/77187 | |
dc.description.abstract | While there has been a lot of effort in recent years in optimising Big Data systems like Apache Spark and Hadoop, the all-to-all transfer of data between a MapReduce computation step, i.e., the shuffle data mechanism between cluster nodes remains always a serious bottleneck. In this work, we present Cherry, an open-source distributed task-aware Caching sHuffle sErvice for seRveRless analYtics. Our thorough experiments on a cloud testbed using realistic and synthetic workloads showcase that Cherry can achieve an almost 23% to 39% reduction in completion of the reduce stage with small shuffle block sizes, a 10% reduction in execution time on real workloads, while it can efficiently handle Spark execution failures with a constant task time re-computation overhead compared to existing approaches. © 2021 IEEE. | en |
dc.language.iso | en | en |
dc.source | Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021 | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85125342604&doi=10.1109%2fBigData52589.2021.9671899&partnerID=40&md5=48ef089b92c1d923b8c4de80317ab11d | |
dc.subject | Advanced Analytics | en |
dc.subject | Big data | en |
dc.subject | Cluster computing | en |
dc.subject | Data transfer | en |
dc.subject | % reductions | en |
dc.subject | All to alls | en |
dc.subject | Big data analytic framework | en |
dc.subject | Cloud-computing | en |
dc.subject | Data systems | en |
dc.subject | Distributed systems | en |
dc.subject | Distributed tasks | en |
dc.subject | Map-reduce | en |
dc.subject | Serverless architecture | en |
dc.subject | Task-aware | en |
dc.subject | Data Analytics | en |
dc.subject | Institute of Electrical and Electronics Engineers Inc. | en |
dc.title | Cherry: A Distributed Task-Aware Shuffle Service for Serverless Analytics | en |
dc.type | conferenceItem | en |