Hadoop MapReduce Performance on SSDs for Analyzing Social Networks
dc.creator | Bakratsas M., Basaras P., Katsaros D., Tassiulas L. | en |
dc.date.accessioned | 2023-01-31T07:35:22Z | |
dc.date.available | 2023-01-31T07:35:22Z | |
dc.date.issued | 2018 | |
dc.identifier | 10.1016/j.bdr.2017.06.001 | |
dc.identifier.issn | 22145796 | |
dc.identifier.uri | http://hdl.handle.net/11615/71066 | |
dc.description.abstract | The advent of Solid State Drives (SSDs) stimulated a lot of research to investigate and exploit to the extent possible the potentials of the new drive. The focus of this work is on the investigation of the relative performance and benefits of SSDs versus hard disk drives (HDDs) when they are used as underlying storage for Hadoop's MapReduce. In particular, we depart from all earlier relevant works in that we do not use their workloads, but examine MapReduce tasks and data suitable for performing analysis of complex networks which present different execution patterns. Despite the plethora of algorithms and implementations for complex network analysis, we carefully selected our “benchmarking methods” so that they include methods that perform both local and network-wide operations in a complex network, and also they are generic enough in the sense that they can be used as primitives for more sophisticated network processing applications. We evaluated the performance of SSDs and HDDs by executing these algorithms on real social network data and excluding the effects of network bandwidth which can severely bias the results. The obtained results confirmed in part earlier studies which showed that SSDs are beneficial to Hadoop. However, we also provided solid evidence that the processing pattern of the running application has a significant role, and thus future studies must not blindly add SSDs to Hadoop, but they should build components for assessing the type of processing pattern of the application and then direct the data to the appropriate storage medium. © 2017 Elsevier Inc. | en |
dc.language.iso | en | en |
dc.source | Big Data Research | en |
dc.source.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85025140606&doi=10.1016%2fj.bdr.2017.06.001&partnerID=40&md5=1d7a0de0a1ec60a9163fd3370abb745b | |
dc.subject | Benchmarking | en |
dc.subject | Complex networks | en |
dc.subject | Hard disk storage | en |
dc.subject | Social networking (online) | en |
dc.subject | Algorithm and implementation | en |
dc.subject | Hadoop | en |
dc.subject | Hadoop MapReduce | en |
dc.subject | Hard Disk Drive | en |
dc.subject | Magnetic disk | en |
dc.subject | Map-reduce | en |
dc.subject | Performance | en |
dc.subject | Relative performance | en |
dc.subject | Social network | en |
dc.subject | Solid state disks | en |
dc.subject | MapReduce | en |
dc.subject | Elsevier Inc. | en |
dc.title | Hadoop MapReduce Performance on SSDs for Analyzing Social Networks | en |
dc.type | journalArticle | en |