Mostra i principali dati dell'item

dc.creatorTziouvaras A., Dimitriou G., Foukalas F., Stamoulis G.en
dc.date.accessioned2023-01-31T10:22:34Z
dc.date.available2023-01-31T10:22:34Z
dc.date.issued2020
dc.identifier10.1145/3437120.3437288
dc.identifier.isbn9781450388979
dc.identifier.urihttp://hdl.handle.net/11615/80272
dc.description.abstractModern processor architectures face a throughput scaling problem as the performance bottleneck shifts from the core pipeline to the data transfer operations between the dynamic random access memory (DRAM) and the processor chip. To address such issue researchers have proposed the near-data processing (NDP) paradigm in which the instruction execution is moved to the DRAM die thus, lowering the data movement between the processor and the DRAM. Previous NDP works focus on specific application types and thus the general purpose application execution paradigm is neglected. In this work we propose an NDP methodology for low power general purpose loop acceleration. For this reason we design and implement a hardware loop accelerator from the ground up to improve the throughput and lower the power consumption of general purpose loops. We adopt a novel loop scheduling approach which enables the loop accelerator to take advantage of the dataflow parallelism of the executing loop and we implement our design on the logic layer of a hybrid memory cube (HMC) DRAM. Post-layout simulations demonstrate an average speedup factor of 20.5x when executing kernels from various scientific fields while the energy consumption is reduced by a factor of 9.3x over the host CPU execution. © 2020 ACM.en
dc.language.isoenen
dc.sourceACM International Conference Proceeding Seriesen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85102379127&doi=10.1145%2f3437120.3437288&partnerID=40&md5=bebee6fa4c3ac76e968d819e94806d60
dc.subjectData handlingen
dc.subjectData transferen
dc.subjectDynamic random access storageen
dc.subjectEnergy utilizationen
dc.subjectIntegrated circuit designen
dc.subjectApplication executionen
dc.subjectDesign and implementsen
dc.subjectDynamic random access memoryen
dc.subjectLoop accelerationen
dc.subjectLoop acceleratorsen
dc.subjectModern processorsen
dc.subjectPerformance bottlenecksen
dc.subjectPost layout simulationen
dc.subjectPipeline processing systemsen
dc.subjectAssociation for Computing Machineryen
dc.titleLow power general purpose loop acceleration for NDP applicationsen
dc.typeconferenceItemen


Files in questo item

FilesDimensioneFormatoMostra

Nessun files in questo item.

Questo item appare nelle seguenti collezioni

Mostra i principali dati dell'item