Εμφάνιση απλής εγγραφής

dc.creatorTziouvaras A., Dimitriou G., Foukalas F., Stamoulis G.en
dc.date.accessioned2023-01-31T10:22:34Z
dc.date.available2023-01-31T10:22:34Z
dc.date.issued2020
dc.identifier10.1145/3437120.3437288
dc.identifier.isbn9781450388979
dc.identifier.urihttp://hdl.handle.net/11615/80272
dc.description.abstractModern processor architectures face a throughput scaling problem as the performance bottleneck shifts from the core pipeline to the data transfer operations between the dynamic random access memory (DRAM) and the processor chip. To address such issue researchers have proposed the near-data processing (NDP) paradigm in which the instruction execution is moved to the DRAM die thus, lowering the data movement between the processor and the DRAM. Previous NDP works focus on specific application types and thus the general purpose application execution paradigm is neglected. In this work we propose an NDP methodology for low power general purpose loop acceleration. For this reason we design and implement a hardware loop accelerator from the ground up to improve the throughput and lower the power consumption of general purpose loops. We adopt a novel loop scheduling approach which enables the loop accelerator to take advantage of the dataflow parallelism of the executing loop and we implement our design on the logic layer of a hybrid memory cube (HMC) DRAM. Post-layout simulations demonstrate an average speedup factor of 20.5x when executing kernels from various scientific fields while the energy consumption is reduced by a factor of 9.3x over the host CPU execution. © 2020 ACM.en
dc.language.isoenen
dc.sourceACM International Conference Proceeding Seriesen
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85102379127&doi=10.1145%2f3437120.3437288&partnerID=40&md5=bebee6fa4c3ac76e968d819e94806d60
dc.subjectData handlingen
dc.subjectData transferen
dc.subjectDynamic random access storageen
dc.subjectEnergy utilizationen
dc.subjectIntegrated circuit designen
dc.subjectApplication executionen
dc.subjectDesign and implementsen
dc.subjectDynamic random access memoryen
dc.subjectLoop accelerationen
dc.subjectLoop acceleratorsen
dc.subjectModern processorsen
dc.subjectPerformance bottlenecksen
dc.subjectPost layout simulationen
dc.subjectPipeline processing systemsen
dc.subjectAssociation for Computing Machineryen
dc.titleLow power general purpose loop acceleration for NDP applicationsen
dc.typeconferenceItemen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής