Εμφάνιση απλής εγγραφής

dc.creatorKrisilias A., Provatas N., Koziris N., Konstantinou I.en
dc.date.accessioned2023-01-31T08:47:11Z
dc.date.available2023-01-31T08:47:11Z
dc.date.issued2021
dc.identifier10.1109/BigData52589.2021.9671461
dc.identifier.isbn9781665439022
dc.identifier.urihttp://hdl.handle.net/11615/75519
dc.description.abstractOver the recent years, deep learning is widely being used in a variety of different fields and applications. The constant growth of data used to train complex models, has opened research in the distributed learning. In this domain, two main architectures are used to train models in a distribution fashion, all-reduce and parameter server. Both support synchronous learning, while parameter server also supports asynchronous learning. These architectures are adopted by tech companies, which have developed multiple systems for this purpose. Among the most popular and widely used distributed deep learning systems are Google TensorFlow, Facebook PyTorch and Apache MXNet. In this paper, we quantify the performance gap between these systems and present a detailed analysis to discuss the parameters that affect their execution time. Overall, in synchronous learning setups, TensorFlow is slower compared to PyTorch by average 2.65X, while the latter lags MXNet by average 1.38X. Regarding asynchronous learning, MXNet is faster by average 3.22X in respect with TensorFlow. © 2021 IEEE.en
dc.language.isoenen
dc.sourceProceedings - 2021 IEEE International Conference on Big Data, Big Data 2021en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85125328524&doi=10.1109%2fBigData52589.2021.9671461&partnerID=40&md5=676a2f735c964bb8342a92fc2191a5e4
dc.subjectDeep learningen
dc.subjectImage classificationen
dc.subjectApache MXNeten
dc.subjectAsynchronous learningen
dc.subjectDistributed deep learningen
dc.subjectGoogle tensorflowen
dc.subjectGoogle+en
dc.subjectImages classificationen
dc.subjectLearning frameworksen
dc.subjectPerformances evaluationen
dc.subjectPytorchen
dc.subjectSynchronous learningen
dc.subjectBenchmarkingen
dc.subjectInstitute of Electrical and Electronics Engineers Inc.en
dc.titleA Performance Evaluation of Distributed Deep Learning Frameworks on CPU Clusters Using Image Classification Workloadsen
dc.typeconferenceItemen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής