A Performance Evaluation of Distributed Deep Learning Frameworks on CPU Clusters Using Image Classification Workloads

Krisilias A., Provatas N., Koziris N., Konstantinou I.

dc.creator	Krisilias A., Provatas N., Koziris N., Konstantinou I.	en
dc.date.accessioned	2023-01-31T08:47:11Z
dc.date.available	2023-01-31T08:47:11Z
dc.date.issued	2021
dc.identifier	10.1109/BigData52589.2021.9671461
dc.identifier.isbn	9781665439022
dc.identifier.uri	http://hdl.handle.net/11615/75519
dc.description.abstract	Over the recent years, deep learning is widely being used in a variety of different fields and applications. The constant growth of data used to train complex models, has opened research in the distributed learning. In this domain, two main architectures are used to train models in a distribution fashion, all-reduce and parameter server. Both support synchronous learning, while parameter server also supports asynchronous learning. These architectures are adopted by tech companies, which have developed multiple systems for this purpose. Among the most popular and widely used distributed deep learning systems are Google TensorFlow, Facebook PyTorch and Apache MXNet. In this paper, we quantify the performance gap between these systems and present a detailed analysis to discuss the parameters that affect their execution time. Overall, in synchronous learning setups, TensorFlow is slower compared to PyTorch by average 2.65X, while the latter lags MXNet by average 1.38X. Regarding asynchronous learning, MXNet is faster by average 3.22X in respect with TensorFlow. © 2021 IEEE.	en
dc.language.iso	en	en
dc.source	Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021	en
dc.source.uri	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85125328524&doi=10.1109%2fBigData52589.2021.9671461&partnerID=40&md5=676a2f735c964bb8342a92fc2191a5e4
dc.subject	Deep learning	en
dc.subject	Image classification	en
dc.subject	Apache MXNet	en
dc.subject	Asynchronous learning	en
dc.subject	Distributed deep learning	en
dc.subject	Google tensorflow	en
dc.subject	Google+	en
dc.subject	Images classification	en
dc.subject	Learning frameworks	en
dc.subject	Performances evaluation	en
dc.subject	Pytorch	en
dc.subject	Synchronous learning	en
dc.subject	Benchmarking	en
dc.subject	Institute of Electrical and Electronics Engineers Inc.	en
dc.title	A Performance Evaluation of Distributed Deep Learning Frameworks on CPU Clusters Using Image Classification Workloads	en
dc.type	conferenceItem	en

Αρχεία σε αυτό το τεκμήριο

Αρχεία	Μέγεθος	Τύπος	Προβολή
Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ. [19705]

Εμφάνιση απλής εγγραφής

A Performance Evaluation of Distributed Deep Learning Frameworks on CPU Clusters Using Image Classification Workloads

Αρχεία σε αυτό το τεκμήριο

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Related items

Εξυπνοι και αλληλεπιδρώμενοι πράκτορες e-learning, smartive e-learning agents - smart and interactive e-learning agents ﻿

Μηχανική και ενισχυτική μάθηση μέσω του αλγορίθμου Q-learning ﻿

Motivating Engineer Students in E-learning Courses with Problem Based Learning and Self-Regulated Learning on the apT2CLE4‘Research Methods’ Environment ﻿

Εξυπνοι και αλληλεπιδρώμενοι πράκτορες e-learning, smartive e-learning agents - smart and interactive e-learning agents

Μηχανική και ενισχυτική μάθηση μέσω του αλγορίθμου Q-learning

Motivating Engineer Students in E-learning Courses with Problem Based Learning and Self-Regulated Learning on the apT2CLE4‘Research Methods’ Environment