Feed Forward Neural Network Sparsification with Dynamic Pruning

A recent hot research topic in deep learning concerns the reduction of the model size of a neural network by pruning, in order to minimize its training and inference cost and thus, being capable of running on devices with memory constraints. In this paper, we employ a pruning technique to sparsify a Multi-Layer Perceptron (MLP) during training, in which the number of topology connections, being pruned and restored, is not stable, but it adopts either one of the following rules: Linear Decreasing Variation (LDV) rule or Oscillating Variation (OSV) rule or Exponential Decay (EXD) rule. We conducted experiments on three MLP Network topologies, implemented with Keras, using the Fashion-MNIST dataset and results showed that the EXD method is a clear winner since, in that case our proposed sparse network has a faster convergence than the dense version of the same one, while it achieves approximately the same high accuracy (around 90%). Furthermore, it is shown that the memory footprint of the aforementioned sparse techniques is at least 95% less instead of the dense version of the network, due to the weights removed. Finally, we present an improved version of the SET implementation in Keras, using Callbacks API, making the SET implementation more efficient. © 2021 ACM.

URI

http://hdl.handle.net/11615/72812

Collections

Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ. [19735]

Feed Forward Neural Network Sparsification with Dynamic Pruning

Autore

Data

Language

DOI

Soggetto

Abstract

URI

Collections

Related items

Deep Endoscopic Visual Measurements ﻿

Pose recognition using convolutional neural networks on omni-directional images ﻿

Resource-efficient TDNN Architectures for Audio-visual Speech Recognition ﻿

Deep Endoscopic Visual Measurements

Pose recognition using convolutional neural networks on omni-directional images

Resource-efficient TDNN Architectures for Audio-visual Speech Recognition