Design Space Exploration of a Sparse MobileNetV2 Using High-Level Synthesis and Sparse Matrix Techniques on FPGAs
Ημερομηνία
2022Γλώσσα
en
Λέξη-κλειδί
Επιτομή
Convolution Neural Networks (CNNs) are gaining ground in deep learning and Artificial Intelligence (AI) domains, and they can benefit from rapid prototyping in order to produce efficient and low-power hardware designs. The inference process of a Deep Neural Network (DNN) is considered a computationally intensive process that requires hardware accelerators to operate in real-world scenarios due to the low latency requirements of real-time applications. As a result, High-Level Synthesis (HLS) tools are gaining popularity since they provide attractive ways to reduce design time complexity directly in register transfer level (RTL). In this paper, we implement a MobileNetV2 model using a state-of-the-art HLS tool in order to conduct a design space exploration and to provide insights on complex hardware designs which are tailored for DNN inference. Our goal is to combine design methodologies with sparsification techniques to produce hardware accelerators that achieve comparable error metrics within the same order of magnitude with the corresponding state-of-the-art systems while also significantly reducing the inference latency and resource utilization. Toward this end, we apply sparse matrix techniques on a MobileNetV2 model for efficient data representation, and we evaluate our designs in two different weight pruning approaches. Experimental results are evaluated with respect to the CIFAR-10 data set using several different design methodologies in order to fully explore their effects on the performance of the model under examination. © 2022 by the authors. Licensee MDPI, Basel, Switzerland.
Collections
Related items
Showing items related by title, author, creator and subject.
-
Optimization driven multi-hop network design and experimentation: The approach of the FP7 project OPNEX
Choumas, K.; Keranidis, S.; Koutsopoulos, I.; Korakis, T.; Tassiulas, L.; Juraschek, F.; Günes, M.; Baccelli, E.; Misiorek, P.; Szwabe, A.; Salonidis, T.; Lundgren, H. (2012)The OPNEX project exemplifies system and optimization theory as the foundations for algorithms that provably maximize capacity of wireless networks. The algorithms termed in abstract network models have been converted to ... -
A heuristic approach for overlay content-caching network design in 5G wireless networks
AlMomani A.A., Argyriou A., Erol-Kantarci M. (2016)With a wide variety of mobile applications and social networks, there is a growing demand for anytime, anywhere access to high quality video and other multimedia content. Fifth generation wireless networks are aiming to ... -
Lightweight jammer localization in wireless networks: System design and implementation
Pelechrinis, K.; Koutsopoulos, I.; Broustis, I.; Krishnamurthy, S. V. (2009)Jamming attacks have become prevalent during the last few years, due to the shared nature and the open access to the wireless medium. Finding the location of a jamming device is of great importance for restoring normal ...