Cited time in webofscience Cited time in scopus

AutoRelax: HW-SW Co-Optimization for Efficient SpGEMM Operations with Automated Relaxation in Deep Learning

Title
AutoRelax: HW-SW Co-Optimization for Efficient SpGEMM Operations with Automated Relaxation in Deep Learning
Author(s)
Park, SehunKim, Jae-joonKung, Jaeha
Issued Date
2022-07
Citation
IEEE Transactions on Emerging Topics in Computing, v.10, no.3, pp.1428 - 1442
Type
Article
Author Keywords
Computational modelingData modelsDeep learningHardwareIndexesSparse matricesTraining
Keywords
Energy efficiencyModeling languagesSpeech recognitionEnergy reductionHardware architectureHigher compression ratiosIts efficienciesLevel of relaxationPruning algorithmsSoftware approachSparse matrix formatsDeep learning
ISSN
2168-6750
Abstract
We propose a HW-SW co-optimization technique to perform energy-efficient spGEMM operations for deep learning. First, we present an automated pruning algorithm, named AutoRelax, that allows some level of relaxation to achieve higher compression ratio. Since the benefit of the proposed pruning algorithm may be limited by the sparsity level of a given weight matrix, we present additional steps to further improve its efficiency. Along with the software approach, we also present a hardware architecture for processing sparse GEMM operations to maximize the benefit of the proposed pruning algorithm and sparse matrix format. To validate the efficiency of our co-optimization methodology, we evaluated the proposed method on three benchmarks, language modeling, speech recognition and image classification. As a result, our approach improved on-chip performance of spGEMM operations by 9.5027.57% and achieved energy reductions of 15.3533.28% considering DRAM accesses over other sparse accelerators. IEEE
URI
http://hdl.handle.net/20.500.11750/15594
DOI
10.1109/TETC.2021.3089848
Publisher
IEEE Computer Society
Related Researcher
  • 궁재하 Kung, Jaeha
  • Research Interests 딥러닝; 가속하드웨어; 저전력 하드웨어; 고성능 시스템
Files in This Item:

There are no files associated with this item.

Appears in Collections:
Department of Electrical Engineering and Computer Science Intelligent Digital Systems Lab 1. Journal Articles

qrcode

  • twitter
  • facebook
  • mendeley

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE