DGIST Scholar: Deep Partitioned Training from Near-Storage Computing to DNN Accelerators

Cited time in webofscience

Cited time in scopus

Deep Partitioned Training from Near-Storage Computing to DNN Accelerators

Title: Deep Partitioned Training from Near-Storage Computing to DNN Accelerators

Author(s): Jang, Yongjoo ; Kim, Sejin ; Kim, Daehoon ; Lee, Sungjin ; Kung, Jaeha

DGIST Authors: Jang, Yongjoo ; Kim, Sejin ; Kim, Daehoon ; Lee, Sungjin ; Kung, Jaeha

Author Keywords: Computational modeling ; Data models ; DNN accelerators ; Indexes ; Kernel ; Near-storage computing ; Parallel processing ; Random access memory ; Training ; Training deep neural networks ; Workload partitioning

Keywords: Virtual storage ; Batch sizes ; Computing devices ; Fpga prototypes ; Training time ; Storage as a service (STaaS) ; Deep neural networks ; Recommender systems

Abstract: In this paper, we present deep partitioned training to accelerate computations involved in training DNN models. This is the first work that partitions a DNN model across storage devices, an NPU and a host CPU forming a unified compute node for training workloads. To validate the benefit of using the proposed system during DNN training, a trace-based simulator or an FPGA prototype is used to estimate the overall performance and obtain the layer index to be partitioned that provides the minimum latency. As a case study, we select two benchmarks, i.e., vision-related tasks and a recommendation system. As a result, the training time reduces by 12.2~31.0% with four near-storage computing devices in vision-related tasks with a mini-batch size of 512 and 40.6~44.7% with one near-storage computing device in the selected recommendation system with a mini-batch size of 64. CCBY

Related Researcher

Kim, Daehoon
Research Interests Computer Architecture and Systems; Virtualization; Cloud Computing

Appears in Collections:: Department of Electrical Engineering and Computer Science Computer Architecture and Systems Lab 1. Journal Articles; Department of Electrical Engineering and Computer Science Data-Intensive Computing Systems Laboratory 1. Journal Articles; Department of Electrical Engineering and Computer Science Intelligent Digital Systems Lab 1. Journal Articles

qrcode

DGIST

DGIST Scholar was built with support from the OAK distribution project by the National Library of Korea.

You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Library Services Team, DGIST 333. Techno Jungang-daero, Hyeonpung-myeon, Dalseong-gun, Daegu, 42988, Republic of Korea.