Communities & Collections
Researchers & Labs
Titles
DGIST
LIBRARY
DGIST R&D
Detail View
Department of Electrical Engineering and Computer Science
Real-Time Computing Lab
2. Conference Papers
SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
Han, Changhun
;
Chwa, Hoon Sung
;
Lee, Kilho
;
Oh, Sangeun
Department of Electrical Engineering and Computer Science
Real-Time Computing Lab
2. Conference Papers
Citations
WEB OF SCIENCE
Citations
SCOPUS
Metadata Downloads
XML
Excel
Title
SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
Issued Date
2023-07-12
Citation
Han, Changhun. (2023-07-12). SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU. Design Automation Conference, 23709164. doi: 10.1109/DAC56929.2023.10247661
Type
Conference Paper
ISBN
9798350323481
ISSN
0738-100X
Abstract
Deep neural networks (DNNs) have been deployed in many safety-critical real-time embedded systems. To support DNN tasks in real-time, most previous studies focused on GPU or CPU. However, Edge TPU has not yet been studied for real-time guarantees. This paper presents a real-time DNNs framework for Edge TPU to satisfy multiple DNN inference tasks' timing requirements. The proposed framework provides 1) SRAM allocation and model partitioning techniques and 2) a MIP-based algorithm that determines the amount of SRAM and the number of segments for each task. The experiment result shows that our framework provides 79% higher schedulability than the existing Edge TPU system. © 2023 IEEE.
URI
http://hdl.handle.net/20.500.11750/47897
DOI
10.1109/DAC56929.2023.10247661
Publisher
ACM Special Interest Group on Design Automation (SIGDA), IEEE Council on Electronic Design Automation (CEDA)
Show Full Item Record
File Downloads
There are no files associated with this item.
공유
공유하기
Related Researcher
Chwa, Hoonsung
좌훈승
Department of Electrical Engineering and Computer Science
read more
Total Views & Downloads