Detail View

SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
Citations

WEB OF SCIENCE

Citations

SCOPUS

Metadata Downloads

Title
SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
Issued Date
2023-07-12
Citation
Han, Changhun. (2023-07-12). SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU. Design Automation Conference, 23709164. doi: 10.1109/DAC56929.2023.10247661
Type
Conference Paper
ISBN
9798350323481
ISSN
0738-100X
Abstract
Deep neural networks (DNNs) have been deployed in many safety-critical real-time embedded systems. To support DNN tasks in real-time, most previous studies focused on GPU or CPU. However, Edge TPU has not yet been studied for real-time guarantees. This paper presents a real-time DNNs framework for Edge TPU to satisfy multiple DNN inference tasks' timing requirements. The proposed framework provides 1) SRAM allocation and model partitioning techniques and 2) a MIP-based algorithm that determines the amount of SRAM and the number of segments for each task. The experiment result shows that our framework provides 79% higher schedulability than the existing Edge TPU system. © 2023 IEEE.
URI
http://hdl.handle.net/20.500.11750/47897
DOI
10.1109/DAC56929.2023.10247661
Publisher
ACM Special Interest Group on Design Automation (SIGDA), IEEE Council on Electronic Design Automation (CEDA)
Show Full Item Record

File Downloads

  • There are no files associated with this item.

공유

qrcode
공유하기

Related Researcher

좌훈승
Chwa, Hoonsung좌훈승

Department of Electrical Engineering and Computer Science

read more

Total Views & Downloads