Detail View
SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
WEB OF SCIENCE
SCOPUS
- Title
- SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
- Issued Date
- 2023-07-12
- Citation
- Han, Changhun. (2023-07-12). SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU. Design Automation Conference, 23709164. doi: 10.1109/DAC56929.2023.10247661
- Type
- Conference Paper
- ISBN
- 9798350323481
- ISSN
- 0738-100X
- Abstract
-
Deep neural networks (DNNs) have been deployed in many safety-critical real-time embedded systems. To support DNN tasks in real-time, most previous studies focused on GPU or CPU. However, Edge TPU has not yet been studied for real-time guarantees. This paper presents a real-time DNNs framework for Edge TPU to satisfy multiple DNN inference tasks' timing requirements. The proposed framework provides 1) SRAM allocation and model partitioning techniques and 2) a MIP-based algorithm that determines the amount of SRAM and the number of segments for each task. The experiment result shows that our framework provides 79% higher schedulability than the existing Edge TPU system. © 2023 IEEE.
더보기
- Publisher
- ACM Special Interest Group on Design Automation (SIGDA), IEEE Council on Electronic Design Automation (CEDA)
File Downloads
- There are no files associated with this item.
공유
Related Researcher
- Chwa, Hoonsung좌훈승
-
Department of Electrical Engineering and Computer Science
Total Views & Downloads
???jsp.display-item.statistics.view???: , ???jsp.display-item.statistics.download???:
