Cited time in webofscience Cited time in scopus

SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU

Title
SPET: Transparent SRAM Allocation and Model Partitioning for Real-time DNN Tasks on Edge TPU
Author(s)
Han, ChanghunChwa, Hoon SungLee, KilhoOh, Sangeun
Issued Date
2023-07-12
Citation
Design Automation Conference, pp.23709164
Type
Conference Paper
ISBN
9798350323481
ISSN
0738-100X
Abstract
Deep neural networks (DNNs) have been deployed in many safety-critical real-time embedded systems. To support DNN tasks in real-time, most previous studies focused on GPU or CPU. However, Edge TPU has not yet been studied for real-time guarantees. This paper presents a real-time DNNs framework for Edge TPU to satisfy multiple DNN inference tasks' timing requirements. The proposed framework provides 1) SRAM allocation and model partitioning techniques and 2) a MIP-based algorithm that determines the amount of SRAM and the number of segments for each task. The experiment result shows that our framework provides 79% higher schedulability than the existing Edge TPU system. © 2023 IEEE.
URI
http://hdl.handle.net/20.500.11750/47897
DOI
10.1109/DAC56929.2023.10247661
Publisher
ACM Special Interest Group on Design Automation (SIGDA), IEEE Council on Electronic Design Automation (CEDA)
Related Researcher
  • 좌훈승 Chwa, Hoon Sung
  • Research Interests Real-Time Systems; Real-Time AI Services; Cyber-Physical Systems; Mobile Systems
Files in This Item:

There are no files associated with this item.

Appears in Collections:
Department of Electrical Engineering and Computer Science Real-Time Computing Lab 2. Conference Papers

qrcode

  • twitter
  • facebook
  • mendeley

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE