DGIST Scholar: DNN-SAM: Split-and-Merge DNN Execution for Real-Time Object Detection

Title: DNN-SAM: Split-and-Merge DNN Execution for Real-Time Object Detection

Author(s): Kang, Woosung ; Chung, Siwoo ; Kim, Jeremy Yuhyun ; Lee, Youngmoon ; Lee, Kilho ; Lee, Jinkyu ; Shin, Kang G. ; Chwa, Hoon Sung

Issued Date: 2022-05-04

Citation: IEEE Real-Time and Embedded Technology and Applications Symposium, pp. 160-172

Type: Conference Paper

ISBN: 9781665499989

ISSN: 1545-3421

Abstract: As real-time object detection systems, such as autonomous cars, need to process input images acquired from multiple cameras, they face significant challenges in delivering accurate and timely inferences, which are often based on machine learning (ML). To meet these challenges, we want to provide different levels of object detection accuracy and timeliness to different portions of each input image according to their criticality levels. Specifically, we develop DNN-SAM, a dynamic Split-And-Merge Deep Neural Network (DNN) execution and scheduling framework that enables seamless split-and-merge DNN execution for unmodified DNN models. Instead of processing an entire input image once in a full DNN model, DNN-SAM first splits a DNN inference task into two smaller sub-tasks: a mandatory sub-task dedicated to a safety-critical (cropped) portion of each image, and an optional sub-task that processes a down-scaled image. It then executes them independently and finally merges their results into a complete inference. To achieve DNN-SAM's timely and accurate detection of objects in each image, we also develop two scheduling algorithms that prioritize sub-tasks according to their criticality levels and adaptively adjust the scale of the input image to meet the timing constraints while minimizing the response time of mandatory sub-tasks or maximizing the accuracy of optional sub-tasks. We have implemented and evaluated DNN-SAM on a representative ML framework. Our evaluation shows that DNN-SAM improves detection accuracy in the safety-critical region by 2.0-3.7× and lowers average inference latency by 4.8-9.7× over existing approaches without violating any timing constraints. © 2022 IEEE.
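
Illustrative sketch (not from the paper): the split-and-merge flow described in the abstract can be pictured roughly as follows. Everything here is an assumption made for illustration: the names `Detection`, `crop`, `downscale`, `iou`, and `split_and_merge` are hypothetical, images are plain nested lists, the detector is any callable supplied by the caller, and the IoU-based merge rule is a simple stand-in, since the abstract does not specify how DNN-SAM merges results or schedules the two sub-tasks.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


@dataclass
class Detection:
    box: Box
    score: float
    label: str


def crop(image: Sequence[Sequence], region: Tuple[int, int, int, int]) -> List[list]:
    """Return the pixels of `image` inside region = (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = region
    return [list(row[x1:x2]) for row in image[y1:y2]]


def downscale(image: Sequence[Sequence], factor: int) -> List[list]:
    """Nearest-neighbour subsampling by an integer factor (assumed scheme)."""
    return [list(row[::factor]) for row in image[::factor]]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def split_and_merge(
    image: Sequence[Sequence],
    critical_region: Tuple[int, int, int, int],
    factor: int,
    detector: Callable[[List[list]], List[Detection]],
    iou_threshold: float = 0.5,
) -> List[Detection]:
    x1, y1, _, _ = critical_region

    # Mandatory sub-task: full-resolution crop of the safety-critical region.
    # Boxes are shifted back into full-image coordinates.
    mandatory = [
        Detection((d.box[0] + x1, d.box[1] + y1, d.box[2] + x1, d.box[3] + y1),
                  d.score, d.label)
        for d in detector(crop(image, critical_region))
    ]

    # Optional sub-task: down-scaled copy of the whole image.
    # Boxes are scaled back up to full-image coordinates.
    optional = [
        Detection((d.box[0] * factor, d.box[1] * factor,
                   d.box[2] * factor, d.box[3] * factor), d.score, d.label)
        for d in detector(downscale(image, factor))
    ]

    # Merge (assumed rule): keep every mandatory detection, and add an optional
    # detection only if it does not overlap a same-label mandatory detection.
    merged = list(mandatory)
    for d in optional:
        if all(d.label != m.label or iou(d.box, m.box) < iou_threshold
               for m in mandatory):
            merged.append(d)
    return merged
```

In this sketch the two detector calls run one after the other; a scheduler of the kind the abstract describes would instead decide when each sub-task runs and which `factor` to use.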

URI: http://hdl.handle.net/20.500.11750/46853

DOI: 10.1109/RTAS54340.2022.00021

Publisher: Institute of Electrical and Electronics Engineers Inc.

Related Researcher

Chwa, Hoon Sung
Research Interests: Real-Time Systems; Real-Time AI Services; Cyber-Physical Systems; Mobile Systems

Files in This Item: There are no files associated with this item.

Appears in Collections: Department of Electrical Engineering and Computer Science > Real-Time Computing Lab > 2. Conference Papers


