Communities & Collections
Researchers & Labs
Titles
DGIST
LIBRARY
DGIST R&D
Detail View
Department of Electrical Engineering and Computer Science
Real-Time Computing Lab
2. Conference Papers
RT-Swap: Addressing GPU Memory Bottlenecks for Real-Time Multi-DNN Inference
Kang, Woosung
;
Lee, Jinkyu
;
Lee, Youngmoon
;
Oh, Sangeun
;
Lee, Kilho
;
Chwa, Hoon Sung
Department of Electrical Engineering and Computer Science
Real-Time Computing Lab
2. Conference Papers
Citations
WEB OF SCIENCE
Citations
SCOPUS
Metadata Downloads
XML
Excel
Title
RT-Swap: Addressing GPU Memory Bottlenecks for Real-Time Multi-DNN Inference
Issued Date
2024-05-15
Citation
Kang, Woosung. (2024-05-15). RT-Swap: Addressing GPU Memory Bottlenecks for Real-Time Multi-DNN Inference. IEEE Real-Time and Embedded Technology and Applications Symposium, 373–385. doi: 10.1109/RTAS61025.2024.00037
Type
Conference Paper
ISBN
9798350358414
ISSN
1545-3421
Abstract
The increasing complexity and memory demands of Deep Neural Networks (DNNs) for real-Time systems pose new significant challenges, one of which is the GPU memory capacity bottleneck, where the limited physical memory inside GPUs impedes the deployment of sophisticated DNN models. This paper presents, to the best of our knowledge, the first study of addressing the GPU memory bottleneck issues, while simultaneously ensuring the timely inference of multiple DNN tasks. We propose RT-Swap, a real-Time memory management framework, that enables transparent and efficient swap scheduling of memory objects, employing the relatively larger CPU memory to extend the available GPU memory capacity, without compromising timing guarantees. We have implemented RT-Swap on top of representative machine-learning frameworks, demonstrating its effectiveness in making significantly more DNN task sets schedulable at least 72% over existing approaches even when the task sets demand up to 96.2% more memory than the GPU's physical capacity. © 2024 IEEE.
URI
http://hdl.handle.net/20.500.11750/57553
DOI
10.1109/RTAS61025.2024.00037
Publisher
IEEE Computer Society
Show Full Item Record
File Downloads
There are no files associated with this item.
공유
공유하기
Related Researcher
Chwa, Hoonsung
좌훈승
Department of Electrical Engineering and Computer Science
read more
Total Views & Downloads