Communities & Collections
Researchers & Labs
Titles
DGIST
LIBRARY
DGIST R&D
Detail View
Department of Electrical Engineering and Computer Science
Real-Time Computing Lab
1. Journal Articles
Timing guarantees for inference of AI models in embedded systems
Lee, Seunghoon
;
Kang, Woosung
;
Bertogna, Marko
;
Chwa, Hoon Sung
;
Lee, Jinkyu
Department of Electrical Engineering and Computer Science
Real-Time Computing Lab
1. Journal Articles
Citations
WEB OF SCIENCE
Citations
SCOPUS
Metadata Downloads
XML
Excel
Title
Timing guarantees for inference of AI models in embedded systems
Issued Date
2025-06
Citation
Real-Time Systems, v.61, no.2, pp.259 - 267
Type
Article
Author Keywords
Timing guarantees
;
Embedded systems
;
Machine learning
;
Inference
ISSN
0922-6443
Abstract
Machine learning (ML) is increasingly being integrated into real-time embedded systems, enabling intelligent decision-making in applications such as autonomous driving and industrial automation. However, ensuring predictable execution of deep neural network (DNN) inference remains a major challenge, as real-time systems must meet strict timing constraints to guarantee safety and reliability. This paper identifies key challenges in achieving real-time AI inference in embedded systems, including limited memory capacity, high energy consumption, efficient multi-DNN scheduling, and heterogeneous resource management. To address these challenges, we emphasize the need for advanced scheduling algorithms to efficiently allocate heterogeneous computing resources across multiple DNNs, hierarchical memory management to reduce memory bottlenecks, and real-time neural architecture search and optimization techniques to enhance AI model performance under strict timing constraints. Furthermore, we discuss future research directions aimed at improving real-time AI execution, including time-predictable scheduling frameworks to ensure consistent inference latency, cross-device AI workload management to optimize resource utilization across heterogeneous processors, and benchmarking methodologies to systematically evaluate performance, timing guarantees, and energy efficiency in real-time AI systems. Advancing these research areas will enhance the reliability, efficiency, and scalability of AI-driven embedded systems, bridging the gap between ML advancements and real-time system requirements. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.
URI
https://scholar.dgist.ac.kr/handle/20.500.11750/58571
DOI
10.1007/s11241-025-09445-9
Publisher
Springer Nature
Show Full Item Record
File Downloads
There are no files associated with this item.
공유
공유하기
Related Researcher
Chwa, Hoonsung
좌훈승
Department of Electrical Engineering and Computer Science
read more
Total Views & Downloads