WEB OF SCIENCE
SCOPUS
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Lee, Seunghoon | - |
| dc.contributor.author | Kang, Woosung | - |
| dc.contributor.author | Bertogna, Marko | - |
| dc.contributor.author | Chwa, Hoon Sung | - |
| dc.contributor.author | Lee, Jinkyu | - |
| dc.date.accessioned | 2025-07-02T19:40:10Z | - |
| dc.date.available | 2025-07-02T19:40:10Z | - |
| dc.date.created | 2025-06-30 | - |
| dc.date.issued | 2025-06 | - |
| dc.identifier.issn | 0922-6443 | - |
| dc.identifier.uri | https://scholar.dgist.ac.kr/handle/20.500.11750/58571 | - |
| dc.description.abstract | Machine learning (ML) is increasingly being integrated into real-time embedded systems, enabling intelligent decision-making in applications such as autonomous driving and industrial automation. However, ensuring predictable execution of deep neural network (DNN) inference remains a major challenge, as real-time systems must meet strict timing constraints to guarantee safety and reliability. This paper identifies key challenges in achieving real-time AI inference in embedded systems, including limited memory capacity, high energy consumption, efficient multi-DNN scheduling, and heterogeneous resource management. To address these challenges, we emphasize the need for advanced scheduling algorithms to efficiently allocate heterogeneous computing resources across multiple DNNs, hierarchical memory management to reduce memory bottlenecks, and real-time neural architecture search and optimization techniques to enhance AI model performance under strict timing constraints. Furthermore, we discuss future research directions aimed at improving real-time AI execution, including time-predictable scheduling frameworks to ensure consistent inference latency, cross-device AI workload management to optimize resource utilization across heterogeneous processors, and benchmarking methodologies to systematically evaluate performance, timing guarantees, and energy efficiency in real-time AI systems. Advancing these research areas will enhance the reliability, efficiency, and scalability of AI-driven embedded systems, bridging the gap between ML advancements and real-time system requirements. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025. | - |
| dc.language | English | - |
| dc.publisher | Springer Nature | - |
| dc.title | Timing guarantees for inference of AI models in embedded systems | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1007/s11241-025-09445-9 | - |
| dc.identifier.wosid | 001511369300001 | - |
| dc.identifier.scopusid | 2-s2.0-105008410537 | - |
| dc.identifier.bibliographicCitation | Real-Time Systems, v.61, no.2, pp.259 - 267 | - |
| dc.description.isOpenAccess | FALSE | - |
| dc.subject.keywordAuthor | Timing guarantees | - |
| dc.subject.keywordAuthor | Embedded systems | - |
| dc.subject.keywordAuthor | Machine learning | - |
| dc.subject.keywordAuthor | Inference | - |
| dc.citation.endPage | 267 | - |
| dc.citation.number | 2 | - |
| dc.citation.startPage | 259 | - |
| dc.citation.title | Real-Time Systems | - |
| dc.citation.volume | 61 | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
| dc.type.docType | Article | - |
Department of Electrical Engineering and Computer Science