WEB OF SCIENCE
SCOPUS
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | 서보건 | - |
| dc.contributor.author | 김지선 | - |
| dc.contributor.author | 손준우 | - |
| dc.contributor.author | 박명옥 | - |
| dc.contributor.author | 김기섭 | - |
| dc.date.accessioned | 2026-01-21T21:40:14Z | - |
| dc.date.available | 2026-01-21T21:40:14Z | - |
| dc.date.created | 2026-01-03 | - |
| dc.date.issued | 2026-01 | - |
| dc.identifier.issn | 1225-6382 | - |
| dc.identifier.uri | https://scholar.dgist.ac.kr/handle/20.500.11750/59402 | - |
| dc.description.abstract | The rapid advancement of robotics and deep learning has increasingly accelerated the use of Embodied AI, where robots autonomously explore and reason in complex real-world environments. With the growing demand for domestic service robots, efficient navigation in unfamiliar settings has become even more crucial. Object Goal Navigation (OGN) is a fundamental task for this capability, requiring a robot to find and reach a user-specified object in an unknown environment. Solving OGN demands advanced perception, contextual reasoning, and effective exploration strategies. Recent Vision-Language Models (VLMs) and Large Language Models (LLMs) provide agents with external common knowledge and reasoning capabilities. This paper poses the critical question: “Where should VLM/LLM knowledge be fused into Object Goal Navigation?” We categorize knowledge integration into the three stages adapted from the Perception-Prediction-Planning paradigm to offer a structured survey of Object Goal Navigation approaches shaped by the VLM era. We conclude by discussing current dataset limitations and future directions, including further studies on socially interactive navigation and operation in mixed indoor - outdoor environments. | - |
| dc.language | Korean | - |
| dc.publisher | Korean Society of Automotive Engineers | - |
| dc.title | VLM 시대의 목표 객체 탐색을 위한 지식 융합 전략 서베이 | - |
| dc.title.alternative | Where to Fuse in the VLM Era: A Survey on Integrating Knowledge into Object Goal Navigation | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.7467/KSAE.2026.34.1.119 | - |
| dc.identifier.bibliographicCitation | Transactions of the Korean Society of Automotive Engineers, v.34, no.1, pp.119 - 131 | - |
| dc.identifier.kciid | ART003277209 | - |
| dc.description.isOpenAccess | TRUE | - |
| dc.subject.keywordAuthor | 실내탐색 | - |
| dc.subject.keywordAuthor | 비전언어모델 | - |
| dc.subject.keywordAuthor | 거대언어모델 | - |
| dc.subject.keywordAuthor | 목표객체탐색 | - |
| dc.subject.keywordAuthor | Indoor navigation | - |
| dc.subject.keywordAuthor | Vision language model | - |
| dc.subject.keywordAuthor | Large language model | - |
| dc.subject.keywordAuthor | Object goal navigation | - |
| dc.citation.endPage | 131 | - |
| dc.citation.number | 1 | - |
| dc.citation.startPage | 119 | - |
| dc.citation.title | Transactions of the Korean Society of Automotive Engineers | - |
| dc.citation.volume | 34 | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.type.docType | Article | - |