Detail View

VLM 시대의 목표 객체 탐색을 위한 지식 융합 전략 서베이
Citations

WEB OF SCIENCE

Citations

SCOPUS

Metadata Downloads

DC Field Value Language
dc.contributor.author 서보건 -
dc.contributor.author 김지선 -
dc.contributor.author 손준우 -
dc.contributor.author 박명옥 -
dc.contributor.author 김기섭 -
dc.date.accessioned 2026-01-21T21:40:14Z -
dc.date.available 2026-01-21T21:40:14Z -
dc.date.created 2026-01-03 -
dc.date.issued 2026-01 -
dc.identifier.issn 1225-6382 -
dc.identifier.uri https://scholar.dgist.ac.kr/handle/20.500.11750/59402 -
dc.description.abstract The rapid advancement of robotics and deep learning has increasingly accelerated the use of Embodied AI, where robots autonomously explore and reason in complex real-world environments. With the growing demand for domestic service robots, efficient navigation in unfamiliar settings has become even more crucial. Object Goal Navigation (OGN) is a fundamental task for this capability, requiring a robot to find and reach a user-specified object in an unknown environment. Solving OGN demands advanced perception, contextual reasoning, and effective exploration strategies. Recent Vision-Language Models (VLMs) and Large Language Models (LLMs) provide agents with external common knowledge and reasoning capabilities. This paper poses the critical question: “Where should VLM/LLM knowledge be fused into Object Goal Navigation?” We categorize knowledge integration into the three stages adapted from the Perception-Prediction-Planning paradigm to offer a structured survey of Object Goal Navigation approaches shaped by the VLM era. We conclude by discussing current dataset limitations and future directions, including further studies on socially interactive navigation and operation in mixed indoor - outdoor environments. -
dc.language Korean -
dc.publisher Korean Society of Automotive Engineers -
dc.title VLM 시대의 목표 객체 탐색을 위한 지식 융합 전략 서베이 -
dc.title.alternative Where to Fuse in the VLM Era: A Survey on Integrating Knowledge into Object Goal Navigation -
dc.type Article -
dc.identifier.doi 10.7467/KSAE.2026.34.1.119 -
dc.identifier.bibliographicCitation Transactions of the Korean Society of Automotive Engineers, v.34, no.1, pp.119 - 131 -
dc.identifier.kciid ART003277209 -
dc.description.isOpenAccess TRUE -
dc.subject.keywordAuthor 실내탐색 -
dc.subject.keywordAuthor 비전언어모델 -
dc.subject.keywordAuthor 거대언어모델 -
dc.subject.keywordAuthor 목표객체탐색 -
dc.subject.keywordAuthor Indoor navigation -
dc.subject.keywordAuthor Vision language model -
dc.subject.keywordAuthor Large language model -
dc.subject.keywordAuthor Object goal navigation -
dc.citation.endPage 131 -
dc.citation.number 1 -
dc.citation.startPage 119 -
dc.citation.title Transactions of the Korean Society of Automotive Engineers -
dc.citation.volume 34 -
dc.description.journalRegisteredClass scopus -
dc.description.journalRegisteredClass kci -
dc.type.docType Article -
Show Simple Item Record

File Downloads

공유

qrcode
공유하기

Related Researcher

손준우
Son, Joonwoo손준우

Division of AI, Big data and Block chain

read more

Total Views & Downloads