Detail View

DC Field Value Language
dc.contributor.author Lee, Seung-eon -
dc.contributor.author Kim, Soopil -
dc.contributor.author An, Sion -
dc.contributor.author Lee, Sang-Chul -
dc.contributor.author Park, Sang Hyun -
dc.date.accessioned 2026-02-04T21:10:13Z -
dc.date.available 2026-02-04T21:10:13Z -
dc.date.created 2025-09-12 -
dc.date.issued 2025-08-07 -
dc.identifier.isbn 9798400714542 -
dc.identifier.issn 2154-817X -
dc.identifier.uri https://scholar.dgist.ac.kr/handle/20.500.11750/59905 -
dc.description.abstract AI-based automatic visual inspection systems have been extensively researched to streamline various industrial products' labor-intensive anomaly detection processes. Despite significant advancements, detecting logical anomalies remains challenging due to the multitude of rules governing the assembly of multiple components to create a normal product. Existing methods have relied solely on image information for anomaly detection, resulting in limited accuracy as they fail to account for these diverse complex rules. Instead, humans detect anomalies by comparing the image with pre-defined logic which can be clearly expressed with natural language. Inspired by the human decision process, we propose a logical anomaly detection model that leverages text-based logic like human reasoning. With user-defined rules (i.e., positive rules) and logically distinct negative rules, we train the model using component-aware contrastive learning that increases the similarity between images and positive rules while decreasing the similarity with negative rules. However, accurately comparing textual and visual features is challenging due to multiple components, each governed by different rules, within a single image. To address this, we developed a zero-shot related region detection technique, which guides the model's focus on components relevant to each rule. We evaluated the proposed model on three public datasets and achieved state-of-the-art results in a few-shot logical anomaly detection task. Our findings highlight the potential of integrating vision-language models to enhance logical anomaly detection and utilizing text-based logic in complex industrial settings. -
dc.language English -
dc.publisher Association for Computing Machinery -
dc.relation.ispartof Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining -
dc.title Logical Anomaly Detection with Text-based Logic via Component-Aware Contrastive Language-Image Training -
dc.type Conference Paper -
dc.identifier.doi 10.1145/3711896.3737032 -
dc.identifier.wosid 001592428900112 -
dc.identifier.scopusid 2-s2.0-105014587756 -
dc.identifier.bibliographicCitation ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp.1274 - 1285 -
dc.identifier.url https://kdd2025.kdd.org/research-track-papers-2/ -
dc.citation.conferenceDate 2025-08-03 -
dc.citation.conferencePlace CN -
dc.citation.conferencePlace Toronto -
dc.citation.endPage 1285 -
dc.citation.startPage 1274 -
dc.citation.title ACM SIGKDD Conference on Knowledge Discovery and Data Mining -
Show Simple Item Record

File Downloads

  • There are no files associated with this item.

공유

qrcode
공유하기

Related Researcher

김수필
Kim, Soopil김수필

Division of Intelligent Robotics

read more

Total Views & Downloads

???jsp.display-item.statistics.view???: , ???jsp.display-item.statistics.download???: