Detail View
Logical Anomaly Detection with Text-based Logic via Component-Aware Contrastive Language-Image Training
WEB OF SCIENCE
SCOPUS
- Title
- Logical Anomaly Detection with Text-based Logic via Component-Aware Contrastive Language-Image Training
- Issued Date
- 2025-08-07
- Citation
- ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp.1274 - 1285
- Type
- Conference Paper
- ISBN
- 9798400714542
- ISSN
- 2154-817X
- Abstract
-
AI-based automatic visual inspection systems have been extensively researched to streamline various industrial products' labor-intensive anomaly detection processes. Despite significant advancements, detecting logical anomalies remains challenging due to the multitude of rules governing the assembly of multiple components to create a normal product. Existing methods have relied solely on image information for anomaly detection, resulting in limited accuracy as they fail to account for these diverse complex rules. Instead, humans detect anomalies by comparing the image with pre-defined logic which can be clearly expressed with natural language. Inspired by the human decision process, we propose a logical anomaly detection model that leverages text-based logic like human reasoning. With user-defined rules (i.e., positive rules) and logically distinct negative rules, we train the model using component-aware contrastive learning that increases the similarity between images and positive rules while decreasing the similarity with negative rules. However, accurately comparing textual and visual features is challenging due to multiple components, each governed by different rules, within a single image. To address this, we developed a zero-shot related region detection technique, which guides the model's focus on components relevant to each rule. We evaluated the proposed model on three public datasets and achieved state-of-the-art results in a few-shot logical anomaly detection task. Our findings highlight the potential of integrating vision-language models to enhance logical anomaly detection and utilizing text-based logic in complex industrial settings.
더보기
- Publisher
- Association for Computing Machinery
File Downloads
- There are no files associated with this item.
공유
Total Views & Downloads
???jsp.display-item.statistics.view???: , ???jsp.display-item.statistics.download???:
