当前位置: X-MOL 学术Comput. Ind. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
TDAD: Self-supervised industrial anomaly detection with a two-stage diffusion model
Computers in Industry ( IF 8.2 ) Pub Date : 2024-09-26 , DOI: 10.1016/j.compind.2024.104192
Changyun Wei, Hui Han, Yu Xia, Ze Ji

Visual anomaly detection has emerged as a highly applicable solution in practical industrial manufacturing, owing to its notable effectiveness and efficiency. However, it also presents several challenges and uncertainties. To address the complexity of anomaly types and the high cost associated with data annotation, this paper introduces a self-supervised learning framework called TDAD, based on a two-stage diffusion model. TDAD consists of three key components: anomaly synthesis, image reconstruction, and defect segmentation. It is trained end-to-end, with the goal of improving pixel-level segmentation accuracy of anomalies and reducing false detection rates. By synthesizing anomalies from normal samples, designing a diffusion model-based reconstruction network, and incorporating a multiscale semantic feature fusion module for defect segmentation, TDAD achieves state-of-the-art performance in image-level detection and anomaly localization on challenging and widely used datasets such as MVTec and VisA benchmarks.

中文翻译:


TDAD:采用两阶段扩散模型的自监督工业异常检测



视觉异常检测因其显着的有效性和效率而成为实际工业制造中高度适用的解决方案。然而,它也带来了一些挑战和不确定性。为了解决异常类型的复杂性以及与数据注释相关的高成本,本文引入了一种名为 TDAD 的自监督学习框架,该框架基于两阶段扩散模型。 TDAD 由三个关键部分组成:异常合成、图像重建和缺陷分割。它经过端到端训练,目标是提高异常的像素级分割精度并降低错误检测率。通过合成正常样本中的异常,设计基于扩散模型的重建网络,并结合用于缺陷分割的多尺度语义特征融合模块,TDAD 在具有挑战性和广泛性的图像级检测和异常定位方面实现了最先进的性能使用 MVTec 和 VisA 基准等数据集。
更新日期:2024-09-26
down
wechat
bug