论文标题
使用基于分数的生成模型的无监督视觉缺陷检测
Unsupervised Visual Defect Detection with Score-Based Generative Model
论文作者
论文摘要
广泛讨论了一个关键问题的异常检测(AD)。在本文中,我们在许多工业应用中专门研究一个特定问题,即视觉缺陷检测(VDD)。实际上,缺陷图像样本非常罕见且难以收集。因此,我们专注于无监督的视觉缺陷检测和本地化任务,并基于最近基于得分的生成模型提出了一个新型框架,该模型通过迭代通过随机微分方程(SDES)来合成真实图像。我们的工作灵感来自以下事实:在原始图像中注入噪声,可能会在剥离过程中将缺陷更改为正常情况(即重建)。首先,基于假设异常数据位于正常数据分布的低概率密度区域的假设,我们解释了一种常见的现象,这种现象是将基于重建方法应用于VDD的常见现象:正常像素在重建过程中也会改变。其次,由于重建和原始图像之间正常像素的差异,正常数据分布的时间依赖性梯度值(即得分)被用作度量而不是重建损失,以评估缺陷。第三,开发了一种新颖的$ T $量表方法,以大大减少所需的迭代次数,从而加速推理过程。这些实践使我们的模型能够以无监督的方式概括VDD,同时保持合理的性能。我们在几个数据集上评估我们的方法以证明其有效性。
Anomaly Detection (AD), as a critical problem, has been widely discussed. In this paper, we specialize in one specific problem, Visual Defect Detection (VDD), in many industrial applications. And in practice, defect image samples are very rare and difficult to collect. Thus, we focus on the unsupervised visual defect detection and localization tasks and propose a novel framework based on the recent score-based generative models, which synthesize the real image by iterative denoising through stochastic differential equations (SDEs). Our work is inspired by the fact that with noise injected into the original image, the defects may be changed into normal cases in the denoising process (i.e., reconstruction). First, based on the assumption that the anomalous data lie in the low probability density region of the normal data distribution, we explain a common phenomenon that occurs when reconstruction-based approaches are applied to VDD: normal pixels also change during the reconstruction process. Second, due to the differences in normal pixels between the reconstructed and original images, a time-dependent gradient value (i.e., score) of normal data distribution is utilized as a metric, rather than reconstruction loss, to gauge the defects. Third, a novel $T$ scales approach is developed to dramatically reduce the required number of iterations, accelerating the inference process. These practices allow our model to generalize VDD in an unsupervised manner while maintaining reasonably good performance. We evaluate our method on several datasets to demonstrate its effectiveness.