论文标题
用于工业异常检测的非对称学生教师网络
Asymmetric Student-Teacher Networks for Industrial Anomaly Detection
论文作者
论文摘要
通常使用异常检测(AD)方法来解决工业缺陷检测,其中没有或仅仅是可能发生的缺陷的数据。这项工作发现了以前未知的AD学生教师方法的问题,并提出了一种解决方案,其中两个神经网络经过培训,可以为无缺陷的培训示例产生相同的输出。学生教师网络的核心假设是,两个网络的输出之间的距离对于异常情况较大,因为它们在培训中没有。但是,以前的方法遭受了学生和教师架构的相似性,因此对于异常情况而言,距离很小。因此,我们提出了不对称的学生教师网络(AST)。我们训练标准化流量作为教师的密度估计,并作为学生作为学生的传统喂养网络触发大距离的异常距离:与正常数据相比,标准化流量的双向范围都会导致教师输出的分歧。在培训分配之外,由于其根本不同的架构,学生无法模仿这种分歧。我们的AST网络通过归一流的流量弥补了错误估计的可能性,这在先前的工作中被用于异常检测。我们表明,我们的方法对当前最相关的缺陷检测数据集MVTEC AD和MVTEC 3D-AD产生有关图像级和3D数据的图像级异常检测的最新结果。
Industrial defect detection is commonly addressed with anomaly detection (AD) methods where no or only incomplete data of potentially occurring defects is available. This work discovers previously unknown problems of student-teacher approaches for AD and proposes a solution, where two neural networks are trained to produce the same output for the defect-free training examples. The core assumption of student-teacher networks is that the distance between the outputs of both networks is larger for anomalies since they are absent in training. However, previous methods suffer from the similarity of student and teacher architecture, such that the distance is undesirably small for anomalies. For this reason, we propose asymmetric student-teacher networks (AST). We train a normalizing flow for density estimation as a teacher and a conventional feed-forward network as a student to trigger large distances for anomalies: The bijectivity of the normalizing flow enforces a divergence of teacher outputs for anomalies compared to normal data. Outside the training distribution the student cannot imitate this divergence due to its fundamentally different architecture. Our AST network compensates for wrongly estimated likelihoods by a normalizing flow, which was alternatively used for anomaly detection in previous work. We show that our method produces state-of-the-art results on the two currently most relevant defect detection datasets MVTec AD and MVTec 3D-AD regarding image-level anomaly detection on RGB and 3D data.