论文标题
DeepRepair:在现实世界中的操作环境中为DNNS的样式引导修复
DeepRepair: Style-Guided Repairing for DNNs in the Real-world Operational Environment
论文作者
论文摘要
深度神经网络(DNN)由于高性能(例如,图像分类的高精度)而广泛应用于各个领域的各种现实世界应用。然而,部署后训练有素的DNN通常会在操作环境中实际使用期间会引起错误(即预测不正确)在部署的操作环境下,同时不损害其处理正常或清洁数据的能力。我们可以在操作环境中噪声因子引起的实践中可以收集的故障样本的数量通常受到限制。因此,基于我们可以收集的有限故障样本来修复更多类似的故障,这是一个挑战。 在本文中,我们提出了一种样式引导的数据增强,以在操作环境中修复DNN。我们提出了一种样式转移方法,以通过数据增强来学习和将故障数据中未知的故障模式介绍到培训数据中。此外,我们进一步提出了基于聚类的失败数据生成,以实现更有效的样式引导数据增强。我们对现实世界中可能发生的15个降解因子进行了大规模评估,并与四种最先进的数据增强方法和两种DNN维修方法进行了比较,这表明我们的方法可以显着增强运营环境中损坏的数据的部署DNN,并且在清洁数据集中可以更好地准确。
Deep neural networks (DNNs) are being widely applied for various real-world applications across domains due to their high performance (e.g., high accuracy on image classification). Nevertheless, a well-trained DNN after deployment could oftentimes raise errors during practical use in the operational environment due to the mismatching between distributions of the training dataset and the potential unknown noise factors in the operational environment, e.g., weather, blur, noise etc. Hence, it poses a rather important problem for the DNNs' real-world applications: how to repair the deployed DNNs for correcting the failure samples (i.e., incorrect prediction) under the deployed operational environment while not harming their capability of handling normal or clean data. The number of failure samples we can collect in practice, caused by the noise factors in the operational environment, is often limited. Therefore, It is rather challenging how to repair more similar failures based on the limited failure samples we can collect. In this paper, we propose a style-guided data augmentation for repairing DNN in the operational environment. We propose a style transfer method to learn and introduce the unknown failure patterns within the failure data into the training data via data augmentation. Moreover, we further propose the clustering-based failure data generation for much more effective style-guided data augmentation. We conduct a large-scale evaluation with fifteen degradation factors that may happen in the real world and compare with four state-of-the-art data augmentation methods and two DNN repairing methods, demonstrating that our method can significantly enhance the deployed DNNs on the corrupted data in the operational environment, and with even better accuracy on clean datasets.