论文标题
校准的一级分类用于无监督时间序列异常检测
Calibrated One-class Classification for Unsupervised Time Series Anomaly Detection
论文作者
论文摘要
时间序列异常检测在维持各个域中的系统可用性方面具有重要作用。当前的研究线中的当前工作主要集中在学习数据正常性上,通过设计高级神经网络结构和新的重建/预测学习目标,深入而全面地学习。但是,他们的单级学习过程可能会被无监督范式下的训练数据(即异常污染)中的潜在异常误导。他们的学习过程还缺乏对异常的知识。因此,他们经常学习一个有偏见的,不准确的正态边界。为了解决这些问题,本文提出了通过基于不确定性建模的校准和基于天然异常的校准来校准单级分类,以实现污染污染,耐污染的数据正态性。具体而言,我们的方法会自适应地惩罚不确定的预测,以限制优化过程中异常污染中的不规则样本,同时鼓励对常规样本的自信预测以确保有效的正态性学习。这在很大程度上减轻了异常污染的负面影响。我们的方法还通过扰动创建本地异常示例,以模拟时间序列异常行为。通过区分这些虚拟异常,我们的单级学习得到了进一步的校准,以形成更精确的正态性边界。十个现实世界数据集的广泛实验表明,我们的模型在16个最先进的竞争者中取得了重大改进。
Time series anomaly detection is instrumental in maintaining system availability in various domains. Current work in this research line mainly focuses on learning data normality deeply and comprehensively by devising advanced neural network structures and new reconstruction/prediction learning objectives. However, their one-class learning process can be misled by latent anomalies in training data (i.e., anomaly contamination) under the unsupervised paradigm. Their learning process also lacks knowledge about the anomalies. Consequently, they often learn a biased, inaccurate normality boundary. To tackle these problems, this paper proposes calibrated one-class classification for anomaly detection, realizing contamination-tolerant, anomaly-informed learning of data normality via uncertainty modeling-based calibration and native anomaly-based calibration. Specifically, our approach adaptively penalizes uncertain predictions to restrain irregular samples in anomaly contamination during optimization, while simultaneously encouraging confident predictions on regular samples to ensure effective normality learning. This largely alleviates the negative impact of anomaly contamination. Our approach also creates native anomaly examples via perturbation to simulate time series abnormal behaviors. Through discriminating these dummy anomalies, our one-class learning is further calibrated to form a more precise normality boundary. Extensive experiments on ten real-world datasets show that our model achieves substantial improvement over sixteen state-of-the-art contenders.