论文标题
为AIOPS构建大型现实世界基准数据集
Constructing Large-Scale Real-World Benchmark Datasets for AIOps
论文作者
论文摘要
最近,AIOPS(IT操作的人工智能)在学术界和行业中进行了很好的研究,以实现自动化和有效的软件服务管理。已经为AIOPS进行了大量努力,包括异常检测,根本原因本地化,事件管理等。但是,在私人数据集中评估了大多数现有作品,因此无法保证它们的一般性和实际绩效。缺乏公共大规模的现实世界数据集使研究人员和工程师无法增强AIOPS的发展。为了解决这一难题,在这项工作中,我们介绍了三个有关AIOPS的公共现实世界,大规模数据集,主要针对KPI异常检测,根本原因在多维数据上定位,以及失败发现和诊断。更重要的是,我们基于这些数据集在2018/2019/2020举行了三场比赛,吸引了数千个团队参加。将来,我们将继续发布更多数据集并举办竞争,以进一步促进AIOPS的发展。
Recently, AIOps (Artificial Intelligence for IT Operations) has been well studied in academia and industry to enable automated and effective software service management. Plenty of efforts have been dedicated to AIOps, including anomaly detection, root cause localization, incident management, etc. However, most existing works are evaluated on private datasets, so their generality and real performance cannot be guaranteed. The lack of public large-scale real-world datasets has prevented researchers and engineers from enhancing the development of AIOps. To tackle this dilemma, in this work, we introduce three public real-world, large-scale datasets about AIOps, mainly aiming at KPI anomaly detection, root cause localization on multi-dimensional data, and failure discovery and diagnosis. More importantly, we held three competitions in 2018/2019/2020 based on these datasets, attracting thousands of teams to participate. In the future, we will continue to publish more datasets and hold competitions to promote the development of AIOps further.