论文标题
简单性的优势:网络设备工作负载预测的轻量级模型
Superiority of Simplicity: A Lightweight Model for Network Device Workload Prediction
论文作者
论文摘要
IT系统的快速增长和分布增加了它们的复杂性,并加剧了操作和维护。为了控制大量主机和连接网络的控制,采用并不断增强监视解决方案。他们收集各种关键的性能指标(KPI)(例如CPU利用率,分配的内存等),并提供有关系统状态的详细信息。在一段时间内存储此类指标自然会提高基于过去观察结果来预测未来KPI进展的动机。尽管存在各种时间序列的预测方法,但预测IT系统的进度KPI非常困难。首先,诸如CPU利用率或分配的内存之类的KPI类型非常不同,并且很难用同一模型表达。其次,由于软软或固件更新和硬件现代化,系统组件是互连并不断变化的。因此,必须预期经常进行重新调整或微调。因此,我们提出了基于历史观察结果的KPI系列预测的轻量级解决方案。它由由两个模型组成的加权异质合奏方法组成 - 神经网络和平均预测指标。作为集合方法,使用加权求和,从而使用启发式式来设置权重。在可用的FedCSIS 2020挑战数据集上评估了建模方法,并在初步10%的测试数据上获得了总体$ R^2 $得分为0.10,并且在完整的测试数据上达到了0.15。我们在以下GitHub存储库上发布代码:https://github.com/citlab/fed_challenge
The rapid growth and distribution of IT systems increases their complexity and aggravates operation and maintenance. To sustain control over large sets of hosts and the connecting networks, monitoring solutions are employed and constantly enhanced. They collect diverse key performance indicators (KPIs) (e.g. CPU utilization, allocated memory, etc.) and provide detailed information about the system state. Storing such metrics over a period of time naturally raises the motivation of predicting future KPI progress based on past observations. Although, a variety of time series forecasting methods exist, forecasting the progress of IT system KPIs is very hard. First, KPI types like CPU utilization or allocated memory are very different and hard to be expressed by the same model. Second, system components are interconnected and constantly changing due to soft- or firmware updates and hardware modernization. Thus a frequent model retraining or fine-tuning must be expected. Therefore, we propose a lightweight solution for KPI series prediction based on historic observations. It consists of a weighted heterogeneous ensemble method composed of two models - a neural network and a mean predictor. As ensemble method a weighted summation is used, whereby a heuristic is employed to set the weights. The modelling approach is evaluated on the available FedCSIS 2020 challenge dataset and achieves an overall $R^2$ score of 0.10 on the preliminary 10% test data and 0.15 on the complete test data. We publish our code on the following github repository: https://github.com/citlab/fed_challenge