论文标题
使用生成预测来迈向更好的远程时间序列预测
Towards Better Long-range Time Series Forecasting using Generative Forecasting
论文作者
论文摘要
远程时间序列的预测通常基于两种现有的预测策略之一:直接预测和迭代预测,前者提供较低的偏见,高方差预测,后者导致差异较低,高偏见预测。在本文中,我们提出了一种称为生成预测(GENF)的新预测策略,该策略在接下来的几个时间步骤中生成合成数据,然后根据生成和观察到的数据进行远程预测。从理论上讲,我们证明GENF能够更好地平衡预测差异和偏见,从而导致预测误差要小得多。我们通过三个组件实现GENF:(i)基于合成时间序列数据生成的基于wasserstein生成的对抗网络(GAN)的发电机,称为CWGAN-TS。 (ii)一个基于变压器的预测变量,该预测变量使用生成和观察到的数据进行远程预测。 (iii)一种信息理论聚类算法,以改善CWGAN-TS和基于变压器预测指标的训练。五个公共数据集的实验结果表明,GENF的表现明显优于各种各样的最先进的基准和经典方法。具体而言,与基准相比,我们发现预测性能提高了5%-11%,而参数降低了15%-50%。最后,我们进行了一项消融研究,以进一步探索和证明包含GENF的组件的有效性。
Long-range time series forecasting is usually based on one of two existing forecasting strategies: Direct Forecasting and Iterative Forecasting, where the former provides low bias, high variance forecasts and the latter leads to low variance, high bias forecasts. In this paper, we propose a new forecasting strategy called Generative Forecasting (GenF), which generates synthetic data for the next few time steps and then makes long-range forecasts based on generated and observed data. We theoretically prove that GenF is able to better balance the forecasting variance and bias, leading to a much smaller forecasting error. We implement GenF via three components: (i) a novel conditional Wasserstein Generative Adversarial Network (GAN) based generator for synthetic time series data generation, called CWGAN-TS. (ii) a transformer based predictor, which makes long-range predictions using both generated and observed data. (iii) an information theoretic clustering algorithm to improve the training of both the CWGAN-TS and the transformer based predictor. The experimental results on five public datasets demonstrate that GenF significantly outperforms a diverse range of state-of-the-art benchmarks and classical approaches. Specifically, we find a 5% - 11% improvement in predictive performance (mean absolute error) while having a 15% - 50% reduction in parameters compared to the benchmarks. Lastly, we conduct an ablation study to further explore and demonstrate the effectiveness of the components comprising GenF.