论文标题
用于学习动态系统的基于物理和数据驱动的建模
Bridging Physics-based and Data-driven modeling for Learning Dynamical Systems
论文作者
论文摘要
当某些变量未观察到时,我们如何学习动态系统来进行预测?例如,在Covid-19,我们想预测被感染和死亡案件的数量,但我们不知道易感和暴露的人的计数。尽管力学室模型被广泛用于流行病模型,但数据驱动的模型正在促进疾病预测。我们首先将基于物理模型的学习形式化为Autoode,该模型利用自动差异来估计模型参数。通过一项关于Covid-19预测的基准研究,我们注意到基于物理学的机械模型大大优于深度学习。与最佳的深度学习竞争者相比,我们的方法的平均绝对误差为7天预测,平均绝对错误的平均绝对错误减少了57.4%。这种性能差异突出了由于分配转移而导致的动态系统学习中的概括问题。我们确定可以发生分布变化的两种情况:数据域的变化和参数域(系统动力学)的变化。通过对几个动态系统的系统实验,我们发现深度学习模型在两种情况下都无法很好地预测。尽管对分销转变的大量研究集中在数据域的变化上,但我们的工作呼吁重新考虑学习动态系统的概括。
How can we learn a dynamical system to make forecasts, when some variables are unobserved? For instance, in COVID-19, we want to forecast the number of infected and death cases but we do not know the count of susceptible and exposed people. While mechanics compartment models are widely used in epidemic modeling, data-driven models are emerging for disease forecasting. We first formalize the learning of physics-based models as AutoODE, which leverages automatic differentiation to estimate the model parameters. Through a benchmark study on COVID-19 forecasting, we notice that physics-based mechanistic models significantly outperform deep learning. Our method obtains a 57.4% reduction in mean absolute errors for 7-day ahead COVID-19 forecasting compared with the best deep learning competitor. Such performance differences highlight the generalization problem in dynamical system learning due to distribution shift. We identify two scenarios where distribution shift can occur: changes in data domain and changes in parameter domain (system dynamics). Through systematic experiments on several dynamical systems, we found that deep learning models fail to forecast well under both scenarios. While much research on distribution shift has focused on changes in the data domain, our work calls attention to rethink generalization for learning dynamical systems.