论文标题
物理学指导的机器学习预测:朝着可解释性和概括性
Physics-guided machine learning for wind-farm power prediction: Toward interpretability and generalizability
论文作者
论文摘要
随着来自模拟和实验的可用数据量增加,针对数据驱动模型的风向功率预测的研究已大大增加。尽管数据驱动的模型可以成功预测具有与训练集合的风电场的功能,但与基于物理学的模型相比,它们通常没有高度的外推以未见病例的灵活性。在本文中,我们专注于具有改进的可解释性和可推广性水平的数据驱动模型,这些模型可以预测风电场中涡轮机的性能。为了准备数据集,根据操作风电场的布局定义了几种情况,并进行了大量的计算流体动力学模拟。后来使用了极端梯度提升算法来构建模型,该模型具有涡轮级的几何输入,并结合了基于物理模型的效率作为特征。在训练之后,为了分析模型的概括能力,将它们对具有不同操作条件,流入湍流水平和风网布线的看不见病例的预测与公园模型和经验分析高斯唤醒模型进行了比较。结果表明,物理引导的机器学习模型的表现优于两个基于物理的模型,显示出高度的概括性,并且该机器对基于物理学的指南模型的选择不敏感。
With the increasing amount of available data from simulations and experiments, research for the development of data-driven models for wind-farm power prediction has increased significantly. While the data-driven models can successfully predict the power of a wind farm with similar characteristics as those in the training ensemble, they generally do not have a high degree of flexibility for extrapolation to unseen cases in contrast to the physics-based models. In this paper, we focus on data-driven models with improved interpretability and generalizability levels that can predict the performance of turbines in wind farms. To prepare the datasets, several cases are defined based on the layouts of operational wind farms, and massive computational fluid dynamics simulations are performed. The extreme gradient boosting algorithm is used afterward to build models, which have turbine-level geometric inputs in combination with the efficiency from physics-based models as the features. After training, to analyze the models' capability in generalization, their predictions for the unseen cases with different operating conditions, inflow turbulence levels, and wind-farm layouts are compared to the Park model and an empirical-analytical Gaussian wake model. Results show that the physics-guided machine-learning models outperform both physics-based models showing a high degree of generalizability, and the machine is not sensitive to the choice of the physics-based guide model.