论文标题
PDE的各向异性,稀疏和可解释的物理信息神经网络
Anisotropic, Sparse and Interpretable Physics-Informed Neural Networks for PDEs
论文作者
论文摘要
对使用深神经网络(DNN)来求解部分微分方程(PDES)的兴趣越来越大。尽管承诺这种方法成立,但在各个方面都可以改善它们。这样的两个缺点是(i)相对于经典数值方法,它们的计算效率低下,以及(ii)训练有素的DNN模型的非解剖性。在这项工作中,我们介绍了Aspinn,这是我们早期工作的各向异性扩展,称为Spinn-Sparse,物理信息和可解释的神经网络 - 解决解决这两个问题的PDE。 AspinnS概括了径向基函数网络。我们证明,使用涉及椭圆形和双曲线PDE的各种示例表明,我们提出的特殊体系结构比通用DNN更有效,而同时也可以直接解释。此外,由于每个节点的局部影响区域的各向异性,使用Aspinn比使用Spinn捕获溶液所需的较少的节点需要更少的节点。阿斯平的可解释性转化为对其重量和偏见的现成可视化,从而对训练有素的模型的性质有了更多的了解。反过来,这提供了一个系统的过程,可以根据计算解决方案的质量改进体系结构。因此,Aspinns是经典数值算法和基于现代DNN的求解PDE之间的有效桥梁。在此过程中,我们还将Aspinns的培训简化为更接近监督学习算法的形式。
There has been a growing interest in the use of Deep Neural Networks (DNNs) to solve Partial Differential Equations (PDEs). Despite the promise that such approaches hold, there are various aspects where they could be improved. Two such shortcomings are (i) their computational inefficiency relative to classical numerical methods, and (ii) the non-interpretability of a trained DNN model. In this work we present ASPINN, an anisotropic extension of our earlier work called SPINN--Sparse, Physics-informed, and Interpretable Neural Networks--to solve PDEs that addresses both these issues. ASPINNs generalize radial basis function networks. We demonstrate using a variety of examples involving elliptic and hyperbolic PDEs that the special architecture we propose is more efficient than generic DNNs, while at the same time being directly interpretable. Further, they improve upon the SPINN models we proposed earlier in that fewer nodes are require to capture the solution using ASPINN than using SPINN, thanks to the anisotropy of the local zones of influence of each node. The interpretability of ASPINN translates to a ready visualization of their weights and biases, thereby yielding more insight into the nature of the trained model. This in turn provides a systematic procedure to improve the architecture based on the quality of the computed solution. ASPINNs thus serve as an effective bridge between classical numerical algorithms and modern DNN based methods to solve PDEs. In the process, we also streamline the training of ASPINNs into a form that is closer to that of supervised learning algorithms.