论文标题
Lorentz Group Equivariant自动编码器
Lorentz group equivariant autoencoders
论文作者
论文摘要
最近在开发机器学习(ML)模型(HEP)中进行了针对分类,仿真和异常检测等任务的重要工作。通常,这些模型是根据计算机视觉或自然语言处理中为数据集设计的,这些模型缺乏适合HEP数据的归纳偏见,例如对其固有的对称性的等效性。这些偏见已被证明可以使模型更具性能和可解释,并减少所需的培训数据量。为此,我们开发了Lorentz Group AutoCoder(LGAE),这是一种自动编码器模型,就适当的正直的Lorentz Group $ \ mathrm {so}^+(3,1)$而言,其潜在空间生活在该组的代表中。我们在LHC的喷气机上介绍了架构和几个实验结果,并在几种压缩,重建和异常检测指标上发现它优于图形和卷积神经网络基线模型。我们还展示了这种模型模型在分析自动编码器的潜在空间方面的优势,该模型可以改善此类ML模型发现的潜在异常的解释性。
There has been significant work recently in developing machine learning (ML) models in high energy physics (HEP) for tasks such as classification, simulation, and anomaly detection. Often these models are adapted from those designed for datasets in computer vision or natural language processing, which lack inductive biases suited to HEP data, such as equivariance to its inherent symmetries. Such biases have been shown to make models more performant and interpretable, and reduce the amount of training data needed. To that end, we develop the Lorentz group autoencoder (LGAE), an autoencoder model equivariant with respect to the proper, orthochronous Lorentz group $\mathrm{SO}^+(3,1)$, with a latent space living in the representations of the group. We present our architecture and several experimental results on jets at the LHC and find it outperforms graph and convolutional neural network baseline models on several compression, reconstruction, and anomaly detection metrics. We also demonstrate the advantage of such an equivariant model in analyzing the latent space of the autoencoder, which can improve the explainability of potential anomalies discovered by such ML models.