论文标题
关于持续同源性的有效性
On the effectiveness of persistent homology
论文作者
论文摘要
持续的同源性(PH)是拓扑数据分析中最流行的方法之一。即使在许多不同类型的应用中都使用了pH,但其成功背后的原因仍然难以捉摸。特别是,尚不知道哪种问题最有效,或者在多大程度上可以检测几何或拓扑特征。这项工作的目的是确定与数据分析相比,pH性能良好甚至更好的某些类型的问题。我们考虑三个基本形状分析任务:从形状采样的2D和3D点云中检测孔的数量,曲率和凸度。实验表明,pH在这些任务中取得了成功,超过了几个基线,包括PointNet,这是一个精确地受到点云的属性启发的体系结构。此外,我们观察到pH对于有限的计算资源和有限的培训数据以及分布外测试数据,包括各种数据转换和噪声,仍然有效。为了进行凸度检测,我们提供了理论上的保证,即pH在$ \ mathbb {r}^d $中对此任务有效,并演示了在植物叶图像的Flavia数据集上检测凸度度量的。由于形状分类在理解数学和物理结构和对象中的关键作用,在许多应用中,这项工作的发现将提供一些有关适合pH的问题类型的知识,以便它可以 - 可以 - 借用Wigner 1960-``从Wigner 1960-``在未来的研究中都保持有效,并扩展到我们的愉悦,但要对我们的愉悦进行愉悦”,但对我们较少的应用程序进行了多样的应用程序。
Persistent homology (PH) is one of the most popular methods in Topological Data Analysis. Even though PH has been used in many different types of applications, the reasons behind its success remain elusive; in particular, it is not known for which classes of problems it is most effective, or to what extent it can detect geometric or topological features. The goal of this work is to identify some types of problems where PH performs well or even better than other methods in data analysis. We consider three fundamental shape analysis tasks: the detection of the number of holes, curvature and convexity from 2D and 3D point clouds sampled from shapes. Experiments demonstrate that PH is successful in these tasks, outperforming several baselines, including PointNet, an architecture inspired precisely by the properties of point clouds. In addition, we observe that PH remains effective for limited computational resources and limited training data, as well as out-of-distribution test data, including various data transformations and noise. For convexity detection, we provide a theoretical guarantee that PH is effective for this task in $\mathbb{R}^d$, and demonstrate the detection of a convexity measure on the FLAVIA data set of plant leaf images. Due to the crucial role of shape classification in understanding mathematical and physical structures and objects, and in many applications, the findings of this work will provide some knowledge about the types of problems that are appropriate for PH, so that it can - to borrow the words from Wigner 1960 - ``remain valid in future research, and extend, to our pleasure", but to our lesser bafflement, to a variety of applications.