论文标题
可证明嘈杂的稀疏子空间聚类,使用贪婪的邻居选择:基于连贯的透视图
Provable Noisy Sparse Subspace Clustering using Greedy Neighbor Selection: A Coherence-Based Perspective
论文作者
论文摘要
使用基于贪婪的邻居选择(例如匹配的追踪(MP)和正交匹配追踪(OMP)),稀疏子空间聚类(SSC)被称为传统基于L1的基于L1的方法的流行计算效率替代品。在确定性有限的噪声腐败下,在本文中,我们得出了基于连贯的足够条件,可确保使用MP/OMP正确识别正确的邻居识别。我们的分析利用两个噪声数据点之间的最大/最小内部产物,受噪声水平的已知上限。获得的足够条件清楚地揭示了噪声对基于贪婪的邻居恢复的影响。具体而言,它断言,只要噪声足够小,以使所得的扰动残差向量保持接近所需的子空间,MP和OMP都可以成功返回正确的邻居子集。一个惊人的发现是,当地面真相子空间彼此良好分离并且噪声不是大的,基于MP的迭代,同时享受较低的算法复杂性时,产生了较小的残留物扰动,从而更好地识别正确的邻居,并且可以识别出较高的全球数据聚集精度。广泛的数值实验用于证实我们的理论研究。
Sparse subspace clustering (SSC) using greedy-based neighbor selection, such as matching pursuit (MP) and orthogonal matching pursuit (OMP), has been known as a popular computationally-efficient alternative to the conventional L1-minimization based methods. Under deterministic bounded noise corruption, in this paper we derive coherence-based sufficient conditions guaranteeing correct neighbor identification using MP/OMP. Our analyses exploit the maximum/minimum inner product between two noisy data points subject to a known upper bound on the noise level. The obtained sufficient condition clearly reveals the impact of noise on greedy-based neighbor recovery. Specifically, it asserts that, as long as noise is sufficiently small so that the resultant perturbed residual vectors stay close to the desired subspace, both MP and OMP succeed in returning a correct neighbor subset. A striking finding is that, when the ground truth subspaces are well-separated from each other and noise is not large, MP-based iterations, while enjoying lower algorithmic complexity, yield smaller perturbation of residuals, thereby better able to identify correct neighbors and, in turn, achieving higher global data clustering accuracy. Extensive numerical experiments are used to corroborate our theoretical study.