论文标题
通过与随机奇异值分解的连接对草图和项目的尖锐分析
Sharp Analysis of Sketch-and-Project Methods via a Connection to Randomized Singular Value Decomposition
论文作者
论文摘要
素描和项目是一个框架,它统一了许多用于求解线性系统及其变体的迭代方法,以及对非线性优化问题的进一步扩展。它包括流行的方法,例如随机kaczmarz,坐标下降,凸优化的牛顿方法的变体等。在本文中,我们开发了一个理论框架,以根据草图和项目方法的收敛速度获得敏锐的保证。我们的方法是:(1)表明收敛速率至少与草图尺寸线性提高,并且当数据矩阵表现出某些光谱衰减时甚至更快。 (2)允许稀疏的草图矩阵,比密集的草图更有效,比子采样方法更强大。特别是,我们的结果解释了一种观察到的现象,即素描矩阵的根本分物不影响素描和项目的Per迭代收敛速率。为了获得我们的结果,我们为预期的素描投影矩阵开发了新的非反应光谱范围,这些矩阵具有独立的关注;我们在迭代素描和项目求解器的收敛速率与随机奇异值分解的近似误差之间建立了连接,这是一种广泛使用的一击素描算法,用于低级别近似值。我们的实验支持该理论,并证明即使极稀疏的草图也表现出我们框架预测的收敛性。
Sketch-and-project is a framework which unifies many known iterative methods for solving linear systems and their variants, as well as further extensions to non-linear optimization problems. It includes popular methods such as randomized Kaczmarz, coordinate descent, variants of the Newton method in convex optimization, and others. In this paper, we develop a theoretical framework for obtaining sharp guarantees on the convergence rate of sketch-and-project methods. Our approach is the first to: (1) show that the convergence rate improves at least linearly with the sketch size, and even faster when the data matrix exhibits certain spectral decays; and (2) allow for sparse sketching matrices, which are more efficient than dense sketches and more robust than sub-sampling methods. In particular, our results explain an observed phenomenon that a radical sparsification of the sketching matrix does not affect the per iteration convergence rate of sketch-and-project. To obtain our results, we develop new non-asymptotic spectral bounds for the expected sketched projection matrix, which are of independent interest; and we establish a connection between the convergence rates of iterative sketch-and-project solvers and the approximation error of randomized singular value decomposition, which is a widely used one-shot sketching algorithm for low-rank approximation. Our experiments support the theory and demonstrate that even extremely sparse sketches exhibit the convergence properties predicted by our framework.