论文标题
正规化剂估计量的随机优化
Stochastic Optimization for Regularized Wasserstein Estimators
论文作者
论文摘要
最佳传输是优化的基本问题,可以在考虑几何方面时比较概率分布。它的最佳客观价值,即Wasserstein距离,在整个机器学习和统计数据的许多应用中都使用了分布之间的重要损失。在此问题及其正则版本上的最新算法进度使这些工具越来越受欢迎。但是,现有技术需要解决优化问题以获得损失的单个梯度,从而减慢了一阶方法以最大程度地减少需要许多此类梯度计算的损失之和。在这项工作中,我们介绍了一种算法来解决Wasserstein估计器的正规化版本,每个步骤的时间是sublinear在问题的自然维度中。我们引入了双重公式,并通过随机梯度步骤对其进行优化,可以直接从样品中计算出,而无需在每个步骤上解决其他优化问题。这样做,共同执行估计和计算任务。我们表明,该算法可以扩展到其他任务,包括对Wasserstein Barycenters的估计。我们提供理论保证并通过合成数据实验说明算法的性能。
Optimal transport is a foundational problem in optimization, that allows to compare probability distributions while taking into account geometric aspects. Its optimal objective value, the Wasserstein distance, provides an important loss between distributions that has been used in many applications throughout machine learning and statistics. Recent algorithmic progress on this problem and its regularized versions have made these tools increasingly popular. However, existing techniques require solving an optimization problem to obtain a single gradient of the loss, thus slowing down first-order methods to minimize the sum of losses, that require many such gradient computations. In this work, we introduce an algorithm to solve a regularized version of this problem of Wasserstein estimators, with a time per step which is sublinear in the natural dimensions of the problem. We introduce a dual formulation, and optimize it with stochastic gradient steps that can be computed directly from samples, without solving additional optimization problems at each step. Doing so, the estimation and computation tasks are performed jointly. We show that this algorithm can be extended to other tasks, including estimation of Wasserstein barycenters. We provide theoretical guarantees and illustrate the performance of our algorithm with experiments on synthetic data.