论文标题
具有多个内部最小值的双重双重方法
A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima
论文作者
论文摘要
双重优化发现了在现代机器学习问题中的广泛应用,例如超参数优化,神经架构搜索,元学习等。尽管具有独特的内部最小点(例如,内部功能是强烈的凸面)的二聚体问题是充分理解的,因此具有多个内部最小点的问题仍然是挑战和开放。为此问题设计的现有算法适用于限制情况,并且不能完全保证融合。在本文中,我们采用了双重优化的重新重新制定以限制优化,并通过原始的双二线优化(PDBO)算法解决了问题。 PDBO不仅解决了多个内部最小挑战,而且还具有完全一阶效率的情况,而无需涉及二阶Hessian和Jacobian计算,而不是大多数现有的基于梯度的二杆算法。我们进一步表征了PDBO的收敛速率,PDBO的收敛速率是与多个内部最小值的双光线优化的首个已知的非反应收敛保证。我们的实验证明了所提出的方法的期望性能。
Bilevel optimization has found extensive applications in modern machine learning problems such as hyperparameter optimization, neural architecture search, meta-learning, etc. While bilevel problems with a unique inner minimal point (e.g., where the inner function is strongly convex) are well understood, such a problem with multiple inner minimal points remains to be challenging and open. Existing algorithms designed for such a problem were applicable to restricted situations and do not come with a full guarantee of convergence. In this paper, we adopt a reformulation of bilevel optimization to constrained optimization, and solve the problem via a primal-dual bilevel optimization (PDBO) algorithm. PDBO not only addresses the multiple inner minima challenge, but also features fully first-order efficiency without involving second-order Hessian and Jacobian computations, as opposed to most existing gradient-based bilevel algorithms. We further characterize the convergence rate of PDBO, which serves as the first known non-asymptotic convergence guarantee for bilevel optimization with multiple inner minima. Our experiments demonstrate desired performance of the proposed approach.