论文标题

统一:统一的政策设计框架,用于解决机器学习的约束优化问题

UNIFY: a Unified Policy Designing Framework for Solving Constrained Optimization Problems with Machine Learning

论文作者

Silvestri, Mattia, De Filippo, Allegra, Lombardi, Michele, Milano, Michela

论文摘要

机器学习(ML)和受约束优化(CO)之间的相互作用最近成为兴趣日益增加的主题,导致了新的多产的研究领域涵盖(例如)以决策为中心的学习和约束强化学习。这种方法努力解决多个阶段的不确定性下的复杂决策问题,涉及显式(成本函数,约束)和隐性知识(来自数据),并可能受到执行时间限制。尽管已经取得了良好的成功,但现有方法在适用性和有效性方面仍然存在局限性。对于此类问题,我们提出了Unify,这是为复杂决策问题设计解决方案政策的统一框架。我们的方法依赖于两个阶段的巧妙分解,即无约束的ML模型和CO问题,以利用每种方法的强度,同时弥补其弱点。通过少量设计工作,Unify可以概括几种现有方法,从而扩展其适用性。我们证明了两个实际问题的方法有效性,即能量管理系统和具有随机覆盖要求的设置多覆盖。最后,我们重点介绍了我们方法和未来研究方向的一些当前挑战,这些挑战可以受益于这两个领域的交叉利用。

The interplay between Machine Learning (ML) and Constrained Optimization (CO) has recently been the subject of increasing interest, leading to a new and prolific research area covering (e.g.) Decision Focused Learning and Constrained Reinforcement Learning. Such approaches strive to tackle complex decision problems under uncertainty over multiple stages, involving both explicit (cost function, constraints) and implicit knowledge (from data), and possibly subject to execution time restrictions. While a good degree of success has been achieved, the existing methods still have limitations in terms of both applicability and effectiveness. For problems in this class, we propose UNIFY, a unified framework to design a solution policy for complex decision-making problems. Our approach relies on a clever decomposition of the policy in two stages, namely an unconstrained ML model and a CO problem, to take advantage of the strength of each approach while compensating for its weaknesses. With a little design effort, UNIFY can generalize several existing approaches, thus extending their applicability. We demonstrate the method effectiveness on two practical problems, namely an Energy Management System and the Set Multi-cover with stochastic coverage requirements. Finally, we highlight some current challenges of our method and future research directions that can benefit from the cross-fertilization of the two fields.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源