论文标题
最佳响应动力学的效率
The Efficiency of Best-Response Dynamics
论文作者
论文摘要
最佳响应(BR)动力学是一种自然方法,玩家通过本地搜索方法朝着纯粹的NASH平衡前进。达到的平衡质量可能在很大程度上取决于选择玩家以执行最佳响应动作的顺序。 {\ em veatiator规则} $ s $是选择下一个偏差播放器的方法。我们提供了量化不同偏差规则的性能的措施。在所有初始配置文件$ p $中,{\ em效率低下}是最大比率,与$ p $的最差平衡的社会成本与$ p $的社会成本达到$ s $之间,从$ p $达到最佳平衡的社会成本。这种低效率始终位于$ 1 $和无政府状态的{\ em价格}之间。 我们研究了网络编队游戏和工作调度游戏中各种偏差规则的效率低下(两者都是拥堵游戏,BR Dynamics总是会收敛到纯NE)。对于某些类型的游戏,我们计算最佳偏差规则。此外,我们定义并研究了一类新的偏离规则,称为{\ em local}偏离者规则。这样的规则选择下一个偏离者作为一组限制参数的函数,并满足一种称为{\ em独立性无关玩家的独立性的自然条件}。我们介绍了一些本地偏离者规则的效率低下的上限,还表明,对于某些类别的游戏,没有本地偏离者规则可以保证低效率低于无政府状态的价格。
Best response (BR) dynamics is a natural method by which players proceed toward a pure Nash equilibrium via a local search method. The quality of the equilibrium reached may depend heavily on the order by which players are chosen to perform their best response moves. A {\em deviator rule} $S$ is a method for selecting the next deviating player. We provide a measure for quantifying the performance of different deviator rules. The {\em inefficiency} of a deviator rule $S$ is the maximum ratio, over all initial profiles $p$, between the social cost of the worst equilibrium reachable by $S$ from $p$ and the social cost of the best equilibrium reachable from $p$. This inefficiency always lies between $1$ and the {\em price of anarchy}. We study the inefficiency of various deviator rules in network formation games and job scheduling games (both are congestion games, where BR dynamics always converges to a pure NE). For some classes of games, we compute optimal deviator rules. Furthermore, we define and study a new class of deviator rules, called {\em local} deviator rules. Such rules choose the next deviator as a function of a restricted set of parameters, and satisfy a natural condition called {\em independence of irrelevant players}. We present upper bounds on the inefficiency of some local deviator rules, and also show that for some classes of games, no local deviator rule can guarantee inefficiency lower than the price of anarchy.