论文标题
市场冲击下的平台行为:基于模拟框架和基于加强学习的研究
Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study
论文作者
论文摘要
我们研究经济平台(例如,亚马逊,Uber Eats,Instacart)的行为,例如COVID-19锁定,以及对平台上施加的不同法规考虑的影响。为此,我们在动态的多周期环境中开发了平台经济的多代理健身环境,并可能发生经济冲击。买卖双方被建模为经济动机的代理商,选择是否支付相应的费用来使用该平台。我们将平台的问题作为部分可观察到的马尔可夫决策过程,并使用深度强化学习来建模其费用设定和匹配行为。我们考虑两种主要类型的监管框架:(1)税收政策和(2)平台费用限制,并提供广泛的模拟实验,以在最佳平台响应下表征监管权衡。我们的结果表明,尽管许多干预措施与精致的平台演员无效,但我们确定了一种特殊的法规 - 将费用确定为最佳的,最佳的预投入费用,同时仍允许平台选择如何匹配买方对卖方的需求,以促进效率,卖方,卖方的多样性和整体经济体系的弹性。
We study the behavior of an economic platform (e.g., Amazon, Uber Eats, Instacart) under shocks, such as COVID-19 lockdowns, and the effect of different regulation considerations imposed on a platform. To this end, we develop a multi-agent Gym environment of a platform economy in a dynamic, multi-period setting, with the possible occurrence of economic shocks. Buyers and sellers are modeled as economically-motivated agents, choosing whether or not to pay corresponding fees to use the platform. We formulate the platform's problem as a partially observable Markov decision process, and use deep reinforcement learning to model its fee setting and matching behavior. We consider two major types of regulation frameworks: (1) taxation policies and (2) platform fee restrictions, and offer extensive simulated experiments to characterize regulatory tradeoffs under optimal platform responses. Our results show that while many interventions are ineffective with a sophisticated platform actor, we identify a particular kind of regulation -- fixing fees to optimal, pre-shock fees while still allowing a platform to choose how to match buyer demands to sellers -- as promoting the efficiency, seller diversity, and resilience of the overall economic system.