论文标题
马尔可夫和非马克维亚流程具有积极的决策策略来解决COVID-19
Markovian And Non-Markovian Processes with Active Decision Making Strategies For Addressing The COVID-19 Pandemic
论文作者
论文摘要
我们从5月1日至8月31日的美国六个州研究和预测COVID-19的演变,并在部分观察到的马尔可夫决策过程的广泛框架内,根据最大程度地减少损失函数的基础,规定了主动干预策略,例如锁定策略。对于每个州,分析了190天的COVID-19数据(从5月1日起两个北部各州和6月1日的南部州),以估计隔室与与流行病进化相关的其他参数之间的过渡概率。然后,这些数量用于预测未来50天(测试期)在各种政策分配下的流行病的过程,从而导致损失函数在训练范围内的不同值。最佳政策分配是对应最小损失的政策。我们的分析表明,在测试期间,六个州都不需要锁定,尽管不需要谨慎解释没有锁定处方:当然需要继续进行负责任的掩盖使用和社交距离。详细讨论了这类流行病传播所涉及的警告。非马克维亚共同传播(以及更一般的流行传播)的非马克维亚表格的草图被视为该领域未来研究的有吸引力的途径。
We study and predict the evolution of Covid-19 in six US states from the period May 1 through August 31 using a discrete compartment-based model and prescribe active intervention policies, like lockdowns, on the basis of minimizing a loss function, within the broad framework of partially observed Markov decision processes. For each state, Covid-19 data for 40 days (starting from May 1 for two northern states and June 1 for four southern states) are analyzed to estimate the transition probabilities between compartments and other parameters associated with the evolution of the epidemic. These quantities are then used to predict the course of the epidemic in the given state for the next 50 days (test period) under various policy allocations, leading to different values of the loss function over the training horizon. The optimal policy allocation is the one corresponding to the smallest loss. Our analysis shows that none of the six states need lockdowns over the test period, though the no lockdown prescription is to be interpreted with caution: responsible mask use and social distancing of course need to be continued. The caveats involved in modeling epidemic propagation of this sort are discussed at length. A sketch of a non-Markovian formulation of Covid-19 propagation (and more general epidemic propagation) is presented as an attractive avenue for future research in this area.