论文标题
分配目标的治疗建议
Treatment recommendation with distributional targets
论文作者
论文摘要
我们研究了一个决策者的问题,他必须根据实验提供最佳的治疗建议。通过策略建议产生的结果分布的可取性是通过功能性捕获决策者有兴趣优化的分配特征来衡量的。例如,这可能是其固有的不平等,福利,贫困程度或与所需结果分布的距离。如果感兴趣的功能不是准凸的,或者存在约束,则最佳建议可能是治疗的混合物。这大大扩展了必须考虑的一组建议。我们通过获得最大的预期后悔下限来表征问题的困难。此外,我们提出了两个(近)遗憾的政策。第一个策略是静态的,因此适用于在实验阶段依次到达的受试者。第二个政策可以利用受试者连续消除劣等处理,从而在最需要的情况下花费抽样工作来依次到达。
We study the problem of a decision maker who must provide the best possible treatment recommendation based on an experiment. The desirability of the outcome distribution resulting from the policy recommendation is measured through a functional capturing the distributional characteristic that the decision maker is interested in optimizing. This could be, e.g., its inherent inequality, welfare, level of poverty or its distance to a desired outcome distribution. If the functional of interest is not quasi-convex or if there are constraints, the optimal recommendation may be a mixture of treatments. This vastly expands the set of recommendations that must be considered. We characterize the difficulty of the problem by obtaining maximal expected regret lower bounds. Furthermore, we propose two (near) regret-optimal policies. The first policy is static and thus applicable irrespectively of subjects arriving sequentially or not in the course of the experimentation phase. The second policy can utilize that subjects arrive sequentially by successively eliminating inferior treatments and thus spends the sampling effort where it is most needed.