论文标题
什么是随机测试?
What is a randomization test?
论文作者
论文摘要
上个世纪,随机测试的含义在统计教育和实践中变得晦涩难懂。本文详细尝试纠正这种统计的核心概念。引入了一个新的术语“准随机化测试”,以根据理论模型来定义显着性测试,并根据随机的物理行为将这些测试与“随机测试”区分开。这种区别的实际重要性是通过一个真正的跨跨叶之间的群集随机试验来说明的。在最新的随机推断文献的基础上,开发了条件随机测试的一般框架,并给出了一些实用的构建条件事件的方法。然后,应用了拟议的术语和框架,以了解几种广泛使用的(准)随机测试,包括Fisher的精确测试,治疗效果的排列测试,准随机测试的独立性和有条件的独立性,适应性随机化和合成预测。
The meaning of randomization tests has become obscure in statistics education and practice over the last century. This article makes a fresh attempt at rectifying this core concept of statistics. A new term -- "quasi-randomization test" -- is introduced to define significance tests based on theoretical models and distinguish these tests from the "randomization tests" based on the physical act of randomization. The practical importance of this distinction is illustrated through a real stepped-wedge cluster-randomized trial. Building on the recent literature of randomization inference, a general framework of conditional randomization tests is developed and some practical methods to construct conditioning events are given. The proposed terminology and framework are then applied to understand several widely used (quasi-)randomization tests, including Fisher's exact test, permutation tests for treatment effect, quasi-randomization tests for independence and conditional independence, adaptive randomization, and conformal prediction.