论文标题
你不拒绝p值
Thou Shalt Not Reject the P-value
论文作者
论文摘要
自18世纪首次亮相以来,P值一直是基于假设检验的科学发现的重要组成部分。随着统计引擎的加速,问题开始提出,询问基于p值的科学发现在多大程度上是可靠的,并且越来越多地听到了调整显着性水平或禁止P值的声音。受这些问题和讨论的启发,我们在这里询问了p值在科学研究中的有用作用和滥用。对于常见的滥用和误解,我们为从业者提供了适度的建议。此外,我们将统计显着性与临床相关性进行了比较。同时,我们回顾了寻求证据的贝叶斯替代方案。最后,我们讨论了使用荟萃分析从多个研究到总证据的池价值的承诺和风险。综上所述,P值支持一个有用的概率决策系统,并以连续规模提供证据。但是,考虑到科学问题,实验设计(包括模型规范,样本量和显着性水平),统计能力,效果大小和可重复性,其解释必须是上下文。
Since its debut in the 18th century, the P-value has been an important part of hypothesis testing-based scientific discoveries. As the statistical engine accelerates, questions are beginning to be raised, asking to what extent scientific discoveries based on P-values are reliable and reproducible, and the voice calling for adjusting the significance level or banning the P-value has been increasingly heard. Inspired by these questions and discussions, here we enquire into the useful roles and misuses of the P-value in scientific studies. For common misuses and misinterpretations, we provide modest recommendations for practitioners. Additionally, we compare statistical significance with clinical relevance. In parallel, we review the Bayesian alternatives for seeking evidence. Finally, we discuss the promises and risks of using meta-analysis to pool P-values from multiple studies to aggregate evidence. Taken together, the P-value underpins a useful probabilistic decision-making system and provides evidence at a continuous scale. But its interpretation must be contextual, considering the scientific question, experimental design (including the model specification, sample size, and significance level), statistical power, effect size, and reproducibility.