论文标题
通过重复测量的多个评估者的总体协议
Overall Agreement for Multiple Raters with Replicated Measurements
论文作者
论文摘要
通常需要在实践中互换使用多个评估者进行测量或评估。在参与之前,必须通过协议指数评估这些多个评估者之间的协议。尽管诸如覆盖概率和总偏差指数之类的直觉上吸引人的协议指数以及覆盖率概率曲线的相对面积已被扩展,以评估多个评估者之间的总体一致性,但这些扩展程序具有局限性。现有的总体协议指数要么需要正态性和同质性假设,要么不保留最初针对两个评估者定义的指数的直观解释。在本文中,我们根据所有评估者之间的最大成对差异提出了一组新的总体协议指数。提出的新的总体覆盖概率,总体总偏差指数和总体覆盖概率曲线下的相对面积保留了成对版本的原始直觉解释。在没有做出任何分布假设的情况下,我们还提出了一个新的统一的非参数估计和推理方法,基于可以容纳同一评估者复制的通用估计方程的总体指数。在温和的假设下,提出的方差估计器被证明可以在独立的工作相关矩阵下实现效率。进行了不同情况下的仿真研究,以评估有或没有复制的建议估计和推断方法的性能。我们通过使用三名评估者对每个受试者进行三个复制者的评估者使用血压数据来说明方法。
Multiple raters are often needed to be used interchangeably in practice for measurement or evaluation. Assessing agreement among these multiple raters via agreement indices are necessary before their participation. While the intuitively appealing agreement indices such as coverage probability and total deviation index, and relative area under coverage probability curve, have been extended for assessing overall agreement among multiple raters, these extensions have limitations. The existing overall agreement indices either require normality and homogeneity assumptions or did not preserve the intuitive interpretation of the indices originally defined for two raters. In this paper, we propose a new set of overall agreement indices based on maximum pairwise differences among all raters. The proposed new overall coverage probability, overall total deviation index and relative area under overall coverage probability curve retain the original intuitive interpretation from the pairwise version. Without making any distributional assumption, we also propose a new unified nonparametric estimation and inference approach for the overall indices based on generalized estimating equations that can accommodate replications made by the same rater. Under mild assumptions, the proposed variance estimator is shown to achieve efficiency bound under independent working correlation matrix. Simulation studies under different scenarios are conducted to assess the performance of the proposed estimation and inference approach with and without replications. We illustrate the methodology by using a blood pressure data with three raters who made three replications on each subjects.