Paper Title

Comparing the Results of Replications in Software Engineering

Authors

Adrian Santos, Sira Vegas, Markku Oivo, Natalia Juristo

Abstract

Context: It has been argued that software engineering replications are useful for verifying the results of previous experiments. However, it has not yet been agreed how to check whether the results hold across replications. Besides, some authors suggest that replications that do not verify the results of previous experiments can be used to identify contextual variables causing the discrepancies.

Objective: Study how to assess the (dis)similarity of the results of SE replications when they are compared to verify the results of previous experiments and understand how to identify whether contextual variables are influencing results.

Method: We run simulations to learn how different ways of comparing replication results behave when verifying the results of previous experiments. We illustrate how to deal with context-induced changes. To do this, we analyze three groups of replications from our own research on test-driven development and testing techniques.

Results: The direct comparison of p-values and effect sizes does not appear to be suitable for verifying the results of previous experiments and examining the variables possibly affecting the results in software engineering. Analytical methods such as meta-analysis should be used to assess the similarity of software engineering replication results and identify discrepancies in results.

Conclusion: The results achieved in baseline experiments should no longer be regarded as a result that needs to be reproduced, but as a small piece of evidence within a larger picture that only emerges after assembling many small pieces to complete the puzzle.
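The abstract recommends meta-analysis over direct comparison of p-values or effect sizes. As a rough illustration of what that can look like in practice, the following is a minimal sketch of a fixed-effect, inverse-variance meta-analysis with Cochran's Q and the I² heterogeneity statistic. It is not the authors' analysis procedure: the function name `pool_replications`, the choice of a fixed-effect model, and the effect sizes in the usage example are all assumptions made here for illustration.

```python
import math


def pool_replications(effects, variances):
    """Fixed-effect inverse-variance pooling of per-replication effect sizes.

    effects   -- one effect size per replication (e.g. standardized mean differences)
    variances -- the sampling variance of each effect size
    Returns the pooled estimate, its standard error, a 95% confidence
    interval (normal approximation), Cochran's Q, its degrees of freedom, and I^2.
    """
    if len(effects) != len(variances) or len(effects) < 2:
        raise ValueError("need at least two replications with matching variances")

    # Each replication is weighted by the inverse of its sampling variance.
    weights = [1.0 / v for v in variances]
    total_weight = sum(weights)

    pooled = sum(w * d for w, d in zip(weights, effects)) / total_weight
    se = math.sqrt(1.0 / total_weight)

    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (d - pooled) ** 2 for w, d in zip(weights, effects))
    df = len(effects) - 1

    # I^2: share of total variation attributed to between-replication heterogeneity.
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0

    return {
        "pooled": pooled,
        "se": se,
        "ci95": (pooled - 1.96 * se, pooled + 1.96 * se),
        "Q": q,
        "df": df,
        "I2": i2,
    }


if __name__ == "__main__":
    # Hypothetical effect sizes and variances, NOT taken from the paper.
    result = pool_replications([0.35, 0.10, -0.05], [0.04, 0.05, 0.06])
    for name, value in result.items():
        print(name, value)
```

Under this kind of analysis, a baseline experiment and its replications each contribute one weighted estimate to the pooled result, and a large I² would be a prompt to look for contextual variables rather than a verdict that a replication "failed". A random-effects model would be the natural alternative if heterogeneity between replications is expected.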
