论文标题

运行受控众包实验的挑战和策略

Challenges and strategies for running controlled crowdsourcing experiments

论文作者

Ramírez, Jorge, Baez, Marcos, Casati, Fabio, Cernuzzi, Luca, Benatallah, Boualem

论文摘要

本文报告了我们在众包平台上进行受控实验时所学到的挑战和教训。众包正在成为一种有吸引力的技术,可以参与实验研究中的多样化和大量主题,使研究人员能够达到规模和完成时间水平,否则在实验室环境中是不可行的。但是,规模和灵活性是以偏见的多个偏差来源以及众包平台的技术限制以及“野外”运行受控实验的挑战所产生的混杂因素的成本。在本文中,我们以对任务设计进行系统评估的经验为探索,描述和量化不受控制的众包实验的潜在影响并得出可能的应对策略的潜在影响。在确定的挑战中,我们可以提及抽样偏见,控制对实验条件的分配,学习效果以及众包结果的可靠性。根据我们的实证研究,潜在的偏见和混杂因素的影响可能相当于在不受控制的环境中收集的数据的效用38 \%;它可以显着改变实验的结果。这些问题最终激发了我们实施CrowdHub,该系统位于主要的众包平台之上,并使研究人员和从业人员可以运行受控的众包项目。

This paper reports on the challenges and lessons we learned while running controlled experiments in crowdsourcing platforms. Crowdsourcing is becoming an attractive technique to engage a diverse and large pool of subjects in experimental research, allowing researchers to achieve levels of scale and completion times that would otherwise not be feasible in lab settings. However, the scale and flexibility comes at the cost of multiple and sometimes unknown sources of bias and confounding factors that arise from technical limitations of crowdsourcing platforms and from the challenges of running controlled experiments in the "wild". In this paper, we take our experience in running systematic evaluations of task design as a motivating example to explore, describe, and quantify the potential impact of running uncontrolled crowdsourcing experiments and derive possible coping strategies. Among the challenges identified, we can mention sampling bias, controlling the assignment of subjects to experimental conditions, learning effects, and reliability of crowdsourcing results. According to our empirical studies, the impact of potential biases and confounding factors can amount to a 38\% loss in the utility of the data collected in uncontrolled settings; and it can significantly change the outcome of experiments. These issues ultimately inspired us to implement CrowdHub, a system that sits on top of major crowdsourcing platforms and allows researchers and practitioners to run controlled crowdsourcing projects.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源