论文标题

谁会留下?利用深度学习来预测公民科学家的参与

Who will stay? Using Deep Learning to predict engagement of citizen scientists

论文作者

Semenov, Alexander, Zhang, Yixin, Ponti, Marisa

论文摘要

由于气候变化所带来的威胁和填补知识空白的资源有限,应考虑公民科学和机器学习来监视沿海和海洋环境。使用瑞典海洋项目中公民科学家的注释活动的数据,我们构建了深层神经网络模型,以预测即将到来的参与。我们测试了模型以识别注释参与中的模式。根据结果​​,可以预测注释者是否会在以后的会议中保持活跃。根据个人公民科学项目的目标,也可能有必要确定那些将离开的志愿者或那些将继续注释的志愿者。这可以通过改变预测的阈值来预测。用于构建模型的参与度指标是基于时间和活动的,可用于推断志愿者的潜在特征,并根据其活动模式预测其任务兴趣。他们可以估计,志愿者是否可以在一定时间内完成给定数量的任务,尽早确定谁可能成为最高贡献者或确定谁可能退出并为他们提供有针对性的干预措施。我们预测模型的新颖性在于使用深神网络和志愿者注释的顺序。我们模型的一个限制是,他们不像许多推荐系统那样使用从用户配置文件构建的嵌入作为输入数据。我们希望包括用户配置文件将提高预测性能。

Citizen science and machine learning should be considered for monitoring the coastal and ocean environment due to the scale of threats posed by climate change and the limited resources to fill knowledge gaps. Using data from the annotation activity of citizen scientists in a Swedish marine project, we constructed Deep Neural Network models to predict forthcoming engagement. We tested the models to identify patterns in annotation engagement. Based on the results, it is possible to predict whether an annotator will remain active in future sessions. Depending on the goals of individual citizen science projects, it may also be necessary to identify either those volunteers who will leave or those who will continue annotating. This can be predicted by varying the threshold for the prediction. The engagement metrics used to construct the models are based on time and activity and can be used to infer latent characteristics of volunteers and predict their task interest based on their activity patterns. They can estimate if volunteers can accomplish a given number of tasks in a certain amount of time, identify early on who is likely to become a top contributor or identify who is likely to quit and provide them with targeted interventions. The novelty of our predictive models lies in the use of Deep Neural Networks and the sequence of volunteer annotations. A limitation of our models is that they do not use embeddings constructed from user profiles as input data, as many recommender systems do. We expect that including user profiles would improve prediction performance.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源