论文标题

Twitter讨论和情感有关Covid-19的大流行:一种机器学习方法

Twitter discussions and emotions about COVID-19 pandemic: a machine learning approach

论文作者

Xue, Jia, Chen, Junxiang, Hu, Ran, Chen, Chen, Zheng, ChengDa, Liu, Xiaoqian, Zhu, Tingshao

论文摘要

该研究的目的是检查冠状病毒病(COVID-19)与Twitter用户发布的推文所产生的相关讨论,关注和情感。 We analyze 4 million Twitter messages related to the COVID-19 pandemic using a list of 25 hashtags such as "coronavirus," "COVID-19," "quarantine" from March 1 to April 21 in 2020. We use a machine learning approach, Latent Dirichlet Allocation (LDA), to identify popular unigram, bigrams, salient topics and themes, and sentiments in the collected Tweets.流行的摘要包括“病毒”,“锁定”和“隔离”。受欢迎的Bigrams包括“ Covid-19”,“待在家里”,“电晕病毒”,“社会疏远”和“新病例”。我们确定了13个讨论主题,并将其分为五个不同的主题,例如“公共卫生措施,减缓COVID-19的传播”,“与Covid-19”,“冠状病毒新闻案例和死亡相关的社会污名”,“美国的Covid-19美国的Covid-19”,在全世界的其余案例中。在所有确定的主题中,冠状病毒传播的主要情感是可以采取的措施,随后是对不同主题的信任,愤怒和恐惧的混合感。当公众讨论冠状病毒新病例和死亡时,公众比其他话题揭示了一种重大的恐惧感。研究表明,可以通过研究Covid-19期间不断发展的公众讨论和情感来利用Twitter数据和机器学习方法进行临时人工学研究。对Twitter讨论和关注的实时监控和评估对于公共卫生紧急响应和计划可能是有希望的。当发生第二波Covid-19或即将出现的大流行的新激增时,已经出现了大流行的恐惧,污名和心理健康问题,可能会继续影响公众的信任。

The objective of the study is to examine coronavirus disease (COVID-19) related discussions, concerns, and sentiments that emerged from tweets posted by Twitter users. We analyze 4 million Twitter messages related to the COVID-19 pandemic using a list of 25 hashtags such as "coronavirus," "COVID-19," "quarantine" from March 1 to April 21 in 2020. We use a machine learning approach, Latent Dirichlet Allocation (LDA), to identify popular unigram, bigrams, salient topics and themes, and sentiments in the collected Tweets. Popular unigrams include "virus," "lockdown," and "quarantine." Popular bigrams include "COVID-19," "stay home," "corona virus," "social distancing," and "new cases." We identify 13 discussion topics and categorize them into five different themes, such as "public health measures to slow the spread of COVID-19," "social stigma associated with COVID-19," "coronavirus news cases and deaths," "COVID-19 in the United States," and "coronavirus cases in the rest of the world". Across all identified topics, the dominant sentiments for the spread of coronavirus are anticipation that measures that can be taken, followed by a mixed feeling of trust, anger, and fear for different topics. The public reveals a significant feeling of fear when they discuss the coronavirus new cases and deaths than other topics. The study shows that Twitter data and machine learning approaches can be leveraged for infodemiology study by studying the evolving public discussions and sentiments during the COVID-19. Real-time monitoring and assessment of the Twitter discussion and concerns can be promising for public health emergency responses and planning. Already emerged pandemic fear, stigma, and mental health concerns may continue to influence public trust when there occurs a second wave of COVID-19 or a new surge of the imminent pandemic.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源