论文标题
一个跨语性的自然语言处理框架,用于流行病管理
A Cross-lingual Natural Language Processing Framework for Infodemic Management
论文作者
论文摘要
COVID-19大流行对卫生系统施加了巨大的压力,卫生系统由于周围的错误信息而进一步紧张。在这种情况下,在正确的时间提供正确的信息至关重要。使用人工智能对信息传播的管理的需求不断增长。因此,我们利用了自然语言处理的潜力来识别需要在群众之间传播的相关信息。在这项工作中,我们提出了一个新颖的跨语性自然语言处理框架,以通过将每日新闻与世界卫生组织的信任准则相匹配,以提供相关信息。拟议的管道部署了NLP的各种技术,例如汇总者,单词嵌入和相似性指标,以向用户提供新闻文章以及相应的医疗保健指南。总共评估了36个模型,并将基于Lexrank的Summarizer与Word2Vec嵌入Word Mover -Mover距离度量标准的组合胜过所有其他模型。这种新型的开源方法可以用作与流行病相关的错误信息传播差异中主动传播相关医疗信息的模板。
The COVID-19 pandemic has put immense pressure on health systems which are further strained due to the misinformation surrounding it. Under such a situation, providing the right information at the right time is crucial. There is a growing demand for the management of information spread using Artificial Intelligence. Hence, we have exploited the potential of Natural Language Processing for identifying relevant information that needs to be disseminated amongst the masses. In this work, we present a novel Cross-lingual Natural Language Processing framework to provide relevant information by matching daily news with trusted guidelines from the World Health Organization. The proposed pipeline deploys various techniques of NLP such as summarizers, word embeddings, and similarity metrics to provide users with news articles along with a corresponding healthcare guideline. A total of 36 models were evaluated and a combination of LexRank based summarizer on Word2Vec embedding with Word Mover distance metric outperformed all other models. This novel open-source approach can be used as a template for proactive dissemination of relevant healthcare information in the midst of misinformation spread associated with epidemics.