论文标题
关于自然语言处理深层模型的解释性
On the Explainability of Natural Language Processing Deep Models
论文作者
论文摘要
尽管最近在深层模型上进行了关于图像和表格数据的深层模型的爆炸爆炸,但文本数据集对Exai社区提出了新的挑战。此类挑战可以归因于文本数据中缺乏输入结构,使用嵌入的单词嵌入,从而增加了模型的不透明度以及在对文本数据进行培训时,深层模型内部工作的难度。 最近,已经开发了解决上述挑战并提出关于自然语言处理(NLP)模型的令人满意的解释的方法。但是,这种方法尚未在一个综合框架中进行研究,在一个综合框架中,提出了一个共同的挑战,并提出了严格的评估实践和指标。我们在NLP领域的EXAI方法民主化的动机中提出了一项调查,该调查研究了NLP模型上的模型不合时宜的以及特定于模型的解释性方法。这种方法可以开发可固有的解释NLP模型,也可以以事后方式在前训练的模型上运行。我们根据它们解释的内容进一步将方法分解为三类:(1)单词嵌入(输入级),(2)NLP模型(处理级别)和(3)模型的决策(输出级)的内部工作。我们还详细介绍了NLP字段中的不同评估方法。最后,我们在附录中介绍了众所周知的神经机器翻译的案例研究,并在NLP领域提出了有希望的Exai的未来研究方向。
While there has been a recent explosion of work on ExplainableAI ExAI on deep models that operate on imagery and tabular data, textual datasets present new challenges to the ExAI community. Such challenges can be attributed to the lack of input structure in textual data, the use of word embeddings that add to the opacity of the models and the difficulty of the visualization of the inner workings of deep models when they are trained on textual data. Lately, methods have been developed to address the aforementioned challenges and present satisfactory explanations on Natural Language Processing (NLP) models. However, such methods are yet to be studied in a comprehensive framework where common challenges are properly stated and rigorous evaluation practices and metrics are proposed. Motivated to democratize ExAI methods in the NLP field, we present in this work a survey that studies model-agnostic as well as model-specific explainability methods on NLP models. Such methods can either develop inherently interpretable NLP models or operate on pre-trained models in a post-hoc manner. We make this distinction and we further decompose the methods into three categories according to what they explain: (1) word embeddings (input-level), (2) inner workings of NLP models (processing-level) and (3) models' decisions (output-level). We also detail the different evaluation approaches interpretability methods in the NLP field. Finally, we present a case-study on the well-known neural machine translation in an appendix and we propose promising future research directions for ExAI in the NLP field.