论文标题
超越XAI:一项系统的研究指导学习
Going Beyond XAI: A Systematic Survey for Explanation-Guided Learning
论文作者
论文摘要
随着深度神经网络(DNN)的社会影响的增长,推进DNN的目标变得更加复杂和多样化,从提高常规模型准确度指标到注入先进的人类美德,例如公平,问责制,透明度(FACCT)和无偏见。最近,可解释的人工智能(XAI)中的技术引起了人们的关注,并极大地帮助了机器学习(ML)工程师理解AI模型。但是,与此同时,我们开始目睹AI社区中XAI以外的新兴需求。根据从XAI学到的见解,我们如何能够更好地授权ML工程师指导他们的DNN,从而可以根据预期提高模型的合理性和性能?本文提供了及时,广泛的文献概述,概述了领域解释引导学习(EGL),这是一个技术领域,通过在模型解释中添加正则化,监督或干预来引导DNNS的推理过程。在此过程中,我们首先提供了EGL及其一般学习范式的正式定义。其次,提供了EGL评估的关键因素,以及对EGL的现有评估程序和指标的摘要和分类。最后,讨论了当前和潜在的未来应用领域以及EGL的方向,并介绍了广泛的实验研究,旨在在各种流行的应用领域(例如计算机视觉(CV)和自然语言处理(NLP)领域)中现有EGL模型之间提供全面的比较研究。
As the societal impact of Deep Neural Networks (DNNs) grows, the goals for advancing DNNs become more complex and diverse, ranging from improving a conventional model accuracy metric to infusing advanced human virtues such as fairness, accountability, transparency (FaccT), and unbiasedness. Recently, techniques in Explainable Artificial Intelligence (XAI) are attracting considerable attention, and have tremendously helped Machine Learning (ML) engineers in understanding AI models. However, at the same time, we started to witness the emerging need beyond XAI among AI communities; based on the insights learned from XAI, how can we better empower ML engineers in steering their DNNs so that the model's reasonableness and performance can be improved as intended? This article provides a timely and extensive literature overview of the field Explanation-Guided Learning (EGL), a domain of techniques that steer the DNNs' reasoning process by adding regularization, supervision, or intervention on model explanations. In doing so, we first provide a formal definition of EGL and its general learning paradigm. Secondly, an overview of the key factors for EGL evaluation, as well as summarization and categorization of existing evaluation procedures and metrics for EGL are provided. Finally, the current and potential future application areas and directions of EGL are discussed, and an extensive experimental study is presented aiming at providing comprehensive comparative studies among existing EGL models in various popular application domains, such as Computer Vision (CV) and Natural Language Processing (NLP) domains.