论文标题
发现功能相关性和解释神经分类器决策的固定会员方法
A Set Membership Approach to Discovering Feature Relevance and Explaining Neural Classifier Decisions
论文作者
论文摘要
神经分类器是非线性系统,为他们所学的给定问题提供了关于模式类别的决策。由分类器计算的每个模式计算的输出构成了某些未知函数的输出的近似值,将模式数据映射到各自的类别。缺乏对这种功能的知识以及神经分类器的复杂性,尤其是当这些功能是深度学习体系结构时,不允许获取有关如何做出特定预测的信息。因此,这些强大的学习系统被视为黑匣子,在关键应用中,它们的使用往往被认为是不合适的。洞悉这种黑匣子操作是解释神经分类器的操作并评估其决策有效性的一种方法。在本文中,我们解决了这个问题,介绍了一种新的方法,以发现哪些特征被训练有素的神经分类器及其如何影响分类器的输出,从而获得了对其决策的解释。尽管功能相关性在这里的机器学习文献中引起了很多关注,但我们根据基于间隔分析的设定会员方法针对的非线性参数估计重新考虑它。因此,提出的方法基于合理的数学方法,而获得的结果构成了对分类器决策前提的可靠估计。
Neural classifiers are non linear systems providing decisions on the classes of patterns, for a given problem they have learned. The output computed by a classifier for each pattern constitutes an approximation of the output of some unknown function, mapping pattern data to their respective classes. The lack of knowledge of such a function along with the complexity of neural classifiers, especially when these are deep learning architectures, do not permit to obtain information on how specific predictions have been made. Hence, these powerful learning systems are considered as black boxes and in critical applications their use tends to be considered inappropriate. Gaining insight on such a black box operation constitutes a one way approach in interpreting operation of neural classifiers and assessing the validity of their decisions. In this paper we tackle this problem introducing a novel methodology for discovering which features are considered relevant by a trained neural classifier and how they affect the classifier's output, thus obtaining an explanation on its decision. Although, feature relevance has received much attention in the machine learning literature here we reconsider it in terms of nonlinear parameter estimation targeted by a set membership approach which is based on interval analysis. Hence, the proposed methodology builds on sound mathematical approaches and the results obtained constitute a reliable estimation of the classifier's decision premises.