论文标题
Proto2Proto:您能以我的方式认出汽车吗?
Proto2Proto: Can you recognize the car, the way I do?
论文作者
论文摘要
原型方法最近由于其内在的可解释性质而引起了很多关注,这是通过原型获得的。随着模型再利用和蒸馏的日益增长的用例,还需要研究将一种模型转移到另一种模型的转移。我们提出了Proto2Proto,这是一种通过知识蒸馏将一个原型零件网络解释性转移到另一个原型零件网络的新颖方法。我们的方法旨在为从教师转移到较浅的学生模型的“黑暗”知识增加解释性。我们提出了两个新的损失:“全局解释”损失和“斑块 - 概要对应关系”损失,以促进这种转移。全球解释损失迫使学生的原型与教师原型接近,而补丁 - 概要型对应损失会执行学生的本地表示与教师的相似。此外,我们提出了三个新颖的指标,以评估学生与老师的接近度,以作为我们设置中的可解释性转移的度量。我们在定性和定量上证明了我们在CUB-200-2011和Stanford Cars数据集上的方法的有效性。我们的实验表明,所提出的方法确实实现了从教师到学生的解释性转移,同时表现出竞争性的表现。
Prototypical methods have recently gained a lot of attention due to their intrinsic interpretable nature, which is obtained through the prototypes. With growing use cases of model reuse and distillation, there is a need to also study transfer of interpretability from one model to another. We present Proto2Proto, a novel method to transfer interpretability of one prototypical part network to another via knowledge distillation. Our approach aims to add interpretability to the "dark" knowledge transferred from the teacher to the shallower student model. We propose two novel losses: "Global Explanation" loss and "Patch-Prototype Correspondence" loss to facilitate such a transfer. Global Explanation loss forces the student prototypes to be close to teacher prototypes, and Patch-Prototype Correspondence loss enforces the local representations of the student to be similar to that of the teacher. Further, we propose three novel metrics to evaluate the student's proximity to the teacher as measures of interpretability transfer in our settings. We qualitatively and quantitatively demonstrate the effectiveness of our method on CUB-200-2011 and Stanford Cars datasets. Our experiments show that the proposed method indeed achieves interpretability transfer from teacher to student while simultaneously exhibiting competitive performance.