基于演化局部可解释性模型无关解释的图像分类深度卷积神经网络解析

Explaining Deep Convolutional Neural Networks for Image Classification by Evolving Local Interpretable Model-agnostic Explanations

摘要 Abstract

深度卷积神经网络已被证明是图像分类中最有效的方法之一,但其严重缺陷在于较差的可解释性。在许多实际应用中,用户需要理解深度卷积神经网络预测背后的原因,以决定是否信任这些预测。为了解决这一问题,首次提出了一种基于遗传算法的新方法,用于自动生成能够帮助用户评估预测合理性的局部解释。此外,该方法具有模型无关性,即可以用于解释任何深度卷积神经网络模型。实验中,以ResNet为例模型进行解释,并选用ImageNet数据集作为基准数据集。进一步通过DenseNet和MobileNet展示了所提方法的模型无关特性。从ImageNet中随机选取的四张图像的演化局部解释表明,这些解释易于被人类识别。并且,所提出的解释成功捕获了样本图像的有意义的可解释特征,能够很好地解释深度卷积神经网络在这四张图像上的预测结果。基于30次实验运行的进一步分析显示,演化出的局部解释还可以提高深度卷积神经网络模型进行预测的概率/置信度。所提出的方法可以在一分钟内获得局部解释,比最先进的方法LIME快十倍以上。

Deep convolutional neural networks have proven their effectiveness, and have been acknowledged as the most dominant method for image classification. However, a severe drawback of deep convolutional neural networks is poor explainability. Unfortunately, in many real-world applications, users need to understand the rationale behind the predictions of deep convolutional neural networks when determining whether they should trust the predictions or not. To resolve this issue, a novel genetic algorithm-based method is proposed for the first time to automatically evolve local explanations that can assist users to assess the rationality of the predictions. Furthermore, the proposed method is model-agnostic, i.e., it can be utilised to explain any deep convolutional neural network models. In the experiments, ResNet is used as an example model to be explained, and the ImageNet dataset is selected as the benchmark dataset. DenseNet and MobileNet are further explained to demonstrate the model-agnostic characteristic of the proposed method. The evolved local explanations on four images, randomly selected from ImageNet, are presented, which show that the evolved local explanations are straightforward to be recognised by humans. Moreover, the evolved explanations can explain the predictions of deep convolutional neural networks on all four images very well by successfully capturing meaningful interpretable features of the sample images. Further analysis based on the 30 runs of the experiments exhibits that the evolved local explanations can also improve the probabilities/confidences of the deep convolutional neural network models in making the predictions. The proposed method can obtain local explanations within one minute, which is more than ten times faster than LIME (the state-of-the-art method).