Counterfactual attribute-based visual explanations for classification

Open Access
Authors
Publication date 06-2021
Journal International Journal of Multimedia Information Retrieval
Event 10th ACM International Conference on Multimedia Retrieval, ICMR 2020
Volume | Issue number 10 | 2
Pages (from-to) 127–140
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In this paper, our aim is to provide human understandable intuitive factual and counterfactual explanations for the decisions of neural networks. Humans tend to reinforce their decisions by providing attributes and counterattributes. Hence, in this work, we utilize attributes as well as examples to provide explanations. In order to provide counterexplanations we make use of directed perturbations to arrive at the counterclass attribute values in doing so, we explain what is present and what is absent in the original image. We evaluate our method when images are misclassified into closer counterclasses as well as when misclassified into completely different counterclasses. We conducted experiments on both finegrained as well as coarsegrained datasets. We verified our attribute-based explanations method both quantitatively and qualitatively and showed that attributes provide discriminating and human understandable explanations for both standard as well as robust networks.
Document type Article
Note In special issue: Best Papers of ACM ICMR 2020.
Language English
Published at https://doi.org/10.1007/s13735-021-00208-3
Downloads
Permalink to this page
Back