Counterfactual attribute-based visual explanations for classification

S. Gulshad; A. Smeulders

doi:https://doi.org/10.1007/s13735-021-00208-3

Counterfactual attribute-based visual explanations for classification

Authors	S. Gulshad A. Smeulders
Publication date	06-2021
Journal	International Journal of Multimedia Information Retrieval
Event	10th ACM International Conference on Multimedia Retrieval, ICMR 2020
Volume \| Issue number	10 \| 2
Pages (from-to)	127–140
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	In this paper, our aim is to provide human understandable intuitive factual and counterfactual explanations for the decisions of neural networks. Humans tend to reinforce their decisions by providing attributes and counterattributes. Hence, in this work, we utilize attributes as well as examples to provide explanations. In order to provide counterexplanations we make use of directed perturbations to arrive at the counterclass attribute values in doing so, we explain what is present and what is absent in the original image. We evaluate our method when images are misclassified into closer counterclasses as well as when misclassified into completely different counterclasses. We conducted experiments on both finegrained as well as coarsegrained datasets. We verified our attribute-based explanations method both quantitatively and qualitatively and showed that attributes provide discriminating and human understandable explanations for both standard as well as robust networks.
Document type	Article
Note	In special issue: Best Papers of ACM ICMR 2020.
Language	English
Published at	https://doi.org/10.1007/s13735-021-00208-3 (Final published version)
Downloads	Gulshad-Smeulders2021_Article_CounterfactualAttribute-basedV (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Counterfactual attribute-based visual explanations for classification