Information-Theoretic Visual Explanation for Black-Box Classifiers
In this work, we attempt to explain the prediction of any black-box classifier from an information-theoretic perspective. For this purpose, we propose two attribution maps: an information gain (IG) map and a point-wise mutual information (PMI) map… IG map provides a class-independent answer to “How informative is each pixel? “, and PMI map offers a class-specific explanation by answering “How much does each pixel support a specific class?” In this manner, we propose (i) a theory-backed attribution method. The attribution […]
Read more