Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
8-2021
Abstract
We propose a novel training methodology---Concept Group Learning (CGL)---that encourages training of interpretable CNN filters by partitioning filters in each layer into concept groups, each of which is trained to learn a single visual concept. We achieve this through a novel regularization strategy that forces filters in the same group to be active in similar image regions for a given layer. We additionally use a regularizer to encourage a sparse weighting of the concept groups in each layer so that a few concept groups can have greater importance than others. We quantitatively evaluate CGL's model interpretability using standard interpretability evaluation techniques and find that our method increases interpretability scores in most cases. Qualitatively we compare the image regions which are most active under filters learned using CGL versus filters learned without CGL and find that CGL activation regions more strongly concentrate around semantically relevant features.
Keywords
Convolutional Neural Networks, Interpretability, Computer Vision.
Discipline
Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 30th International Joint Conference on Artificial Intelligence, Montreal, 2021 August 19-27
First Page
1061
Last Page
1067
ISBN
9780999241196
Identifier
10.24963/ijcai.2021/147
Publisher
IJCAI
City or Country
Montreal, Canada
Citation
VARSHNEYA, Saurabh; LEDENT, Antoine; VANDERMEULEN, Rob; LEI, Yunwen; ENDERS, Matthias; BORTH, Damian; and KLOFT, Marius.
Learning interpretable concept groups in CNNs. (2021). Proceedings of the 30th International Joint Conference on Artificial Intelligence, Montreal, 2021 August 19-27. 1061-1067.
Available at: https://ink.library.smu.edu.sg/sis_research/7206
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.24963/ijcai.2021/147
Included in
Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons