A novel class activation map for visual explanations in multi-object scenes
Class activation maps (CAMs) have emerged as a popular technique to improve model interpretability of deep learning-based models. While existing CAM methods are able to extract salient semantic regions to provide high-confidence pseudo-labels for downstream tasks such as semantic segmentation, they are less effective when dealing with multi-object scenes. In this paper, we design a multi-channel weight assignment scheme that learns from both positive and negative regions to yield an improved CAM model for images comprising multiple objects. We demonstrate the effectiveness of our proposed method on two new data sets, a cat-and-dog dataset and a PASCAL VOC 2012-based multi-object dataset, and show it to compare favourably with other state-of-the-art CAM methods, outperforming them in terms of both mIoU and inter-object activation ratio (IAR), a new evaluation measure proposed to evaluate CAM performance in multi-object scenes.
Funding
Natural Science Foundation of Hunan Province, China [2022GK5002, 2020JJ4746]
Special Foundation for Distinguished Young Scientists of Changsha [kq2209003]
111 Project [No. D23006]
History
School
- Science
Department
- Computer Science
Published in
2023 IEEE International Conference on Image Processing (ICIP)Pages
2615 - 2619Source
2023 IEEE International Conference on Image Processing (ICIP 2023)Publisher
IEEEVersion
- AM (Accepted Manuscript)
Rights holder
© IEEEPublisher statement
© 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.Acceptance date
2023-06-21Publication date
2023-09-11Copyright date
2023ISBN
9781728198354Publisher version
Language
- en