Learning rotation equivalent scene representation from instance-level semantics: A novel top-down perspective

Open Access
Authors
Publication date 03-2023
Journal Computer Vision and Image Understanding
Article number 103635
Volume | Issue number 229
Number of pages 14
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
This paper focuses on rotation variant scene recognition. Different from existing rotation invariant recognition approaches which learn from either rotated images or rotated convolutional filters in a bottom-up manner, a new top-down perspective by learning is explored from instance-level semantic representation. The goal is to eliminate the convolutional feature differences in bottom-up feature propagation caused by the rotation sensitive nature of convolution operation. Our rotation equivalent convolutional neural network (RE-CNN) scheme consists of three components. Firstly, our key instance selection module highlights the instances strongly related to the scene scheme regardless of their orientation. Secondly, our key instance aggregation module builds a scene representation invariant to the position change of each instance caused by rotation. Finally, our semantic fusion module allows the framework to be organized as a whole and implements rotation regularization. Notably, our RE-CNN scheme can be adapted to existing CNNs in a plug-in-and-play manner. Extensive experiments on rotation variant scene recognition benchmarks from four domains demonstrate the state-of-the-art performance and generalization capability of the proposed RE-CNN.
Document type Article
Language English
Published at https://doi.org/10.1016/j.cviu.2023.103635
Downloads
Permalink to this page
Back