Learning rotation equivalent scene representation from instance-level semantics: A novel top-down perspective

Q. Bi; S. You; W. Ji; T. Gevers

doi:https://doi.org/10.1016/j.cviu.2023.103635

Learning rotation equivalent scene representation from instance-level semantics: A novel top-down perspective

Authors	Q. Bi S. You W. Ji T. Gevers
Publication date	03-2023
Journal	Computer Vision and Image Understanding
Article number	103635
Volume \| Issue number	229
Number of pages	14
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	This paper focuses on rotation variant scene recognition. Different from existing rotation invariant recognition approaches which learn from either rotated images or rotated convolutional filters in a bottom-up manner, a new top-down perspective by learning is explored from instance-level semantic representation. The goal is to eliminate the convolutional feature differences in bottom-up feature propagation caused by the rotation sensitive nature of convolution operation. Our rotation equivalent convolutional neural network (RE-CNN) scheme consists of three components. Firstly, our key instance selection module highlights the instances strongly related to the scene scheme regardless of their orientation. Secondly, our key instance aggregation module builds a scene representation invariant to the position change of each instance caused by rotation. Finally, our semantic fusion module allows the framework to be organized as a whole and implements rotation regularization. Notably, our RE-CNN scheme can be adapted to existing CNNs in a plug-in-and-play manner. Extensive experiments on rotation variant scene recognition benchmarks from four domains demonstrate the state-of-the-art performance and generalization capability of the proposed RE-CNN.
Document type	Article
Language	English
Published at	https://doi.org/10.1016/j.cviu.2023.103635 (Final published version)
Downloads	Learning rotation equivalent scene representation from instance-level semantics (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Learning rotation equivalent scene representation from instance-level semantics: A novel top-down perspective