3D Neighborhood Convolution: Learning Depth-Aware Features for RGB-D and RGB Semantic Segmentation

Y. Chen; T. Mensink; E. Gavves

doi:https://doi.org/10.48550/arXiv.1910.01460

Authors	Y. Chen; T. Mensink; E. Gavves
Publication date	2019
Book title	2019 International Conference on 3D Vision
Book subtitle	3DV 2019 : proceedings : Quebec, Canada, 15-18 September 2019
ISBN	9781728131320
ISBN (electronic)	9781728131313
Event	2019 International Conference on 3D Vision
Pages (from-to)	173-182
Publisher	Los Alamitos, CA: IEEE Computer Society, Conference Publishing Services
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	A key challenge for RGB-D segmentation is how to effectively incorporate 3D geometric information from the depth channel into 2D appearance features. We propose to model the effective receptive field of 2D convolution based on the scale and locality of the 3D neighborhood. Standard convolutions are local in image space (u, v), often with a fixed receptive field of 3x3 pixels. We instead define convolutions that are local with respect to the corresponding point in 3D real-world space (x, y, z): the depth channel is used to adapt the receptive field of the convolution, so that the resulting filters are invariant to scale and focus on a certain range of depth. We introduce 3D Neighborhood Convolution (3DN-Conv), a convolutional operator around 3D neighborhoods. Further, by using estimated depth, we can apply our RGB-D based semantic segmentation model to RGB input alone. Experimental results validate that our proposed 3DN-Conv operator improves semantic segmentation, using either ground-truth depth (RGB-D) or estimated depth (RGB).
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.48550/arXiv.1910.01460 (Accepted author manuscript) https://doi.org/10.1109/3DV.2019.00028 (Final published version)
Other links	https://www.proceedings.com/51165.html
Downloads	1910.01460 (Accepted author manuscript)
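
The abstract describes a filter whose receptive field adapts to depth, so that it is scale-invariant and focuses on a limited depth range. The sketch below illustrates that general idea only; it is not the authors' 3DN-Conv operator, whose exact formulation is not given in this record. It assumes a pinhole camera, so that a real-world spacing `radius` at depth `z` spans roughly `focal * radius / z` pixels, and it assumes a Gaussian weighting on depth difference. The function name and the parameters `focal`, `radius`, and `sigma` are hypothetical.

```python
import numpy as np

def depth_aware_conv3x3(feat, depth, kernel, focal=1.0, radius=1.0, sigma=0.5):
    """Illustrative depth-aware 3x3 filter (NOT the paper's exact 3DN-Conv).

    feat:   (H, W) single-channel feature map
    depth:  (H, W) depth map, strictly positive
    kernel: (3, 3) filter weights
    focal:  hypothetical focal length in pixels
    radius: hypothetical real-world spacing between filter taps
    sigma:  hypothetical width of the depth-range weighting
    """
    H, W = feat.shape
    out = np.zeros((H, W), dtype=float)
    for v in range(H):
        for u in range(W):
            z = depth[v, u]
            # Pixel step that spans `radius` at depth z (pinhole assumption):
            # near surfaces get a wide pixel footprint, far surfaces a narrow one.
            step = max(1, int(round(focal * radius / z)))
            acc = 0.0
            for i, dv in enumerate((-1, 0, 1)):
                for j, du in enumerate((-1, 0, 1)):
                    vv, uu = v + dv * step, u + du * step
                    if not (0 <= vv < H and 0 <= uu < W):
                        continue  # taps outside the image contribute zero
                    # Down-weight taps whose depth differs from the centre's depth.
                    w = np.exp(-((depth[vv, uu] - z) ** 2) / (2 * sigma ** 2))
                    acc += kernel[i, j] * w * feat[vv, uu]
            out[v, u] = acc
    return out
```

With a constant depth map and the default parameters, every tap has weight 1 and the step is one pixel, so the function reduces to an ordinary zero-padded 3x3 filter; the depth-dependent behaviour appears only when depth varies across the window or changes the step size.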