Paper
28 July 2023 OLF: RGB-D adaptive late fusion for robust 6D pose estimation
Théo Petitjean, Zongwei Wu, Cédric Demonceaux, Olivier Laligant
Author Affiliations +
Proceedings Volume 12749, Sixteenth International Conference on Quality Control by Artificial Vision; 127490J (2023) https://doi.org/10.1117/12.2690943
Event: Sixteenth International Conference on Quality Control by Artificial Vision, 2023, Albi, France
Abstract
RGB-D 6D pose estimation has recently gained significant research attention due to the complementary information provided by depth data. However, in real-world scenarios, especially in industrial applications, the depth and color images are often more noisy1 . 2 Existing methods typically employ fusion designs that equally average RGB and depth features, which may not be optimal. In this paper, we propose a novel fusion design that adaptively merges RGB-D cues. Our approach involves assigning two learnable weight α1 and α2 to adjust the RGB and depth contributions with respect to the network depth. This enables us to improve the robustness against low-quality depth input in a simple yet effective manner. We conducted extensive experiments on the 6D pose estimation benchmark and demonstrated the effectiveness of our method. We evaluated our network in conjunction with DenseFusion on two datasets (LineMod3 and YCB4) using similar noise scenarios to verify the usefulness of reinforcing the fusion with the α1 and α2 parameters. Our experiments show that our method outperforms existing methods, particularly in low-quality depth input scenarios. We plan to make our source code publicly available for future research.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Théo Petitjean, Zongwei Wu, Cédric Demonceaux, and Olivier Laligant "OLF: RGB-D adaptive late fusion for robust 6D pose estimation", Proc. SPIE 12749, Sixteenth International Conference on Quality Control by Artificial Vision, 127490J (28 July 2023); https://doi.org/10.1117/12.2690943
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Signal to noise ratio

RGB color model

Pose estimation

Feature extraction

Image fusion

Feature fusion

Point clouds

Back to Top