< Back to previous page

Publication

ClueDepth Grasp: Leveraging positional clues of depth for completing depth of transparent objects

Journal Contribution - Journal Article

Obtaining accurate depth information is key to robot grasping tasks. However, for transparent objects, RGB-D cameras have di culty perceiving them owing to the objects’ refraction and reflection properties. This property makes it di cult for humanoid robots to perceive and grasp everyday transparent objects. To remedy this, existing studies usually remove transparent object areas using a model that learns patterns from the remaining opaque areas so that depth estimations can be completed. Notably, this frequently leads to deviations from the ground truth. In this study, we propose a new depth completion method [i.e., ClueDepth Grasp (CDGrasp)] that works more eectively with transparent objects in RGB-D images. Specifically, we propose a ClueDepth module, which leverages the geometry method to filter-out refractive and reflective points while preserving the correct depths, consequently providing crucial positional clues for object location. To acquire su cient features to complete the depth map, we design a DenseFormer network that integrates DenseNet to extract local features and swin-transformer blocks to obtain the required global information. Furthermore, to fully utilize the information obtained from multi-modal visual maps, we devise a Multi-Modal U-Net Module to capture multiscale features. Extensive experiments conducted on the ClearGrasp dataset show that our method achieves state-of-the-art performance in terms of accuracy and generalization of depth completion for transparent objects, and the successful employment of a humanoid robot grasping capability verifies the e cacy of our proposed method.
Journal: Frontiers in Neurorobotics
ISSN: 1662-5218
Volume: 16
Publication year:2022
Keywords:depth completion, transparent objects, grasping, deep learning, robot
Accessibility:Open