Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
11-2020
Abstract
Recent advances in deep learning for 3D point clouds have shown great promises in scene understanding tasks thanks to the introduction of convolution operators to consume 3D point clouds directly in a neural network. Point cloud data, however, could have arbitrary rotations, especially those acquired from 3D scanning. Recent works show that it is possible to design point cloud convolutions with rotation invariance property, but such methods generally do not perform as well as translation-invariant only convolution. We found that a key reason is that compared to point coordinates, rotation-invariant features consumed by point cloud convolution are not as distinctive. To address this problem, we propose a novel convolution operator that enhances feature distinction by integrating global context information from the input point cloud to the convolution. To this end, a globally weighted local reference frame is constructed in each point neighborhood in which the local point set is decomposed into bins. Anchor points are generated in each bin to represent global shape features. A convolution can then be performed to transform the points and anchor features into final rotation-invariant features. We conduct several experiments on point cloud classification, part segmentation, shape retrieval, and normals estimation to evaluate our convolution, which achieves state-of-the-art accuracy under challenging rotations.
Discipline
Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces
Research Areas
Information Systems and Management
Publication
Proceedings of the 2020 International Conference on 3D Vision (3DV), Fukuoka, Japan, November 25-28
First Page
210
Last Page
219
ISBN
9781728181288
Identifier
10.1109/3dv50981.2020.00031
Publisher
IEEE
City or Country
Fukuoka, Japan
Citation
ZHANG, Zhiyuan; HUA, Binh-Son; CHEN, Wei; TIAN, Yibin; and YEUNG, Sai-Kit.
Global context aware convolutions for 3D point cloud understanding. (2020). Proceedings of the 2020 International Conference on 3D Vision (3DV), Fukuoka, Japan, November 25-28. 210-219.
Available at: https://ink.library.smu.edu.sg/sis_research/7941
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
http://doi.org/10.1109/3dv50981.2020.00031
Included in
Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons