Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
3-2024
Abstract
Data augmentation is a powerful technique to enhance the performance of a deep learning task but has received less attention in 3D deep learning. It is well known that when 3D shapes are sparsely represented with low point density, the performance of the downstream tasks drops significantly. This work explores test-time augmentation (TTA) for 3D point clouds. We are inspired by the recent revolution of learning implicit representation and point cloud upsampling, which can produce high-quality 3D surface reconstruction and proximity-to-surface, respectively. Our idea is to leverage the implicit field reconstruction or point cloud upsampling techniques as a systematic way to augment point cloud data. Mainly, we test both strategies by sampling points from the reconstructed results and using the sampled point cloud as test-time augmented data. We show that both strategies are effective in improving accuracy. We observed that point cloud upsampling for test-time augmentation can lead to more significant performance improvement on downstream tasks such as object classification and segmentation on the ModelNet40, ShapeNet, ScanObjectNN, and SemanticKITTI datasets, especially for sparse point clouds.
Discipline
Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces
Research Areas
Information Systems and Management
Publication
Proceedings of the 2024 International Conference on 3D Vision (3DV), Davos, Switzerland, March 18-21
First Page
1543
Last Page
1553
ISBN
9798350362459
Identifier
10.1109/3DV62453.2024.00153
Publisher
IEEE
City or Country
Los Alamitos, CA
Citation
VU, Tuan-Anh; SARKAR, Srinjay; ZHANG, Zhiyuan; HUA, Binh-Son; and YEUNG, Sai-Kit.
Test-time augmentation for 3D point cloud classification and segmentation. (2024). Proceedings of the 2024 International Conference on 3D Vision (3DV), Davos, Switzerland, March 18-21. 1543-1553.
Available at: https://ink.library.smu.edu.sg/sis_research/8963
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/3DV62453.2024.00153
Included in
Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons