RePOSE : 3D human pose estimation via spatio-temporal depth relational consistency
Publication Type
Conference Proceeding Article
Publication Date
10-2024
Abstract
We introduce RePOSE, a simple yet effective approach for addressing occlusion challenges in the learning of 3D human pose estimation (HPE) from videos. Conventional approaches typically employ absolute depth signals as supervision, which are adept at discernible keypoints but become less reliable when keypoints are occluded, resulting in vague and inconsistent learning trajectories for the neural network. RePOSE overcomes this limitation by introducing spatio-temporal relational depth consistency into the supervision signals. The core rationale of our method lies in prioritizing the precise sequencing of occluded keypoints. This is achieved by using a relative depth consistency loss that operates in both spatial and temporal domains. By doing so, RePOSE shifts the focus from learning absolute depth values, which can be misleading in occluded scenarios, to relative positioning, which provides a more robust and reliable cue for accurate pose estimation. This subtle yet crucial shift facilitates more consistent and accurate 3D HPE under occlusion conditions. The elegance of our core idea lies in its simplicity and ease of implementation, requiring only a few lines of code. Extensive experiments validate that RePOSE not only outperforms existing state-of-the-art methods but also significantly enhances the robustness and precision of 3D HPE in challenging occluded environments.
Keywords
3D human pose estimation, Depth relational loss functions
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization; Data Science and Engineering
Publication
Proceedings of the 18th European Conference on Computer Vision (ECCV 2024) : Milan, Italy, September 29 - October 4
Publisher
ECCV
City or Country
Italy
Citation
SUN, Ziming; LIANG, Yuan; MA, Zejun; ZHANG, Tianle; BAO, Linchao; LI, Guiqing; and HE, Shengfeng.
RePOSE : 3D human pose estimation via spatio-temporal depth relational consistency. (2024). Proceedings of the 18th European Conference on Computer Vision (ECCV 2024) : Milan, Italy, September 29 - October 4.
Available at: https://ink.library.smu.edu.sg/sis_research/9803