Delving into multi-illumination monocular depth estimation: A new dataset and method
Publication Type
Journal Article
Publication Date
1-2024
Abstract
Monocular depth prediction has received significant attention in recent years. However, the impact of illumination variations, which can shift scenes to unseen domains, has often been overlooked. To address this, we introduce the first indoor scene dataset featuring RGB-D images captured under multiple illumination conditions, allowing for a comprehensive exploration of indoor depth prediction. Additionally, we propose a novel method, MI-Transformer, which leverages global illumination understanding through large receptive fields to capture depth-attention contexts. This enables our network to overcome local window limitations and effectively mitigate the influence of changing illumination conditions. To evaluate the performance and robustness, we conduct extensive qualitative and quantitative analyses on both the proposed dataset and existing benchmarks, comparing our method with state-of-the-art approaches. The experimental results demonstrate the superiority of our method across various metrics, making it the first solution to achieve robust monocular depth estimation under diverse illumination conditions. We provide the codes, pre-trained models, and dataset openly accessible at https://github.com/ViktorLiang/midepth.
Keywords
Decoding, Depth Estimation, Estimation, Lighting, Multi-illuminations Dataset, Robustness, Three-dimensional displays, Training, Transformer-enhanced Network, Transformers
Discipline
Databases and Information Systems | OS and Networks
Research Areas
Software and Cyber-Physical Systems
Publication
IEEE Transactions on Multimedia
First Page
1
Last Page
15
ISSN
1520-9210
Identifier
10.1109/TMM.2024.3353544
Publisher
Institute of Electrical and Electronics Engineers
Citation
LIANG, Yuan; ZHANG, Zitian; XIAN, Chuhua; and HE, Shengfeng.
Delving into multi-illumination monocular depth estimation: A new dataset and method. (2024). IEEE Transactions on Multimedia. 1-15.
Available at: https://ink.library.smu.edu.sg/sis_research/8658
Copyright Owner and License
Authors
Additional URL
https://doi.org/10.1109/TMM.2024.3353544