Publication Type

Journal Article

Version

publishedVersion

Publication Date

10-2022

Abstract

Cross-view geo-localization is to localize the same geographic target in images from different perspectives, e.g., satellite-view and drone-view. The primary challenge faced by existing methods is the large visual appearance changes across views. Most previous work utilizes the deep neural network to obtain the discriminative representations and directly uses them to accomplish the geo-localization task. However, these approaches ignore that the redundancy retained in the extracted features negatively impacts the result. In this paper, we argue that the information bottleneck (IB) can retain the most relevant information while removing as much redundancy as possible. The variational self-distillation (VSD) strategy provides an accurate and analytical solution to estimate the mutual information. To this end, we propose to learn discriminative representations via variational self-distillation (dubbed LDRVSD). Extensive experiments are conducted on two widely-used datasets University-1652 and CVACT, showing the remarkable performance improvements obtained by our LDRVSD method compared with several state-of-the-art approaches.

Keywords

Geo-localization, Information bottleneck, Image retrieval, Variational self-distillation

Discipline

Graphics and Human Computer Interfaces

Publication

Computers and Electrical Engineering

Volume

103

First Page

1

Last Page

11

ISSN

0045-7906

Identifier

10.1016/j.compeleceng.2022.108335

Publisher

Elsevier

Comments

student pub

Additional URL

https://doi.org/10.1016/j.compeleceng.2022.108335

Share

COinS