Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
8-2013
Abstract
Automated face annotation aims to automatically detect human faces from a photo and further name the faces with the corresponding human names. In this paper, we tackle this open problem by investigating a search-based face annotation (SBFA) paradigm for mining large amounts of web facial images freely available on the WWW. Given a query facial image for annotation, the idea of SBFA is to first search for top-n similar facial images from a web facial image database and then exploit these top-ranked similar facial images and their weak labels for naming the query facial image. To fully mine those information, this paper proposes a novel framework of Learning to Name Faces (L2NF) – a unified multimodal learning approach for search-based face annotation, which consists of the following major components: (i) we enhance the weak labels of top-ranked similar images by exploiting the “label smoothness" assumption; (ii) we construct the multimodal representations of a facial image by extracting different types of features; (iii) we optimize the distance measure for each type of features using distance metric learning techniques; and finally (iv) we learn the optimal combination of multiple modalities for annotation through a learning to rank scheme. We conduct a set of extensive empirical studies on two real-world facial image databases, in which encouraging results show that the proposed algorithms significantly boost the naming accuracy of search-based face annotation task.
Keywords
web facial images, auto face annotation, supervised learning
Discipline
Computer Sciences | Databases and Information Systems | Numerical Analysis and Scientific Computing
Research Areas
Data Science and Engineering
Publication
SIGIR '13: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval: July 28 - August 1, Dublin, Ireland
First Page
443
Last Page
452
ISBN
9781450320344
Identifier
10.1145/2484028.2484040
Publisher
ACM
City or Country
New York
Citation
WANG, Dayong; HOI, Steven C. H.; WU, Pengcheng; ZHU, Jianke; HE, Ying; and MIAO, Chunyan.
Learning to name faces: A multimodal learning scheme for search-based face annotation. (2013). SIGIR '13: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval: July 28 - August 1, Dublin, Ireland. 443-452.
Available at: https://ink.library.smu.edu.sg/sis_research/2336
Copyright Owner and License
Publisher
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/2484028.2484040
Included in
Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons