Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
10-2016
Abstract
Retrieving recipes corresponding to given dish pictures facilitates the estimation of nutrition facts, which is crucial to various health relevant applications. The current approaches mostly focus on recognition of food category based on global dish appearance without explicit analysis of ingredient composition. Such approaches are incapable for retrieval of recipes with unknown food categories, a problem referred to as zero-shot retrieval. On the other hand, content-based retrieval without knowledge of food categories is also difficult to attain satisfactory performance due to large visual variations in food appearance and ingredient composition. As the number of ingredients is far less than food categories, understanding ingredients underlying dishes in principle is more scalable than recognizing every food category and thus is suitable for zero-shot retrieval. Nevertheless, ingredient recognition is a task far harder than food categorization, and this seriously challenges the feasibility of relying on them for retrieval. This paper proposes deep architectures for simultaneous learning of ingredient recognition and food categorization, by exploiting the mutual but also fuzzy relationship between them. The learnt deep features and semantic labels of ingredients are then innovatively applied for zero-shot retrieval of recipes. By experimenting on a large Chinese food dataset with images of highly complex dish appearance, this paper demonstrates the feasibility of ingredient recognition and sheds light on this zero-shot problem peculiar to cooking recipe retrieval.
Keywords
Food categorization, Ingredient recognition, Multitask deep learning, Zero-shot retrieval
Discipline
Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 24th ACM International conference on Multimedia, MM 2016, Amsterdam, October 15-19
First Page
32
Last Page
41
ISBN
9781450336031
Identifier
10.1145/2964284.2964315
Publisher
ACM
City or Country
Amsterdam
Citation
CHEN, Jingjing and NGO, Chong-wah.
Deep-based ingredient recognition for cooking recipe retrieval. (2016). Proceedings of the 24th ACM International conference on Multimedia, MM 2016, Amsterdam, October 15-19. 32-41.
Available at: https://ink.library.smu.edu.sg/sis_research/6498
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Included in
Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons