Publication Type
Journal Article
Version
publishedVersion
Publication Date
12-2020
Abstract
Food recognition has captured numerous research attention for its importance for health-related applications. The existing approaches mostly focus on the categorization of food according to dish names, while ignoring the underlying ingredient composition. In reality, two dishes with the same name do not necessarily share the exact list of ingredients. Therefore, the dishes under the same food category are not mandatorily equal in nutrition content. Nevertheless, due to limited datasets available with ingredient labels, the problem of ingredient recognition is often overlooked. Furthermore, as the number of ingredients is expected to be much less than the number of food categories, ingredient recognition is more tractable in the real-world scenario. This paper provides an insightful analysis of three compelling issues in ingredient recognition. These issues involve recognition in either image-level or region level, pooling in either single or multiple image scales, learning in either single or multi-task manner. The analysis is conducted on a large food dataset, Vireo Food-251, contributed by this paper. The dataset is composed of 169,673 images with 251 popular Chinese food and 406 ingredients. The dataset includes adequate challenges in scale and complexity to reveal the limit of the current approaches in ingredient recognition.
Keywords
Image recognition, Visualization, Phase frequency detectors, Image segmentation, Fish, Deep learning, Shape, Food images, Chinese food, ingredient recognition, deep learning
Discipline
Artificial Intelligence and Robotics | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
IEEE Transactions on Image Processing
Volume
30
First Page
1514
Last Page
1526
ISSN
1057-7149
Identifier
10.1109/TIP.2020.3045639
Publisher
Institute of Electrical and Electronics Engineers
Citation
CHEN, Jingjing; ZHU, Bin; NGO, Chong-wah; CHUA, Tat-Seng; and JIANG, Yu-Gang.
A study of multi-task and region-wise deep learning for food ingredient recognition. (2020). IEEE Transactions on Image Processing. 30, 1514-1526.
Available at: https://ink.library.smu.edu.sg/sis_research/6301
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/TIP.2020.3045639
Included in
Artificial Intelligence and Robotics Commons, Graphics and Human Computer Interfaces Commons