Food Image Recognition via Multi-scale Jigsaw and Reconstruction Network
Author: Liu Yuxin, Min Weiqing, Jiang Shuqiang, Rui Yong
Affiliation:

CLC Number: TP393

    Abstract:

    Recently, food image recognition has attracted increasing attention for its wide applications in healthy diet management, smart restaurants, and other scenarios. Unlike general object recognition, food recognition is a fine-grained task with high intra-class variability and high inter-class similarity; moreover, food images have neither fixed semantic patterns nor a specific spatial layout, which makes food recognition especially challenging. This study proposes a multi-scale jigsaw and reconstruction network (MJR-Net) for food recognition. MJR-Net is composed of three parts. The jigsaw and reconstruction module adopts destruction and reconstruction learning, which destroys and then reconstructs the original image, to extract local discriminative details. The feature pyramid module fuses mid-level features of different sizes to capture multi-scale local discriminative features. The channel-wise attention module models the importance of different feature channels to enhance discriminative visual patterns and suppress noisy ones. In addition, A-softmax loss and focal loss are used jointly to optimize the network by enlarging the inter-class variability and reweighting samples, respectively. MJR-Net is evaluated on three food datasets (ETH Food-101, Vireo Food-172, and ISIA Food-500) and achieves accuracies of 90.82%, 91.37%, and 64.95%, respectively. Experimental results show that MJR-Net is highly competitive with other food recognition methods and achieves state-of-the-art performance on Vireo Food-172 and ISIA Food-500. Comprehensive ablation studies and visualization analyses further verify the effectiveness of the proposed method.
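As a rough illustration of the destruction step described above, the sketch below shuffles an image patch-wise within a small neighbourhood, in the spirit of destruction and reconstruction learning: the texture inside each patch is kept intact while the global layout is scrambled, forcing the network to rely on local discriminative details. This is a minimal PyTorch sketch, not the authors' implementation; the function name jigsaw_destruct, the grid size n, and the jitter radius k are illustrative assumptions.

```python
import torch

def jigsaw_destruct(images, n=7, k=2):
    """Split each image into an n-by-n grid of patches and shuffle every patch
    within a neighbourhood of roughly k cells (region-confusion style)."""
    B, C, H, W = images.shape            # H and W must be divisible by n
    ph, pw = H // n, W // n
    # (B, C, H, W) -> (B, n*n, C, ph, pw): one row per patch, row-major order.
    patches = (images
               .reshape(B, C, n, ph, n, pw)
               .permute(0, 2, 4, 1, 3, 5)
               .reshape(B, n * n, C, ph, pw))
    # Jitter the row/column indices by at most k and argsort, so the resulting
    # permutation keeps every patch close to its original grid cell.
    idx = torch.arange(n, dtype=torch.float)
    perms = []
    for _ in range(B):
        row = torch.argsort(idx + torch.empty(n).uniform_(-k, k))
        col = torch.argsort(idx + torch.empty(n).uniform_(-k, k))
        perms.append((row[:, None] * n + col[None, :]).reshape(-1))
    perm = torch.stack(perms)                              # (B, n*n)
    shuffled = torch.stack([patches[b, perm[b]] for b in range(B)])
    # Stitch the shuffled patches back into full images.
    destroyed = (shuffled
                 .reshape(B, n, n, C, ph, pw)
                 .permute(0, 3, 1, 4, 2, 5)
                 .reshape(B, C, H, W))
    return destroyed, perm

# Example: a batch of 224x224 images cut into a 7x7 grid (224 / 7 = 32).
# destroyed, perm = jigsaw_destruct(torch.randn(4, 3, 224, 224), n=7, k=2)
```

Only the destruction half is sketched here; the reconstruction branch of the module would additionally learn to recover the original patch layout (e.g., from perm), which is omitted.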

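The channel-wise attention and the focal-loss reweighting mentioned in the abstract correspond to well-known building blocks: squeeze-and-excitation style channel attention and the focal loss. Below is a minimal sketch of plausible versions of both, assuming standard PyTorch; the class and function names and the hyper-parameters (reduction, gamma, alpha) are illustrative, not the paper's exact design, and the A-softmax term, which further imposes an angular margin on the classifier, is not reproduced here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ChannelAttention(nn.Module):
    """Squeeze-and-excitation style channel attention: squeeze each channel to
    a single statistic, pass it through a bottleneck MLP, and rescale the
    feature map with the resulting per-channel weights."""

    def __init__(self, channels, reduction=16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):                      # x: (B, C, H, W)
        b, c, _, _ = x.shape
        w = self.fc(x.mean(dim=(2, 3)))        # (B, C) channel importance
        return x * w.view(b, c, 1, 1)          # enhance/suppress channels


def focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    """Focal loss: cross-entropy scaled by (1 - p_t)^gamma so that easy,
    well-classified samples contribute less and hard samples dominate."""
    log_p = F.log_softmax(logits, dim=-1)                       # (B, num_classes)
    log_pt = log_p.gather(1, targets.unsqueeze(1)).squeeze(1)   # (B,)
    pt = log_pt.exp()
    return (-alpha * (1.0 - pt) ** gamma * log_pt).mean()
```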
Get Citation

Liu YX, Min WQ, Jiang SQ, Rui Y. Food image recognition via multi-scale jigsaw and reconstruction network. Journal of Software, 2022, 33(11): 4379-4395 (in Chinese).
Article Metrics
  • Abstract: 1108
  • PDF: 2387
  • HTML: 1994
  • Cited by: 0
History
  • Received: September 23, 2020
  • Revised: January 11, 2021
  • Online: November 11, 2022
  • Published: November 06, 2022