论文标题
长尾食品分类
Long-tailed Food Classification
论文作者
论文摘要
食品分类是基于图像的饮食评估的基本步骤,以预测每个输入图像中食物的类型。但是,在现实世界中,食物形象的预测通常是长尾分布在不同食品类别之间的,这会导致较重的班级不平衡问题和限制性能。此外,现有的长尾分类方法都不集中在食品数据上,由于食品之间的阶层间和较高的类相似性,这可能更具挑战性。在这项工作中,我们首先介绍了两个新的基准数据集,用于长尾食品分类,包括Food101-LT和VFN-LT,其中VFN-LT中的样品数量显示了现实世界中长尾食品分布。然后,我们提出了一个新颖的2阶段框架,通过(1)将头类采样以消除冗余样本以及通过知识蒸馏来维护学习信息,以及(2)通过执行视觉吸引数据增强来过度采样。我们通过与现有的最新长尾分类方法进行比较来展示我们的方法的有效性,并在Food101-LT和VFN-LT基准测试中显示出改善的性能。结果表明,将我们的方法应用于相关的现实生活应用。
Food classification serves as the basic step of image-based dietary assessment to predict the types of foods in each input image. However, food image predictions in a real world scenario are usually long-tail distributed among different food classes, which cause heavy class-imbalance problems and a restricted performance. In addition, none of the existing long-tailed classification methods focus on food data, which can be more challenging due to the lower inter-class and higher intra-class similarity among foods. In this work, we first introduce two new benchmark datasets for long-tailed food classification including Food101-LT and VFN-LT where the number of samples in VFN-LT exhibits the real world long-tailed food distribution. Then we propose a novel 2-Phase framework to address the problem of class-imbalance by (1) undersampling the head classes to remove redundant samples along with maintaining the learned information through knowledge distillation, and (2) oversampling the tail classes by performing visual-aware data augmentation. We show the effectiveness of our method by comparing with existing state-of-the-art long-tailed classification methods and show improved performance on both Food101-LT and VFN-LT benchmarks. The results demonstrate the potential to apply our method to related real life applications.