论文标题
长尾皮肤病变分类的柔性抽样
Flexible Sampling for Long-tailed Skin Lesion Classification
论文作者
论文摘要
大多数医疗任务自然会由于复杂的患者水平疾病和罕见疾病的存在而表现出长尾分布。现有的长尾学习方法通常同样对待每个班级以重新平衡长尾分布。但是,考虑到某些具有挑战性的课程可能会呈现出多样化的阶级分布,因此平衡所有类别的平衡可能会导致绩效的显着下降。为了解决这个问题,在本文中,我们提出了一个基于课程学习的框架,称为“灵活抽样”,用于长尾皮肤病变分类任务。具体而言,我们最初根据单个类原型将训练数据的子集作为锚点进行采样。然后,这些锚点用于预先训练一个推理模型,以评估每级学习难度。最后,我们使用课程采样模块,以学习难度了解的抽样概率从其余训练样本中动态查询新样本。我们根据ISIC数据集上的几种最新方法评估了模型。具有两个长尾环境的结果证明了我们提出的培训策略的优越性,该培训策略为长尾皮肤病变分类提供了新的基准。
Most of the medical tasks naturally exhibit a long-tailed distribution due to the complex patient-level conditions and the existence of rare diseases. Existing long-tailed learning methods usually treat each class equally to re-balance the long-tailed distribution. However, considering that some challenging classes may present diverse intra-class distributions, re-balancing all classes equally may lead to a significant performance drop. To address this, in this paper, we propose a curriculum learning-based framework called Flexible Sampling for the long-tailed skin lesion classification task. Specifically, we initially sample a subset of training data as anchor points based on the individual class prototypes. Then, these anchor points are used to pre-train an inference model to evaluate the per-class learning difficulty. Finally, we use a curriculum sampling module to dynamically query new samples from the rest training samples with the learning difficulty-aware sampling probability. We evaluated our model against several state-of-the-art methods on the ISIC dataset. The results with two long-tailed settings have demonstrated the superiority of our proposed training strategy, which achieves a new benchmark for long-tailed skin lesion classification.