论文标题
通过稀有示例挖掘改善3D检测中的类内长尾长尾
Improving the Intra-class Long-tail in 3D Detection via Rare Example Mining
论文作者
论文摘要
深度学习体系结构的持续改进稳步提高了3D对象探测器的总体性能到与人类在某些任务和数据集中的水平,在这些任务和数据集中,总体绩效主要由常见示例驱动。但是,即使是表现最好的模型也遇到了最幼稚的错误,而在训练数据中不经常出现的例子,例如具有不规则几何形状的车辆。长尾文献中的大多数研究都集中在每类已知不平衡标签计数的类别分类问题上,但它们并不直接适用于具有较大类内部变化(例如3D对象检测)的阶级内长尾示例,例如,同一类标签的实例可以具有巨大的变化属性,例如形状和尺寸。其他作品建议根据不确定性,困难或多样性的标准使用主动学习来缓解这一问题。在这项研究中,我们确定了一个新的概念维度 - 稀有性 - 以挖掘新数据,以改善模型的长尾性能。我们表明,与难度相反,稀有性是3D检测器以数据为中心改进的关键,因为稀有性是由于缺乏数据支持的结果,而难度与问题的基本歧义有关。我们提出了一种一般有效的方法,可以使用流量模型根据特征空间中的密度估计来识别物体的稀有性,并提出一种针对采矿稀有物体轨迹的原则性成本感知的配方,从而改善了整体模型性能,但更重要的是 - 显着提高了稀有物体的性能(比30.97 \%\%\%\%\%\%\%
Continued improvements in deep learning architectures have steadily advanced the overall performance of 3D object detectors to levels on par with humans for certain tasks and datasets, where the overall performance is mostly driven by common examples. However, even the best performing models suffer from the most naive mistakes when it comes to rare examples that do not appear frequently in the training data, such as vehicles with irregular geometries. Most studies in the long-tail literature focus on class-imbalanced classification problems with known imbalanced label counts per class, but they are not directly applicable to the intra-class long-tail examples in problems with large intra-class variations such as 3D object detection, where instances with the same class label can have drastically varied properties such as shapes and sizes. Other works propose to mitigate this problem using active learning based on the criteria of uncertainty, difficulty, or diversity. In this study, we identify a new conceptual dimension - rareness - to mine new data for improving the long-tail performance of models. We show that rareness, as opposed to difficulty, is the key to data-centric improvements for 3D detectors, since rareness is the result of a lack in data support while difficulty is related to the fundamental ambiguity in the problem. We propose a general and effective method to identify the rareness of objects based on density estimation in the feature space using flow models, and propose a principled cost-aware formulation for mining rare object tracks, which improves overall model performance, but more importantly - significantly improves the performance for rare objects (by 30.97\%