论文标题
在NUSCENES数据集上结构识别和类平衡的3D对象检测
Structure Aware and Class Balanced 3D Object Detection on nuScenes Dataset
论文作者
论文摘要
3-D对象检测对于自动驾驶至关重要。由于其准确的深度信息,基于点云的方法在3-D对象检测中变得越来越流行。 Nutonomy的Nuscenes数据集大大扩展了常用的数据集,例如大小,传感器模式,类别和注释数字。但是,它患有严重的阶级失衡。班级均衡的分组和抽样纸解决了这个问题,并提出了增强和抽样策略。但是,该模型的定位精度受到缩小特征图中空间信息丢失的影响。我们建议通过设计一个充分利用3D点云的结构信息来提高CBGS模型的性能,以提高本地化精度。可拆卸的辅助网络通过两个点级的监督共同优化,即前景细分和中心估计。辅助网络在推理过程中不会引入任何额外的计算,因为它可以在测试时分离。
3-D object detection is pivotal for autonomous driving. Point cloud based methods have become increasingly popular for 3-D object detection, owing to their accurate depth information. NuTonomy's nuScenes dataset greatly extends commonly used datasets such as KITTI in size, sensor modalities, categories, and annotation numbers. However, it suffers from severe class imbalance. The Class-balanced Grouping and Sampling paper addresses this issue and suggests augmentation and sampling strategy. However, the localization precision of this model is affected by the loss of spatial information in the downscaled feature maps. We propose to enhance the performance of the CBGS model by designing an auxiliary network, that makes full use of the structure information of the 3D point cloud, in order to improve the localization accuracy. The detachable auxiliary network is jointly optimized by two point-level supervisions, namely foreground segmentation and center estimation. The auxiliary network does not introduce any extra computation during inference, since it can be detached at test time.