论文标题
对于3D场景流网络很重要
What Matters for 3D Scene Flow Network
论文作者
论文摘要
点云的3D场景流量估计是计算机视觉中的低级3D运动感知任务。流嵌入是场景流估计中常用的技术,它编码两个连续帧之间的点运动。因此,对于流动嵌入捕获运动的正确总方向是至关重要的。但是,以前的作品仅在本地搜索以确定软信号,而忽略了遥远的点,而遥远的点是实际匹配的点。另外,估计的对应关系通常来自相邻点云的正向,并且可能与从向后方向获得的估计对应关系不一致。为了解决这些问题,我们在初始场景流量估计中提出了一个新颖的全能嵌入层,并具有向后的可靠性验证。此外,我们研究并比较了3D场景流网络的关键组件中的几种设计选择,包括点相似度计算,预测变量的输入元素以及预测变量和改进级别的设计。仔细选择了最有效的设计后,我们能够提出一个模型,该模型可以在FlyingThings3D和Kitti场景流数据集上实现最先进的性能。我们提出的模型超过了所有现有方法的FlythThings3D数据集至少38.2%,而Kitti场景流数据集则超过了EPE3D Metric的24.7%。我们在https://github.com/irmvlab/3dflow上发布代码。
3D scene flow estimation from point clouds is a low-level 3D motion perception task in computer vision. Flow embedding is a commonly used technique in scene flow estimation, and it encodes the point motion between two consecutive frames. Thus, it is critical for the flow embeddings to capture the correct overall direction of the motion. However, previous works only search locally to determine a soft correspondence, ignoring the distant points that turn out to be the actual matching ones. In addition, the estimated correspondence is usually from the forward direction of the adjacent point clouds, and may not be consistent with the estimated correspondence acquired from the backward direction. To tackle these problems, we propose a novel all-to-all flow embedding layer with backward reliability validation during the initial scene flow estimation. Besides, we investigate and compare several design choices in key components of the 3D scene flow network, including the point similarity calculation, input elements of predictor, and predictor & refinement level design. After carefully choosing the most effective designs, we are able to present a model that achieves the state-of-the-art performance on FlyingThings3D and KITTI Scene Flow datasets. Our proposed model surpasses all existing methods by at least 38.2% on FlyingThings3D dataset and 24.7% on KITTI Scene Flow dataset for EPE3D metric. We release our codes at https://github.com/IRMVLab/3DFlow.