论文标题

重新考虑多对象跟踪中检测与REID之间的竞争

Rethinking the competition between detection and ReID in Multi-Object Tracking

论文作者

Liang, Chao, Zhang, Zhipeng, Zhou, Xue, Li, Bing, Zhu, Shuyuan, Hu, Weiming

论文摘要

由于精确度和速度平衡,共同学习检测和识别嵌入的单发模型引起了多对象跟踪(MOT)的极大关注。但是,由于将它们视为单次跟踪范式中的两个孤立任务,因此被忽略了检测与重新识别(REID)之间的固有差异和关系(REID)。与现有的两阶段方法相比,这会导致劣质性能。在本文中,我们首先剖析了这两个任务的推理过程,这表明它们之间的竞争不可避免地会破坏与任务有关的表示。为了解决这个问题,我们提出了一个新颖的互惠网络(REN),具有自相关和交叉关系设计,以便促使每个分支更好地学习任务依赖性表示。拟议的模型旨在减轻有害的任务竞争,同时改善检测与REID之间的合作。此外,我们引入了一个比例吸引的注意网络(SAAN),该网络可防止语义水平的未对准以提高ID嵌入的关联能力。通过将两个精心设计的网络集成到一个单一的在线MOT系统中,我们构建了一个强大的MOT跟踪器,即Cstrack。我们的跟踪器在没有其他铃铛和哨声的MOT16,MOT17和MOT20数据集上实现了最先进的性能。此外,CSTRACK效率高,在单个现代GPU上以16.4 fps的速度运行,其轻量级版本甚至以34.6 fps的速度运行。完整的代码已在https://github.com/judasdie/sots上发布。

Due to balanced accuracy and speed, one-shot models which jointly learn detection and identification embeddings, have drawn great attention in multi-object tracking (MOT). However, the inherent differences and relations between detection and re-identification (ReID) are unconsciously overlooked because of treating them as two isolated tasks in the one-shot tracking paradigm. This leads to inferior performance compared with existing two-stage methods. In this paper, we first dissect the reasoning process for these two tasks, which reveals that the competition between them inevitably would destroy task-dependent representations learning. To tackle this problem, we propose a novel reciprocal network (REN) with a self-relation and cross-relation design so that to impel each branch to better learn task-dependent representations. The proposed model aims to alleviate the deleterious tasks competition, meanwhile improve the cooperation between detection and ReID. Furthermore, we introduce a scale-aware attention network (SAAN) that prevents semantic level misalignment to improve the association capability of ID embeddings. By integrating the two delicately designed networks into a one-shot online MOT system, we construct a strong MOT tracker, namely CSTrack. Our tracker achieves the state-of-the-art performance on MOT16, MOT17 and MOT20 datasets, without other bells and whistles. Moreover, CSTrack is efficient and runs at 16.4 FPS on a single modern GPU, and its lightweight version even runs at 34.6 FPS. The complete code has been released at https://github.com/JudasDie/SOTS.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源