论文标题
与主动多对象跟踪的坐标对准多相机协作
Coordinate-Aligned Multi-Camera Collaboration for Active Multi-Object Tracking
论文作者
论文摘要
主动多对象跟踪(AMOT)是一项由集中式系统控制相机以自动和协作调整其姿势的任务,以最大程度地覆盖其共享视野中目标的覆盖范围。在AMOT中,每个摄像机仅从其观察值中收到部分信息,这可能会误导相机以采取本地最佳动作。此外,全局目标,即对物体的最大覆盖范围,很难直接优化。为了解决上述问题,我们为AMOT提出了一个坐标对准的多相机协作系统。在我们的方法中,我们将每个相机视为代理,并使用多代理增强学习解决方案来解决AMOT。为了表示每个代理的观察,我们首先用图像检测器识别摄像机视图中的目标,然后在3D环境中对齐目标的坐标。我们根据全球覆盖范围以及四个个人奖励条款来定义每个代理商的奖励。代理的操作策略由基于价值的Q网络得出。据我们所知,我们是第一个研究AMOT任务的人。为了培训和评估系统的功效,我们建立了一个虚拟但可信的3D环境,称为“足球法庭”,以模仿现实世界中的AMOT场景。实验结果表明,我们的系统的覆盖率为71.88%,表现优于基线方法8.9%。
Active Multi-Object Tracking (AMOT) is a task where cameras are controlled by a centralized system to adjust their poses automatically and collaboratively so as to maximize the coverage of targets in their shared visual field. In AMOT, each camera only receives partial information from its observation, which may mislead cameras to take locally optimal action. Besides, the global goal, i.e., maximum coverage of objects, is hard to be directly optimized. To address the above issues, we propose a coordinate-aligned multi-camera collaboration system for AMOT. In our approach, we regard each camera as an agent and address AMOT with a multi-agent reinforcement learning solution. To represent the observation of each agent, we first identify the targets in the camera view with an image detector, and then align the coordinates of the targets in 3D environment. We define the reward of each agent based on both global coverage as well as four individual reward terms. The action policy of the agents is derived with a value-based Q-network. To the best of our knowledge, we are the first to study the AMOT task. To train and evaluate the efficacy of our system, we build a virtual yet credible 3D environment, named "Soccer Court", to mimic the real-world AMOT scenario. The experimental results show that our system achieves a coverage of 71.88%, outperforming the baseline method by 8.9%.