论文标题

训练您的数据处理器:分布感知和错误补偿坐标是针对人姿势估计的解码

Train Your Data Processor: Distribution-Aware and Error-Compensation Coordinate Decoding for Human Pose Estimation

论文作者

Yang, Feiyu, Song, Zhan, Xiao, Zhenzhong, Chen, Yu, Pan, Zhe, Zhang, Min, Xue, Min, Mo, Yaoyang, Zhang, Yao, Guan, Guoxiong, Qian, Beibei

论文摘要

最近,人类姿势估计的领先表现由基于热图的方法主导。虽然是热图处理的基本组成部分,但热图解码(即将热图转换为坐标)仅接受有限的研究,就我们的最佳知识而言。这项工作通过研究热图解码处理,特别关注整个预测过程中引入的错误,从而填补了空白。我们发现,基于热图的方法的误差令人惊讶地显着,但以前被普遍忽略。鉴于发现的重要性,我们进一步揭示了先前广泛使用的热图解码方法的固有局限性,因此提出了分布感知和错误补偿坐标解码(DAEC)。 DAEC充当模型不足的插件,从培训数据中学习了其解码策略,并明显地提高了具有可忽略的额外计算的各种最先进的人姿势估计模型的性能。具体而言,配备了DAEC,SimpleBaseline-Resnet152-256x192和HRNET-W48-256X192分别显着提高了2.6 AP和2.9 AP,分别在CoCo上实现了72.6 AP和75.7 AP。此外,使用PCKH0.1公制,HRNET-W32-256X256和RESNET-152-256X256框架在MPII上享有8.4%和7.8%的戏剧性促销。在这两个共同的基准上进行的广泛实验表明,DAEC通过相当大的边距超过了竞争对手,从而支持了我们新颖的热图解码想法的合理性和一般性。该项目可在https://github.com/fyang235/daec上找到。

Recently, the leading performance of human pose estimation is dominated by heatmap based methods. While being a fundamental component of heatmap processing, heatmap decoding (i.e. transforming heatmaps to coordinates) receives only limited investigations, to our best knowledge. This work fills the gap by studying the heatmap decoding processing with a particular focus on the errors introduced throughout the prediction process. We found that the errors of heatmap based methods are surprisingly significant, which nevertheless was universally ignored before. In view of the discovered importance, we further reveal the intrinsic limitations of the previous widely used heatmap decoding methods and thereout propose a Distribution-Aware and Error-Compensation Coordinate Decoding (DAEC). Serving as a model-agnostic plug-in, DAEC learns its decoding strategy from training data and remarkably improves the performance of a variety of state-of-the-art human pose estimation models with negligible extra computation. Specifically, equipped with DAEC, the SimpleBaseline-ResNet152-256x192 and HRNet-W48-256x192 are significantly improved by 2.6 AP and 2.9 AP achieving 72.6 AP and 75.7 AP on COCO, respectively. Moreover, the HRNet-W32-256x256 and ResNet-152-256x256 frameworks enjoy even more dramatic promotions of 8.4% and 7.8% on MPII with PCKh0.1 metric. Extensive experiments performed on these two common benchmarks, demonstrates that DAEC exceeds its competitors by considerable margins, backing up the rationality and generality of our novel heatmap decoding idea. The project is available at https://github.com/fyang235/DAEC.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源