论文标题

夜间热红外图像着色的记忆引导的协作关注

Memory-Guided Collaborative Attention for Nighttime Thermal Infrared Image Colorization

论文作者

Luo, Fu-Ya, Cao, Yi-Jun, Yang, Kai-Fu, Li, Yong-Jie

论文摘要

夜间热红外(NTIR)图像着色(也称为NTIR图像翻译为白天颜色图像(NTIR2DC))是一个有希望的研究方向,可促进在不利条件下对人类和智能系统的夜间现场感知(例如,完整的黑暗)。但是,以前开发的方法对于小样本类别的着色性能较差。此外,降低伪标签中的高置信度噪声并解决翻译过程中图像梯度消失的问题仍然不足,并且在翻译过程中防止边缘扭曲也很具有挑战性。为了解决上述问题,我们提出了一个新颖的学习框架,称为记忆引导的协作性关注生成对抗网络(MORNGAN),该框架受到人类的类似推理机制的启发。具体而言,设计了记忆引导的样本选择策略和自适应协作注意力损失,以增强小样本类别的语义保存。此外,我们提出了一个在线语义蒸馏模块,以挖掘和完善NTIR图像的伪标记。此外,引入有条件的梯度修复损失,以减少翻译过程中边缘失真。在NTIR2DC任务上进行的广泛实验表明,在语义保存和边缘一致性方面,提出的Morngan明显优于其他图像到图像翻译方法,这有助于显着提高对象检测精度。

Nighttime thermal infrared (NTIR) image colorization, also known as translation of NTIR images into daytime color images (NTIR2DC), is a promising research direction to facilitate nighttime scene perception for humans and intelligent systems under unfavorable conditions (e.g., complete darkness). However, previously developed methods have poor colorization performance for small sample classes. Moreover, reducing the high confidence noise in pseudo-labels and addressing the problem of image gradient disappearance during translation are still under-explored, and keeping edges from being distorted during translation is also challenging. To address the aforementioned issues, we propose a novel learning framework called Memory-guided cOllaboRative atteNtion Generative Adversarial Network (MornGAN), which is inspired by the analogical reasoning mechanisms of humans. Specifically, a memory-guided sample selection strategy and adaptive collaborative attention loss are devised to enhance the semantic preservation of small sample categories. In addition, we propose an online semantic distillation module to mine and refine the pseudo-labels of NTIR images. Further, conditional gradient repair loss is introduced for reducing edge distortion during translation. Extensive experiments on the NTIR2DC task show that the proposed MornGAN significantly outperforms other image-to-image translation methods in terms of semantic preservation and edge consistency, which helps improve the object detection accuracy remarkably.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源