金字塔功能对齐网络，用于视频脱张

论文标题

金字塔功能对齐网络，用于视频脱张

Pyramid Feature Alignment Network for Video Deblurring

论文作者

Tao, Leitian, Chen, Zhenzhong

论文摘要

由于各种模糊原因，视频脱毛仍然是一项具有挑战性的任务。传统方法已经考虑了如何通过单尺度对齐来利用相邻框架进行恢复。但是，它们通常遭受严重模糊引起的不对对准。在这项工作中，我们旨在更好地利用具有高效特征对齐方式的相邻帧。我们建议使用视频脱张的金字塔功能对齐网络（PFAN）。首先，在对齐之前，用结构到详细的采样（SDD）提取了模糊框架的多尺度特征。这种降采样策略使边缘更加尖锐，这有助于对齐。然后，我们将功能在每个刻度上对齐，并以相应的比例重建图像。该策略有效地监督了每个量表的一致性，从而克服了在对齐阶段的上述量表中传播错误的问题。为了更好地应对复杂和大动作的挑战，而不是分别对每个尺度上的特征对齐功能，而是使用较低的运动信息来指导较高的运动估计。因此，提出了级联引导的可变形比对（CGDA）将粗运动整合到可变形的卷积中，以使其更精确，更准确。正如在广泛的实验中所证明的那样，与最先进的方法相比，我们提出的PFAN以竞争速度实现了卓越的性能。

Video deblurring remains a challenging task due to various causes of blurring. Traditional methods have considered how to utilize neighboring frames by the single-scale alignment for restoration. However, they typically suffer from misalignment caused by severe blur. In this work, we aim to better utilize neighboring frames with high efficient feature alignment. We propose a Pyramid Feature Alignment Network (PFAN) for video deblurring. First, the multi-scale feature of blurry frames is extracted with the strategy of Structure-to-Detail Downsampling (SDD) before alignment. This downsampling strategy makes the edges sharper, which is helpful for alignment. Then we align the feature at each scale and reconstruct the image at the corresponding scale. This strategy effectively supervises the alignment at each scale, overcoming the problem of propagated errors from the above scales at the alignment stage. To better handle the challenges of complex and large motions, instead of aligning features at each scale separately, lower-scale motion information is used to guide the higher-scale motion estimation. Accordingly, a Cascade Guided Deformable Alignment (CGDA) is proposed to integrate coarse motion into deformable convolution for finer and more accurate alignment. As demonstrated in extensive experiments, our proposed PFAN achieves superior performance with competitive speed compared to the state-of-the-art methods.

下载PDF全文

下载文献需遵守相关版权规定

论文标题