论文标题
滚动快门摄像机的上下文感知视频重建
Context-Aware Video Reconstruction for Rolling Shutter Cameras
论文作者
论文摘要
随着滚动百叶窗(RS)摄像机的普遍存在,从连续两个RS框架中恢复潜在的全球快门(GS)视频变得越来越有吸引力,这也使人们对现实主义的需求更高。使用深层神经网络或优化的现有解决方案实现了有希望的性能。但是,这些方法通过RS模型通过图像翘曲生成中间GS框架,这不可避免地会导致黑洞和明显的运动伪像。在本文中,我们通过提出上下文感知的GS视频重建体系结构来减轻这些问题。它促进了诸如遮挡推理,运动补偿和时间抽象之类的优势。具体而言,我们首先估计双侧运动场,以便将两个RS帧的像素相应地扭曲到一个共同的GS框架上。然后,提出了一种完善方案,以指导GS框架合成以及双侧遮挡掩模,以在任意时间在任意时间产生高保真的GS视频帧。此外,我们得出了一个近似的双侧运动场模型,该模型可以作为为相关任务提供简单但有效的GS框架初始化的替代方法。关于合成和真实数据的实验表明,就客观指标和主观视觉质量而言,我们的方法比最先进的方法实现了优于最先进的方法。代码可在\ url {https://github.com/gitcvfb/cvr}上找到。
With the ubiquity of rolling shutter (RS) cameras, it is becoming increasingly attractive to recover the latent global shutter (GS) video from two consecutive RS frames, which also places a higher demand on realism. Existing solutions, using deep neural networks or optimization, achieve promising performance. However, these methods generate intermediate GS frames through image warping based on the RS model, which inevitably result in black holes and noticeable motion artifacts. In this paper, we alleviate these issues by proposing a context-aware GS video reconstruction architecture. It facilitates the advantages such as occlusion reasoning, motion compensation, and temporal abstraction. Specifically, we first estimate the bilateral motion field so that the pixels of the two RS frames are warped to a common GS frame accordingly. Then, a refinement scheme is proposed to guide the GS frame synthesis along with bilateral occlusion masks to produce high-fidelity GS video frames at arbitrary times. Furthermore, we derive an approximated bilateral motion field model, which can serve as an alternative to provide a simple but effective GS frame initialization for related tasks. Experiments on synthetic and real data show that our approach achieves superior performance over state-of-the-art methods in terms of objective metrics and subjective visual quality. Code is available at \url{https://github.com/GitCVfb/CVR}.