看到五千种玫瑰

论文标题

看到五千种玫瑰

Seeing a Rose in Five Thousand Ways

论文作者

Zhang, Yunzhi, Wu, Shangzhe, Snavely, Noah, Wu, Jiajun

论文摘要

什么是玫瑰在视觉上？ A玫瑰包括其内在物质，包括几何形状，纹理和特定对象类别的材料的分布。有了这些内在特性的了解，我们可能会在不同的姿势和不同的照明条件下呈现不同大小和形状的玫瑰。在这项工作中，我们构建了一个生成模型，该模型学会从单个图像（例如花束的照片）捕获这种对象固有。这样的图像包括对象类型的多个实例。这些实例都共享相同的内在物质，但由于这些内在范围内的差异和外在因素（例如姿势和照明）的差异而存在不同。实验表明，我们的模型成功地学习了各种对象的对象固有（几何，纹理和材料的分布），每个对象都来自一个Internet映像。我们的方法在多个下游任务上取得了卓越的结果，包括固有的图像分解，形状和图像生成，视图合成和重新效果。

What is a rose, visually? A rose comprises its intrinsics, including the distribution of geometry, texture, and material specific to its object category. With knowledge of these intrinsic properties, we may render roses of different sizes and shapes, in different poses, and under different lighting conditions. In this work, we build a generative model that learns to capture such object intrinsics from a single image, such as a photo of a bouquet. Such an image includes multiple instances of an object type. These instances all share the same intrinsics, but appear different due to a combination of variance within these intrinsics and differences in extrinsic factors, such as pose and illumination. Experiments show that our model successfully learns object intrinsics (distribution of geometry, texture, and material) for a wide range of objects, each from a single Internet image. Our method achieves superior results on multiple downstream tasks, including intrinsic image decomposition, shape and image generation, view synthesis, and relighting.

下载PDF全文

下载文献需遵守相关版权规定

论文标题