自学与合成数据集：视频降级的背景下哪个较小的邪恶？

论文标题

自学与合成数据集：视频降级的背景下哪个较小的邪恶？

Self-supervision versus synthetic datasets: which is the lesser evil in the context of video denoising?

论文作者

Dewil, Valéry, Barral, Aranud, Facciolo, Gabriele, Arias, Pablo

论文摘要

监督培训导致了图像和视频降解的最新结果。但是，它对真实数据的应用受到限制，因为它需要很难获得的大量嘈杂清洁对的数据集。因此，网络通常经过现实的合成数据培训。最近，已经提出了一些自我监督的框架，即不需要地面真相，直接在嘈杂的数据上训练这种deno网络。关于综合deo的问题，监督培训的问题优于自我监督的方法，但是近年来，差距变得更狭窄，尤其是对于视频而言。在本文中，我们提出了一项研究，旨在确定哪种方法是培训deno网络的真正原始视频的最佳方法：对合成现实数据的监督或对真实数据的自学方法。一项具有定量结果的完整研究在具有真实运动的自然视频的情况下是不可能的，因为没有具有清洁噪声对的数据集。我们通过考虑三个独立的实验来解决这个问题，其中我们将两个框架进行比较。我们发现，实际数据上的自学意识超过合成数据的监督，并且在正常照明条件下，性能下降是由于合成的地面真相产生而不是噪声模型所致。

Supervised training has led to state-of-the-art results in image and video denoising. However, its application to real data is limited since it requires large datasets of noisy-clean pairs that are difficult to obtain. For this reason, networks are often trained on realistic synthetic data. More recently, some self-supervised frameworks have been proposed for training such denoising networks directly on the noisy data without requiring ground truth. On synthetic denoising problems supervised training outperforms self-supervised approaches, however in recent years the gap has become narrower, especially for video. In this paper, we propose a study aiming to determine which is the best approach to train denoising networks for real raw videos: supervision on synthetic realistic data or self-supervision on real data. A complete study with quantitative results in case of natural videos with real motion is impossible since no dataset with clean-noisy pairs exists. We address this issue by considering three independent experiments in which we compare the two frameworks. We found that self-supervision on the real data outperforms supervision on synthetic data, and that in normal illumination conditions the drop in performance is due to the synthetic ground truth generation, not the noise model.

下载PDF全文

下载文献需遵守相关版权规定

论文标题