摘要 Abstract
对于真实噪声去除的有监督训练而言,由于难以收集配对的噪声图像和干净图像的大规模数据集,面临着诸多挑战。近期的方法试图通过利用未配对的清洁图像和噪声图像数据集来解决这一问题。一些方法通过生成合成的清洁-噪声对,以监督的方式训练去噪器。然而,这些方法往往因合成噪声图像与真实噪声图像之间的分布差距而表现不佳。为缓解这一问题,我们提出了一种基于输入稀疏化的解决方案,具体采用随机输入掩码。我们的方法被称为掩码、修复和去噪(Mask, Inpaint and Denoise, MID),训练一个去噪器同时进行去噪和修复合成的清洁-噪声对。一方面,输入稀疏化减少了合成噪声图像与真实噪声图像之间的差距;另一方面,以监督方式训练的修复器可以通过预测缺失的清洁像素(利用剩余未掩码像素)准确重构稀疏输入。我们的方法从合成高斯噪声采样器开始,并通过迭代使用由去噪器预测结果衍生出的噪声数据集对其进行逐步优化。噪声数据集通过在每次迭代中用真实噪声图像减去预测的伪清洁图像创建。其核心思想是改进去噪器可以生成更准确的噪声数据集,从而得到更好的噪声采样器。我们在真实噪声图像数据集上进行了广泛的实验,验证了该方法的表现与现有的无监督去噪方法相比具有竞争力。
Supervised training for real-world denoising presents challenges due to the difficulty of collecting large datasets of paired noisy and clean images. Recent methods have attempted to address this by utilizing unpaired datasets of clean and noisy images. Some approaches leverage such unpaired data to train denoisers in a supervised manner by generating synthetic clean-noisy pairs. However, these methods often fall short due to the distribution gap between synthetic and real noisy images. To mitigate this issue, we propose a solution based on input sparsification, specifically using random input masking. Our method, which we refer to as Mask, Inpaint and Denoise (MID), trains a denoiser to simultaneously denoise and inpaint synthetic clean-noisy pairs. On one hand, input sparsification reduces the gap between synthetic and real noisy images. On the other hand, an inpainter trained in a supervised manner can still accurately reconstruct sparse inputs by predicting missing clean pixels using the remaining unmasked pixels. Our approach begins with a synthetic Gaussian noise sampler and iteratively refines it using a noise dataset derived from the denoiser's predictions. The noise dataset is created by subtracting predicted pseudo-clean images from real noisy images at each iteration. The core intuition is that improving the denoiser results in a more accurate noise dataset and, consequently, a better noise sampler. We validate our method through extensive experiments on real-world noisy image datasets, demonstrating competitive performance compared to existing unsupervised denoising methods.