Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Last update: Sep 26, 2022

Related tags

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral) : Official Project Page

This repository provides the official PyTorch implementation of the following paper:

Unsupervised Image Denoising with Frequency Domain Knowledge

Nahyun Kim* (KAIST), Donggon Jang* (KAIST), Sunhyeok Lee (KAIST), Bomi Kim (KAIST), and Dae-Shik Kim (KAIST) (*The authors have equally contributed.)

BMVC 2021, Accepted as Oral Paper.

Abstract: Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on high-frequency bands, justifying the use of low-pass filters as part of conventional image preprocessing steps. However, most learning-based denoising methods utilize only one-sided information from the spatial domain without considering frequency domain information. To address this limitation, in this study we propose a frequency-sensitive unsupervised denoising method. To this end, a generative adversarial network (GAN) is used as a base structure. Subsequently, we include spectral discriminator and frequency reconstruction loss to transfer frequency knowledge into the generator. Results using natural and synthetic datasets indicate that our unsupervised learning method augmented with frequency information achieves state-of-the-art denoising performance, suggesting that frequency domain information could be a viable factor in improving the overall performance of unsupervised learning-based methods.

Requirements

To install requirements:

conda env create -n [your env name] -f environment.yaml
conda activate [your env name]

To train the model

Synthetic Noise (AWGN)

Download DIV2K dataset for training in here
Randomly split the DIV2K dataset into Clean/Noisy set. Please refer the .txt files in split_data.
Place the splitted dataset(DIV2K_C and DIV2K_N) in ./dataset directory.

dataset
└─── DIV2K_C
└─── DIV2K_N
└─── test

Use gen_dataset_synthetic.py to package dataset in the h5py format.
After that, run this command:

sh ./scripts/train_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/train_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/train_awgn_sigma50.sh # AWGN with a noise level = 50

After finishing the training, .pth file is stored in ./exp/[exp_name]/[seed_number]/saved_models/ directory.

Real-World Noise

Download SIDD-Medium Dataset for training in here
Radnomly split the SIDD-Medium Dataset into Clean/Noisy set. Please refer the .txt files in split_data.
Place the splitted dataset(SIDD_C and SIDD_N) in ./dataset directory.

dataset
└─── SIDD_C
└─── SIDD_N
└─── test

Use gen_dataset_real.py to package dataset in the h5py format.
After that, run this command:

sh ./scripts/train_real.sh

After finishing the training, .pth file is stored in ./exp/[exp_name]/[seed_number]/saved_models/ directory.

To evaluate the model

Synthetic Noise (AWGN)

Download CBSD68 dataset for evaluation in here
Place the dataset in ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/test_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/test_awgn_sigma50.sh # AWGN with a noise level = 50

Real-World Noise

Download the SIDD test dataset for evaluation in here
Place the dataset in ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_real.sh

Pre-trained model

We provide pre-trained models in ./checkpoints directory.

checkpoints
|   AWGN_sigma15.pth # pre-trained model (AWGN with a noise level = 15)
|   AWGN_sigma25.pth # pre-trained model (AWGN with a noise level = 25)
|   AWGN_sigma50.pth # pre-trained model (AWGN with a noise level = 50)
|   SIDD.pth # pre-trained model (Real-World noise)

Acknowledgements

This code is built on U-GAT-IT,CARN, SSD-GAN. We thank the authors for sharing their codes.

Contact

If you have any questions, feel free to contact me ([email protected])

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Related tags

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral) : Official Project Page

Requirements

To train the model

Synthetic Noise (AWGN)

Real-World Noise

To evaluate the model

Synthetic Noise (AWGN)

Real-World Noise

Pre-trained model

Acknowledgements

Contact

Owner

Donggon Jang

(Preprint) Official PyTorch implementation of "How Do Vision Transformers Work?"

Official code for UnICORNN (ICML 2021)

A Deep Reinforcement Learning Framework for Stock Market Trading

Deep motion generator collections

RoFormer_pytorch

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)

GNN-based Recommendation Benchmark

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

A series of convenience functions to make basic image processing operations such as translation, rotation, resizing, skeletonization, and displaying Matplotlib images easier with OpenCV and Python.

Exploration of some patients clinical variables.

Python scripts form performing stereo depth estimation using the HITNET model in ONNX.

A cross-document event and entity coreference resolution system, trained and evaluated on the ECB+ corpus.

Code for "Intra-hour Photovoltaic Generation Forecasting based on Multi-source Data and Deep Learning Methods."

Official PaddlePaddle implementation of Paint Transformer

Code for classifying international patents based on the text of their titles/abstracts

Distilled coarse part of LoFTR adapted for compatibility with TensorRT and embedded divices

Code for WSDM 2022 paper, Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation.

Trajectory Prediction with Graph-based Dual-scale Context Fusion

An Implementation of SiameseRPN with Feature Pyramid Networks