Deep Halftoning with Reversible Binary Pattern

ICCV Paper | Project Website | BibTex

Overview

Existing halftoning algorithms usually drop colors and fine details when dithering color images with binary dot patterns, which makes it extremely difficult to recover the original information. To dispense the recovery trouble in future, we propose a novel halftoning technique that dithers a color image into binary halftone with decent restorability to the original input. The key idea is to implicitly embed those previously dropped information into the binary dot patterns. So, the halftone pattern not only serves to reproduce the image tone, maintain the blue-noise randomness, but also represents the color information and fine details. See the examples illustrated below.

Run

Requirements:
- Basic variant infomation: Python 3.7 and Pytorch 1.0.1.
- Create a virutal environment with satisfied requirements:
```
conda env create -f requirement.yaml
```
Training:
- Place your training set/validation set under dataset/ per the exampled file organization. Or download our [preprocessed full dataset](coming soon).
- Warm-up stage (optional):
```
python train_warm.py --config scripts/invhalf_warm.json
```
  If this stage skipped, please download the pretrained warm-up weight and place it in checkpoints/, which is required at joint-train stage.
- Joint-train stage:
```
python train.py --config scripts/invhalf_full.json
```
Testing:
- Download the pretrained weight below and put it under checkpoints/.
- Place your images in any accesible directory, e.g. test_imgs/.
- Dither the input images and restore from the generated halftones
```
python inference_fast.py --model checkpoints/model_best.pth.tar --data_dir ./test_imgs --save_dir ./result
```

Copyright and License

You are granted with the LICENSE for both academic and commercial usages.

Citation

If any part of our paper and code is helpful to your work, please generously cite with:

@inproceedings{xia-2021-inverthalf,
	author   = {Menghan Xia and Wenbo Hu and Xueting Liu and Tien-Tsin Wong},
	title    = {Deep Halftoning with Reversible Binary Pattern},
	booktitle = {{IEEE/CVF} International Conference on Computer Vision (ICCV)},
	year = {2021}
}

Deep Halftoning with Reversible Binary Pattern

Related tags

Overview

Deep Halftoning with Reversible Binary Pattern

ICCV Paper | Project Website | BibTex

Overview

Run

Copyright and License

Citation

Owner

Menghan Xia

A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.

Code for ACL2021 paper Consistency Regularization for Cross-Lingual Fine-Tuning.

The project was to detect traffic signs, based on the Megengine framework.

Research code for Arxiv paper "Camera Motion Agnostic 3D Human Pose Estimation"

Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction

PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds

The full training script for Enformer (Tensorflow Sonnet) on TPU clusters

An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates neural fields, predictive coding, top-down-bottom-up, and attention (consensus between columns)

A highly efficient, fast, powerful and light-weight anime downloader and streamer for your favorite anime.

Code for our ALiBi method for transformer language models.

Code for our paper "Interactive Analysis of CNN Robustness"

A chemical analysis of lipophilicities & molecule drawings including ML

Multiple custom object count and detection using YOLOv3-Tiny method

Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)

Code for our paper Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

smc.covid is an R package related to the paper A sequential Monte Carlo approach to estimate a time varying reproduction number in infectious disease models: the COVID-19 case by Storvik et al

Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation

NOMAD - A blackbox optimization software

Kindle is an easy model build package for PyTorch.