Machine Learning to Denoise Images for Better OCR Accuracy

This project is an adaptation of this tutorial and used only for learning purposes: https://www.pyimagesearch.com/2021/10/20/using-machine-learning-to-denoise-images-for-better-ocr-accuracy/#download-the-code

Setting Up the project 🚀

First and foremost clone the project with:

$ git clone https://github.com/AntonioBriPerez/Ocr-Denoiser

You don't need to extract the zip files in order to train the model.

Once you have cloned the repository you will need to extract the features from the noisy images. This script will extract 5 x 5 - 25-d feature vectors and the it will extract the target (or cleaned) pixel value from the correspondiente ground truth standard image. And then, this features will be saved in a csv file (~200MB). To extract this features you will have to execute:

$ python3 build_features.py

It will generate the following output:

Once you have done that we will have to load those features in a proper split to train our Random Forest Regressor. That code is implemented in the file train_denoiser.py. To train the model you will have to run the command:

$ python train_denoiser.py

And it will generate:

To check that the model performs good you can execute:

$ python3 denoise_document.py --testing denoising-dirty-documents/test

And some images will be written in disk so you can check the original image and the image obtained by the model we just have trained.

Any doubts or suggestions please open an issue.

Machine Leaning applied to denoise images to improve OCR Accuracy

Related tags

Overview

Machine Learning to Denoise Images for Better OCR Accuracy

Setting Up the project 🚀

Owner

Antonio Bri Pérez

Driver Drowsiness Detection with OpenCV & Dlib

Text recognition (optical character recognition) with deep learning methods.

An organized collection of tutorials and projects created for aspriring computer vision students.

Repository for playing the computer vision apps: People analytics on Raspberry Pi.

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

A curated list of papers, code and resources pertaining to image composition

Handwritten Text Recognition (HTR) using TensorFlow 2.x

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

This project proposes a camera vision based cursor control system, using hand moment captured from a webcam through a landmarks of hand by using Mideapipe module

TableBank: A Benchmark Dataset for Table Detection and Recognition

A dataset handling library for computer vision datasets in LOST-fromat

Character Segmentation using TensorFlow

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

GDB python tool to pretty print and debug c++ xtensor containers

Learning Camera Localization via Dense Scene Matching, CVPR2021

Convert scans of handwritten notes to beautiful, compact PDFs

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

Distilling Knowledge via Knowledge Review, CVPR 2021

graph learning code for ogb