ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

Last update: Dec 08, 2021

VistaOCR

Publications

"How to Efficiently Increase Resolution in Neural OCR Models". Stephen Rawls, Huaigu Cao, Joe Mathai, Prem Natarajan. IEEE Workshop on Arabic Script Analysis and Recognition (ASAR) 2018.

"Combining Convolutional Neural Networks and LSTMs for Segmentation Free OCR". Stephen Rawls, Huaigu Cao, Senthil Kumar, Prem Natarajan. International Conference on Document Analysis and Recognition (ICDAR) 2017.

"Combining Deep Learning and Language Modeling for Segmentation-free OCR From Raw Pixels". Stephen Rawls, Huaigu Cao, Ekraam Sabir, Prem Natarajan. IEEE Workshop on Arabic Script Analysis and Recognition (ASAR) 2017.

Model

Pretrained Models

Coming soon. Pretrained models for English, French, and Arabic handwriting.

Performance Numbers

Coming soon. Expected character and word error rates on public datasets.
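
Until those numbers are published, the following is a minimal sketch of how character and word error rates are commonly computed: total Levenshtein edit distance divided by total reference length. It is not VistaOCR's evaluation code. The function names `edit_distance` and `error_rate` are hypothetical, and the whitespace tokenization used for words is an assumption that the project's own scoring may not share.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from the reference
                            curr[j - 1] + 1,      # insertion into the reference
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(refs, hyps, unit="char"):
    """Corpus-level error rate: summed edit distance / summed reference length.

    unit="char" gives a character error rate, unit="word" a word error rate.
    Words are split on whitespace (an assumption made for this sketch).
    """
    if unit not in ("char", "word"):
        raise ValueError("unit must be 'char' or 'word'")
    total_errors = 0
    total_length = 0
    for ref, hyp in zip(refs, hyps):
        r = list(ref) if unit == "char" else ref.split()
        h = list(hyp) if unit == "char" else hyp.split()
        total_errors += edit_distance(r, h)
        total_length += len(r)
    if total_length == 0:
        raise ValueError("reference text is empty")
    return total_errors / total_length
```

For example, `error_rate(["hello world"], ["hallo world"], unit="word")` gives 0.5, because one of the two reference words is wrong. The same pair has a character error rate of 1/11.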

How to Train

Coming soon.

How to Decode using Existing Model

Coming soon.

Citation

@inproceedings{vistaocr,
  author    = {Stephen Rawls and Huaigu Cao and Senthil Kumar and Prem Natarajan},
  title     = {Combining Convolutional Neural Networks and LSTMs for Segmentation Free OCR},
  booktitle = {Proc. ICDAR},
  year      = {2017},
  url       = {https://doi.org/10.1109/ICDAR.2017.34},
  doi       = {10.1109/ICDAR.2017.34}
}

Owner

ISI Center for Vision, Image, Speech, and Text Analytics
