It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

Last update: Jul 11, 2022

Related tags

Overview

OCR-Tool

It is a image ocr tool made in Python using the Tesseract-OCR engine with the pytesseract package and has a GUI. This is my second ever python project so feel free to make any suggestions

Release

To install it, extract the zip that you downloaded. Put it in a folder like program files. There is a file called OCR-Tool.exe. You could make a shortcut and put it on your desktop for easy access.
Windows might say it's dangerous to run and block it. Just click "more info" and there you can run it.
If you download the version without tesseract included please check the dependencies section for instructions on how to add it.

Version 1.1

With tesseract

https://drive.google.com/file/d/1EMS8cKsasorLRXpqVjLxo41nsAEk4SiF/view?usp=sharing

Without tesseract

https://drive.google.com/file/d/1O4EYF9EmawT0VRSM6U1XkDBndGQBxDRe/view?usp=sharing

Features

Modern GUI
Snipping tool (Credit to harupy's python snipping tool)
Open image from folder
Paste image from clipboard
Save text to .txt
Copy text to clipboard
Cancel snip

Dependencies

Tesseract OCR Engine (UB Mannheim). Install either version 4 or 5. 5 is recommended as it performs better. Look for the installlation folder, the default is program files. Copy it into the same folder as main.py and rename the folder to "tesseract"
Pytesseract
PyQt5

Known bugs

Copy text button crashes app
White text doesnt work well

Future features

Better preprocessing to help with weird backgrounds
Document ocr
Menu

make a better chinese character recognition OCR than tesseract

deep ocr See README_en.md for English installation documentation. 只在ubuntu下面测试通过，需要virtualenv安装，安装路径可自行调整： git clone https://github.com/JinpengLI/deep

1.5k Dec 28, 2022

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

PDFImage2TXT - DOWNLOAD INSTALLER HERE What can you do with it? Convert scanned PDFs to TXT. Convert scanned Documents to TXT. No coding required!! In

2 Feb 22, 2022

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract Toolset U^2-Net is used for background removal Textcleaner is used for image cleaning

3 Jul 13, 2022

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

Open Semantic Search https://opensemanticsearch.org Integrated search server, ETL framework for document processing (crawling, text extraction, text a

684 Jan 6, 2023

A Python wrapper for Google Tesseract

Python Tesseract Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded i

4.6k Jan 6, 2023

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

English | 简体中文 Introduction PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and a

27.5k Jan 8, 2023

OCR engine for all the languages

Description kraken is a turn-key OCR system optimized for historical and non-Latin script material. kraken's main features are: Fully trainable layout

431 Jan 4, 2023

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

OCR.space OCR Result Checker = Draw OCR overlay on top of image Python tool that takes the OCR.space JSON output as input, and draws an overlay on to

4 Oct 18, 2022

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Attention-based OCR Visual attention-based OCR model for image recognition with additional tools for creating TFRecords datasets and exporting the tra

933 Dec 29, 2022

It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

Related tags

Overview

OCR-Tool

Release

Version 1.1

With tesseract

Without tesseract

Features

Dependencies

Known bugs

Future features

You might also like...

make a better chinese character recognition OCR than tesseract

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

A Python wrapper for Google Tesseract

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

OCR engine for all the languages

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Releases(Release)

Release(Oct 15, 2021)

Changes

Owner

Khant Htet Aung

A tool to make dumpy among us GIFS

Ddddocr - 通用验证码识别OCR pypi版

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

基于Paddle框架的PSENet复现

Color Picker and Color Detection tool for METR4202

Repository collecting all the submodules for the new PyTorch-based OCR System.

Awesome anomaly detection in medical images

OCR, Scene-Text-Understanding, Text Recognition

Vietnamese Language Detection and Recognition

Play the Namibian game of Owela against a terrible AI. Built using Django and htmx.

Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"

A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

deployment of a hybrid model for automatic weapon detection/ anomaly detection for surveillance applications

Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

(CVPR 2021) ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

Scene text recognition

a micro OCR network with 0.07mb params.

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

EQFace: An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition