This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.

Last update: Dec 20, 2022

Overview

EAST: An Efficient and Accurate Scene Text Detector

Description:

This version will be updated soon, please pay attention to this work. The motivation of this version is to build a easy-training model. This version can automatically update best_model by comparing current hmean and the former. At the same time, we can see evaluation info about every sample easily.

1.train
2.predict
3.compress
4.compute Hmean(if Hmean is higher than before, update best_weight.pkl)
5.visualization(blue, green, red)
6.multi-scale test (update soon) multi-scale vis. (vis with score, scales)

Thanks

The version is ported from argman/EAST, from Tensorflow to Pytorch

Check On Website

If you have no confidence of the result of our program, you could use submit.zip to submit on website,then you can see result of every image.

Performance

right -- green || wrong -- red || miss -- blue
recall/precision/hmean for every test image

Introduction

This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector. The features are summarized blow:

Only RBOX part is implemented.
A fast Locality-Aware NMS in C++ provided by the paper's author.(g++/gcc version 6.0 + will be ok)
Evalution see here for the detailed results.
Differences from original paper
- Use ResNet-50 rather than PVANET
- Use dice loss (optimize IoU of segmentation) rather than balanced cross entropy
- Use linear learning rate decay rather than staged learning rate decay

Thanks for the author's (@zxytim) help! Please cite his paper if you find this useful.

Installation
Download
Prepare dataset/pretrain
Test
Train
Examples

Installation

Any version of pytorch version > 0.4.0 should be ok.

Download

Pretrained model is not provided temporarily. Web site is updating now, please continue to pay attention

Prepare dataset/pretrain weight

[1]. dataset(you need to prepare for dataset for train and test) suggestions: you could do a soft-link to root_to_this_program/dataset/train/img/*.jpg

-- train ./dataset/train/img/img_###.jpg ./dataset/train/gt/img_###.txt (you need to change name)
-- test ./data/test/img_###.jpg (img only)
-- gt.zip ./result/gt.zip(ICDAR15 gt.zip is avaliable on website

** Note: you can download dataset here

-- ICDAR15
-- ICDAR13

[2]. pretrained

In config.py set resume True and set checkpoint path/to/weight/file
I will provide pretrianed weight soon

[3]. check GPUs and CPUs you can use following to check aviliable gpu, this is for train

watch -n 0.1 nvidia-smi

then, you will see 2,3 is avaliable, modify config.py gpu_ids = [0,1], gpu = 2, and modify run.sh - CUDA_VISIBLE_DEVICES=2,3

Train

If you want to train the model, you should provide the dataset path in config.py and run

sh run.py

** Note: you should modify run.sh to specify your gpu id

If you have more than one gpu, you can pass gpu ids to gpu_list(like gpu_list=0,1,2,3) in config.py

** Note: you should change the gt text file of icdar2015's filename to img_*.txt instead of gt_img_*.txt(or you can change the code in icdar.py), and some extra characters should be removed from the file. See the examples in training_samples/**

Test

By default, we set train-eval process into integer. If you want to use eval independently, you can do it by yourself. Any question can contact me.

Examples

Here are some test examples on icdar2015, enjoy the beautiful text boxes!

This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.

Related tags

Overview

EAST: An Efficient and Accurate Scene Text Detector

Description:

Thanks

Check On Website

Performance

Introduction

Contents

Installation

Download

Prepare dataset/pretrain weight

Train

Test

Examples

Owner

Dejia Song

Computer vision applications project (Flask and OpenCV)

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Implementation of EAST scene text detector in Keras

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

Fusion 360 Add-in that creates a pair of toothed curves that can be used to split a body and create two pieces that slide and lock together.

A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

Single Shot Text Detector with Regional Attention

A facial recognition program that plays a alarm (mp3 file) when a person i seen in the room. A basic theif using Python and OpenCV

一款基于Qt与OpenCV的仿真数字示波器

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code

Select range and every time the screen changes, OCR is activated.

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

Binarize document images

PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Connect Aseprite to Blender for painting pixelart textures in real time

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

BNF Globalization Code (CVPR 2016)

Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.