TextBoxes++-TensorFlow

TextBoxes++ re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project

Author: Zhisheng Zou [email protected]

pretrained model

Google drive

environment

python2.7/python3.5

tensorflow-gpu 1.8.0

at least one gpu

how to use

Getting the xml file like this example xml and put the image together because we need the format like this standard xml
1. picture format: *.png or *.PNG
Getting the xml and flags ensure the XML file is under the same directory as the corresponding image.execute the code: convert_xml_format.py
1. python tools/convert_xml_format.py -i in_dir -s split_flag -l save_logs -o output_dir
2. in_dir means the absolute directory which contains the pic and xml
3. split_flag means whether or not to split the datasets
4. save_logs means whether to save train_xml.txt
5. output_dir means where to save xmls
Getting the tfrecords
1. python gene_tfrecords.py --xml_img_txt_path=./logs/train_xml.txt --output_dir=tfrecords
2. xml_img_txt_path like this train xml
3. output_dir means where to save tfrecords
Training
1. python train.py --train_dir =some_path --dataset_dir=some_path --checkpoint_path=some_path
2. train_dir store the checkpoints when training
3. dataset_dir store the tfrecords for training
4. checkpoint_path store the model which needs to be fine tuned
Testing
1. python test.py -m /home/model.ckpt-858 -o test
2. -m which means the model
3. -o which means output_result_dir
4. -i which means the test img dir
5. -c which means use which device to run the test
6. -n which means the nms threshold
7. -s which means the score threshold

Note:

when you are training the model, you can run the eval_result.py to eval your model and save the result

Textboxes_plusplus implementation with Tensorflow (python)

Related tags

Overview

TextBoxes++-TensorFlow

pretrained model

environment

how to use

Note:

Owner

Automatically download multiple papers by keywords in CVPR

OCR, Scene-Text-Understanding, Text Recognition

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder

Programa que viabiliza a OCR (Optical Character Reading - leitura óptica de caracteres) de um PDF.

OCR system for Arabic language that converts images of typed text to machine-encoded text.

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

FOTS Pytorch Implementation

Face Recognizer using Opencv Python

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

Face Anonymizer - FaceAnonApp v1.0

A Python wrapper for the tesseract-ocr API

基于图像识别的开源RPA工具，理论上可以支持所有windows软件和网页的自动化

Tracking the latest progress in Scene Text Detection and Recognition: Must-read papers well organized

Repository for playing the computer vision apps: People analytics on Raspberry Pi.

pyntcloud is a Python library for working with 3D point clouds.

OCR, Object Detection, Number Plate, Real Time

Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition