TextBoxes-TensorFlow

A re-implementation of TextBoxes using TensorFlow. This project is greatly inspired by the slim project, and many functions are modified from the SSD-tensorflow project. Later, we will rewrite this project to make it more flexible and modular.

Authors: Daitao Xing ([email protected]), Jin Huang ([email protected])

Progress

2017/03/14

The data_processing phase is finished. To test it:

1. Download the dataset, then put the 1/ folder and gt.mat under the ddata/sythtext/ folder (a script for this will be written).   
2. python datasets/data2record.py    
3. python image_processing.py

Output: a batch of images with shape batch_size * 300 * 300 * 3.
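As a rough illustration of the output shape described above, the sketch below resizes a few images to 300 x 300 and stacks them into one batch. It is not the project's image_processing.py: the helper names `resize_nearest` and `make_batch` are hypothetical, the nearest-neighbour resize is a stand-in for whatever preprocessing the project actually applies, and NumPy is assumed to be installed.

```python
import numpy as np


def resize_nearest(image, size=300):
    """Nearest-neighbour resize of an H x W x 3 array to size x size x 3."""
    height, width = image.shape[:2]
    # For each output row/column, pick the index of the source pixel to copy.
    rows = np.arange(size) * height // size
    cols = np.arange(size) * width // size
    return image[rows[:, None], cols]


def make_batch(images, size=300):
    """Resize every image and stack them into a batch_size x size x size x 3 array."""
    resized = [resize_nearest(image, size) for image in images]
    return np.stack(resized).astype(np.float32)


# Two dummy images with different resolutions (hypothetical input data).
images = [
    np.zeros((480, 640, 3), dtype=np.uint8),
    np.ones((200, 100, 3), dtype=np.uint8),
]
batch = make_batch(images)
print(batch.shape)
```

Here the batch size is 2, so the printed shape has the form batch_size * 300 * 300 * 3.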

2017/03/17

Finished the design of the training pipeline (training can now be started):

python train.py \
--train_dir=${TRAIN_DIR} \
--dataset_dir=${DATASET_DIR} \
--save_summaries_secs=60 \
--save_interval_secs=600 \
--weight_decay=0.0005 \
--optimizer=adam \
--learning_rate=0.001 \
--batch_size=32
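If the training run is launched from a Python script rather than a shell, the command above could be assembled roughly as follows. This is only a sketch: the helper `build_train_command` and the example directories are hypothetical, and the flag names and default values are copied from the command above rather than from train.py itself.

```python
import shlex

# Flag values taken from the example command above.
DEFAULT_FLAGS = {
    "save_summaries_secs": 60,
    "save_interval_secs": 600,
    "weight_decay": 0.0005,
    "optimizer": "adam",
    "learning_rate": 0.001,
    "batch_size": 32,
}


def build_train_command(train_dir, dataset_dir, **overrides):
    """Return the train.py invocation as an argument list."""
    unknown = set(overrides) - set(DEFAULT_FLAGS)
    if unknown:
        # Only flags that appear in the example command are accepted.
        raise ValueError(f"unknown flags: {sorted(unknown)}")
    flags = {"train_dir": train_dir, "dataset_dir": dataset_dir}
    flags.update(DEFAULT_FLAGS)
    flags.update(overrides)
    return ["python", "train.py"] + [
        f"--{name}={value}" for name, value in flags.items()
    ]


# Hypothetical directories; replace them with real TRAIN_DIR and DATASET_DIR values.
command = build_train_command(
    "/tmp/textboxes/train", "/tmp/textboxes/data", batch_size=16
)
print(" ".join(shlex.quote(part) for part in command))
```

The resulting list can be passed to `subprocess.run(command, check=True)` from the repository root to start training.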

Problems to be solved:

1. Need to redesign visualization
2. image_processing can be improved

Next steps:

Train on other datasets
Fine-tune
Test
Automate dataset downloading, and so on


Owner

Gu Xiaodong
