An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Last update: Dec 12, 2022

Related tags

Overview

InceptText-Tensorflow

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Introduction

Tensorflow=1.4.0

Preparation

1.gcc 4.9

2.cuda8.0

3.cd lib && make

可能遇到的错误：

解决办法：把cuda路径添加到系统环境变量，然后改为#include<cuda.h>

解决办法：找到nsync_cv.h的绝对路径然后include

解决办法：找到nsync_mu.h的绝对路径然后include

Download

1.Models trained on ICDAR 2017

2.Resnet V1 50 provided by tensorflow slimResNet-v1

Train

python train_main.py

Test

python test.py

Owner

GeorgeJoe

Focus on NLP and OCR

GitHub Repository

The papers published in top-tier AI conferences in recent years.

AI-conference-papers The papers published in top-tier AI conferences in recent years. Paper table AAAI ICLR CVPR ICML ICCV ECCV NIPS 2019 ✔️ ✔️ ✔️ ✔️

6 Dec 09, 2022

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

75 Oct 21, 2022

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

EasyOCR Ready-to-use OCR with 80+ languages supported including Chinese, Japanese, Korean and Thai. What's new 1 February 2021 - Version 1.2.3 Add set

16.7k Jan 03, 2023

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

75 Oct 21, 2022

SRA's seminar on Introduction to Computer Vision Fundamentals

Introduction to Computer Vision This repository includes basics to : Python Numpy: A python library Git Computer Vision. The aim of this repository is

147 Dec 04, 2022

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

EAST_ICPR2018: EAST for ICPR MTWI 2018 Challenge II (Text detection of network images) Introduction This is a repository forked from argman/EAST for t

49 Dec 24, 2022

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

GTA-5-Lane-detection Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and

4 Aug 01, 2021

Automatically download multiple papers by keywords in CVPR

CVFPaperHelper Automatically download multiple papers by keywords in CVPR Install mkdir PapersToRead cd PaperToRead pip install requests tqdm git clon

46 Jun 08, 2022

基于openpose和图像分类的手语识别项目

手语识别 0、使用到的模型 (1). openpose，作者：CMU-Perceptual-Computing-Lab https://github.com/CMU-Perceptual-Computing-Lab/openpose (2). 图像分类classification，作者：Bubbl

20 Dec 15, 2022

fishington.io bot with OpenCV and NumPy

fishington.io-bot fishington.io bot with using OpenCV and NumPy bot can continue to fishing fully automatically how to use Open cmd in fishington.io-b

77 Jan 02, 2023

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Virtual Keyboard With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want. At

5 Jan 23, 2022

Let's explore how we can extract text from forms

Form Segmentation Let's explore how we can extract text from any forms / scanned pages. Objectives The goal is to find an algorithm that can extract t

42 Jun 05, 2022

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection Introduction The code and trained models of: TextField: Learning A Deep

101 Dec 12, 2022

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Convolutional Recurrent Neural Network This software implements the Convolutional Recurrent Neural Network (CRNN), a combination of CNN, RNN and CTC l

2k Dec 31, 2022

This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and flexible design and ready to be integrated right into your system!

Passport-Recogniton-System This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and fle

7 Jan 04, 2023

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Related tags

Overview

InceptText-Tensorflow

Introduction

Tensorflow=1.4.0

Preparation

Download

1.Models trained on ICDAR 2017

2.Resnet V1 50 provided by tensorflow slimResNet-v1

Train

python train_main.py

Test

python test.py

Owner

GeorgeJoe

The papers published in top-tier AI conferences in recent years.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

SRA's seminar on Introduction to Computer Vision Fundamentals

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

Automatically download multiple papers by keywords in CVPR

基于openpose和图像分类的手语识别项目

fishington.io bot with OpenCV and NumPy

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Let's explore how we can extract text from forms

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and flexible design and ready to be integrated right into your system!

A curated list of awesome synthetic data for text location and recognition

deployment of a hybrid model for automatic weapon detection/ anomaly detection for surveillance applications

PianoVisuals - Create background videos synced with piano music using opencv

基于图像识别的开源RPA工具，理论上可以支持所有windows软件和网页的自动化

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks