Contextual Attention Localization for Offline Handwritten Text Recognition

Last update: Feb 17, 2022

CALText

This repository contains the source code for the CALText model introduced in the paper "CALText: Contextual Attention Localization for Offline Handwritten Text". The details of the model are presented in: (Add paper link)

Samples of the datasets that were used to train and test the model can be found at: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

The code is based on the following work:

https://github.com/JianshuZhang/WAP

https://github.com/wwjwhen/Watch-Attend-and-Parse-tensorflow-version

Requirements

Python 3

TensorFlow v1.6

Usage

Upload the data files to your Colab account and create pickle files (train, valid, and test images and labels) from the dataset. You can place the pickle files in any folder, but update the path settings in the code where the data is loaded.

Run "makepickle.ipynb" to create pickle files for the train and test data. Then split the train pickle file into train and valid pickle files by holding out the last 907 images and labels of the training set as the validation set.
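The hold-out step described above can be sketched as follows. This is a minimal illustration, not the repository's own code: it assumes the training data is available as two parallel sequences (images and labels), and the function name split_train_valid is hypothetical.

```python
def split_train_valid(images, labels, n_valid=907):
    """Hold out the last n_valid samples of the training set as validation.

    Assumes images and labels are parallel sequences (e.g. lists) of equal
    length; the actual layout produced by makepickle.ipynb may differ.
    """
    if len(images) != len(labels):
        raise ValueError("images and labels must have the same length")
    if not 0 < n_valid < len(images):
        raise ValueError("n_valid must be between 1 and len(images) - 1")
    # Everything except the last n_valid samples stays in the training set.
    train = (images[:-n_valid], labels[:-n_valid])
    # The last n_valid samples become the validation set.
    valid = (images[-n_valid:], labels[-n_valid:])
    return train, valid
```

The two returned tuples can then be written to separate train and valid pickle files with pickle.dump.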

For training, set mode="train", and run "CALText.ipynb".

For testing, set mode="test", and run "CALText.ipynb".

For Contextual Attention, set alpha_reg=0 for both training and testing.

For Contextual Attention Localization, set alpha_reg=1 for both training and testing.
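The two settings above give four possible run configurations. As a rough sketch, a small check like the following could guard against typos before a long run. The names mode and alpha_reg come from the instructions above; the helper describe_run is hypothetical and is not part of "CALText.ipynb".

```python
def describe_run(mode, alpha_reg):
    """Validate the two settings and return a short description of the run."""
    if mode not in ("train", "test"):
        raise ValueError('mode must be "train" or "test"')
    if alpha_reg not in (0, 1):
        raise ValueError("alpha_reg must be 0 or 1")
    # alpha_reg=1 selects the localization variant, alpha_reg=0 the plain one.
    if alpha_reg == 1:
        variant = "Contextual Attention Localization"
    else:
        variant = "Contextual Attention"
    return mode + ": " + variant
```

Use the same alpha_reg value when testing a model as was used when training it.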

Run on Python Compiler

To run the code locally with Python, copy the code from the notebooks into files named "makepickle.py" and "CALText.py". Then use the following commands to run them.

python makepickle.py

python CALText.py
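Instead of copying cells by hand, the code cells can be extracted programmatically. The sketch below assumes the notebooks use the common nbformat 4 JSON layout (a top-level "cells" list); the function name notebook_to_script is hypothetical. Lines that begin with "%" or "!" are commented out, because IPython magics and shell escapes are not valid in a plain Python script.

```python
import json


def notebook_to_script(notebook_path, script_path):
    """Write the code cells of a .ipynb file to a plain .py file."""
    with open(notebook_path, encoding="utf-8") as f:
        notebook = json.load(f)

    lines = []
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue  # skip markdown and raw cells
        source = cell.get("source", [])
        # The source is stored either as one string or as a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        for line in source.splitlines():
            # Comment out IPython magics (%...) and shell escapes (!...).
            if line.lstrip().startswith(("%", "!")):
                line = "# " + line
            lines.append(line)
        lines.append("")  # blank line between cells

    with open(script_path, "w", encoding="utf-8") as f:
        f.write("\n".join(lines))
```

For example, notebook_to_script("CALText.ipynb", "CALText.py") would produce the script used by the second command above. Review the generated file before running it, since Colab-specific code such as the Drive mount will not work outside Colab.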

Run on Google Colab

Open the "makepickle.ipynb" and "CALText.ipynb" notebooks in Google Colab and run them.

Run the "%tensorflow_version 1.x" command in the Colab notebook before running "CALText.ipynb".

Change the runtime to GPU or TPU for better performance.

Add these lines to the notebook to access data from Google Drive:

from google.colab import drive

drive.mount("/gdrive", force_remount=True)
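Once the drive is mounted, the pickle files can be read from the mounted folder. The following is a minimal sketch using only the standard library; the helper name load_pickle, the folder name, and the file name in the usage note are assumptions and should be replaced with your own paths.

```python
import os
import pickle


def load_pickle(data_dir, filename):
    """Load one pickle file from data_dir and return the stored object."""
    path = os.path.join(data_dir, filename)
    if not os.path.exists(path):
        # Fail early with a clear message if the path setting is wrong.
        raise FileNotFoundError("pickle file not found: " + path)
    with open(path, "rb") as f:
        return pickle.load(f)
```

With the mount point "/gdrive" used above, Drive files typically appear under "/gdrive/MyDrive", so a call might look like load_pickle("/gdrive/MyDrive/CALText_data", "train.pkl"), where both names are hypothetical.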

References

PUCIT Offline Handwritten Urdu Lines (PUCIT-OHUL) Dataset: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

Previous Work:

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/index.html

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/ICFHR2020_manuscript.pdf

