TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

Last update: Jan 04, 2023

Xue, Wenyuan, et al. "TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition." arXiv preprint arXiv:2106.10598 (2021).

This work has been accepted for presentation at ICCV 2021. A preprint is available on arXiv (https://arxiv.org/abs/2106.10598).

Abstract

A table arranging data in rows and columns is a very effective data structure, which has been widely used in business and scientific research. Considering large-scale tabular data in online and offline documents, automatic table recognition has attracted increasing attention from the document analysis community. Though humans can easily understand the structure of tables, it remains a challenge for machines, especially due to the variety of table layouts and styles. Existing methods usually model a table as either a markup sequence or an adjacency matrix between table cells, failing to address the importance of the logical location of table cells, e.g., that a cell is located in the first row and the second column of the table. In this paper, we reformulate the problem of table structure recognition as table graph reconstruction, and propose an end-to-end trainable table graph reconstruction network (TGRNet) for table structure recognition. Specifically, the proposed method has two main branches, a cell detection branch and a cell logical location branch, to jointly predict the spatial location and the logical location of different cells. Experimental results on three popular table recognition datasets and a new dataset with table graph annotations (TableGraph-350K) demonstrate the effectiveness of the proposed TGRNet for table structure recognition.
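
To make the idea of a table graph more concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes each cell is a graph node carrying a bounding box (spatial location) and inclusive row/column index ranges (logical location). The `Cell` class, its field names, and the "share a row or column" adjacency rule are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Cell:
    """One table cell: a graph node with a spatial and a logical location."""
    bbox: Tuple[int, int, int, int]  # spatial location: (x1, y1, x2, y2) in pixels
    row_start: int  # logical location: first row spanned (0-based)
    row_end: int    # last row spanned (inclusive)
    col_start: int  # first column spanned (0-based)
    col_end: int    # last column spanned (inclusive)


def spans_overlap(a_start: int, a_end: int, b_start: int, b_end: int) -> bool:
    """True if two inclusive index ranges share at least one index."""
    return a_start <= b_end and b_start <= a_end


def build_edges(cells: List[Cell]) -> List[Tuple[int, int, str]]:
    """Connect cells that share a row or a column (an assumed adjacency rule)."""
    edges = []
    for i in range(len(cells)):
        for j in range(i + 1, len(cells)):
            a, b = cells[i], cells[j]
            if spans_overlap(a.row_start, a.row_end, b.row_start, b.row_end):
                edges.append((i, j, "same_row"))
            if spans_overlap(a.col_start, a.col_end, b.col_start, b.col_end):
                edges.append((i, j, "same_col"))
    return edges


# A small table: one header cell spanning both columns, then two body cells.
cells = [
    Cell((0, 0, 200, 30), 0, 0, 0, 1),     # header, row 0, columns 0-1
    Cell((0, 30, 100, 60), 1, 1, 0, 0),    # row 1, column 0
    Cell((100, 30, 200, 60), 1, 1, 1, 1),  # row 1, column 1
]
edges = build_edges(cells)
print(edges)
```

In this sketch the header cell is linked to both body cells through the columns it spans, and the two body cells are linked through their shared row.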

Getting Started

Requirements

Create the environment from the environment.yml file:

conda env create --file environment.yml

Alternatively, install the required software in your own environment independently. If you run into problems installing PyTorch Geometric, follow the official installation instructions (https://pytorch-geometric.readthedocs.io/en/latest/notes/installation.html).

dependencies:
  - python==3.7.0
  - pip==20.2.4
  - pip:
    - dominate==2.5.1
    - imageio==2.8.0
    - networkx==2.3
    - numpy==1.18.2
    - opencv-python==4.4.0.46
    - pandas==1.0.3
    - pillow==7.1.1
    - torchfile==0.1.0
    - tqdm==4.45.0
    - visdom==0.1.8.9
    - Polygon3==3.0.8

PyTorch Installation

# CUDA 10.2
pip install torch==1.5.0 torchvision==0.6.0
# CUDA 10.1
pip install torch==1.5.0+cu101 torchvision==0.6.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html
# CUDA 9.2
pip install torch==1.5.0+cu92 torchvision==0.6.0+cu92 -f https://download.pytorch.org/whl/torch_stable.html

PyTorch Geometric Installation

pip install torch-scatter==2.0.4 -f https://pytorch-geometric.com/whl/torch-1.5.0+${CUDA}.html
pip install torch-sparse==0.6.3 -f https://pytorch-geometric.com/whl/torch-1.5.0+${CUDA}.html
pip install torch-cluster==1.5.4 -f https://pytorch-geometric.com/whl/torch-1.5.0+${CUDA}.html
pip install torch-spline-conv==1.2.0 -f https://pytorch-geometric.com/whl/torch-1.5.0+${CUDA}.html
pip install torch-geometric

where ${CUDA} should be replaced by your specific CUDA version (cu92, cu101, cu102).
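
If you are unsure which value to substitute for ${CUDA}, the mapping can be sketched in Python as follows. The helper names `cuda_tag` and `wheel_index_url` are hypothetical. The sketch only covers the three CUDA versions listed above and raises an error for anything else.

```python
SUPPORTED_TAGS = ("cu92", "cu101", "cu102")


def cuda_tag(cuda_version):
    """Map a CUDA version string such as '10.1' to a wheel tag such as 'cu101'."""
    major, minor = cuda_version.split(".")[:2]
    tag = "cu{}{}".format(major, minor)
    if tag not in SUPPORTED_TAGS:
        raise ValueError("no wheel tag listed for CUDA {}".format(cuda_version))
    return tag


def wheel_index_url(torch_version, cuda_version):
    """Build the URL passed to pip's -f option in the commands above."""
    return "https://pytorch-geometric.com/whl/torch-{}+{}.html".format(
        torch_version, cuda_tag(cuda_version)
    )


print(wheel_index_url("1.5.0", "10.1"))
```

With PyTorch already installed, `torch.version.cuda` reports the CUDA version the build was compiled against (for example "10.1"), and that string can be passed as `cuda_version`.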

Datasets Preparation

Download the datasets from Google Drive or Alibaba Cloud.
Put datasets.tar.gz in "./datasets/" and extract it.

cd ./datasets
tar -zxvf datasets.tar.gz
## The './datasets/' folder should look like:
- datasets/
  - cmdd/
  - icdar13table/
  - icdar19_ctdar/
  - tablegraph24k/
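
After extraction, a short Python check can confirm that the layout matches the listing above. This is only a sketch: the helper name `missing_subfolders` is hypothetical, and it assumes the four dataset folders sit directly under "./datasets/".

```python
import os

# Folder names taken from the listing above.
EXPECTED_DATASETS = ["cmdd", "icdar13table", "icdar19_ctdar", "tablegraph24k"]


def missing_subfolders(root, expected):
    """Return the expected subfolder names that do not exist under root."""
    return [name for name in expected if not os.path.isdir(os.path.join(root, name))]


missing = missing_subfolders("./datasets", EXPECTED_DATASETS)
if missing:
    print("Missing dataset folders: " + ", ".join(missing))
else:
    print("All dataset folders found.")
```

The same function can be pointed at "./checkpoints/" with the checkpoint folder names once the pretrained models are extracted.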

Pretrained Models Preparation

IMPORTANT: According to user feedback (which I have also confirmed myself), the pretrained models may not work in some environments. The following environment has been tested and works as expected:

  - CUDA 9.2
  - torch 1.7.0 + torchvision 0.8.0
  - torch-cluster 1.5.9
  - torch-geometric 1.6.3
  - torch-scatter 2.0.6
  - torch-sparse 0.6.9
  - torch-spline-conv 1.2.1
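
To compare your own environment against the tested one, something like the following Python sketch may help. The function names are hypothetical, and the distribution names are assumed to match the pip package names. The CUDA version itself is not checked. On Python 3.7 the lookup relies on the `importlib_metadata` backport package, which must be installed separately.

```python
# Versions copied from the tested environment listed above.
TESTED_VERSIONS = {
    "torch": "1.7.0",
    "torchvision": "0.8.0",
    "torch-cluster": "1.5.9",
    "torch-geometric": "1.6.3",
    "torch-scatter": "2.0.6",
    "torch-sparse": "0.6.9",
    "torch-spline-conv": "1.2.1",
}


def base_version(version):
    """Drop a local build suffix: '1.7.0+cu92' -> '1.7.0'."""
    return version.split("+")[0]


def find_mismatches(installed, tested=TESTED_VERSIONS):
    """Return {package: (installed, tested)} for every package that differs.

    A package that is absent from `installed` is reported with None.
    """
    mismatches = {}
    for name, wanted in tested.items():
        found = installed.get(name)
        if found is None or base_version(found) != wanted:
            mismatches[name] = (found, wanted)
    return mismatches


def installed_versions(names):
    """Look up installed package versions; the value is None if not installed."""
    try:
        from importlib import metadata  # standard library in Python 3.8+
    except ImportError:
        import importlib_metadata as metadata  # backport for Python 3.7
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


report = find_mismatches(installed_versions(TESTED_VERSIONS))
for name, (found, wanted) in sorted(report.items()):
    print("{}: installed {}, tested {}".format(name, found, wanted))
```

An empty report means every listed package matches the tested version. It does not guarantee that the pretrained models will load.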

Download the pretrained models from Google Drive or Alibaba Cloud.
Put checkpoints.tar.gz in "./checkpoints/" and extract it.

cd ./checkpoints
tar -zxvf checkpoints.tar.gz
## The './checkpoints/' folder should look like:
- checkpoints/
  - cmdd_overall/
  - icdar13table_overall/
  - icdar19_lloc/
  - tablegraph24k_overall/

Test

We provide test scripts that you can run directly:

- test_cmdd.sh
- test_icdar13table.sh
- test_tablegraph-24k.sh
- test_icdar19ctdar.sh
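
To run all four scripts in one go and collect their exit status, a small Python wrapper along these lines could be used. It is a sketch under two assumptions: the scripts live in the repository root, and they can be run with bash. The `run_scripts` helper is hypothetical and not part of the repository.

```python
import os
import subprocess

# Script names taken from the list above.
TEST_SCRIPTS = [
    "test_cmdd.sh",
    "test_icdar13table.sh",
    "test_tablegraph-24k.sh",
    "test_icdar19ctdar.sh",
]


def run_scripts(scripts, repo_dir="."):
    """Run each script with bash and return {script: status}.

    Status is 'missing' if the file does not exist, otherwise 'ok' or
    'failed (exit N)' depending on the script's exit code.
    """
    results = {}
    for script in scripts:
        if not os.path.isfile(os.path.join(repo_dir, script)):
            results[script] = "missing"
            continue
        completed = subprocess.run(["bash", script], cwd=repo_dir)
        if completed.returncode == 0:
            results[script] = "ok"
        else:
            results[script] = "failed (exit {})".format(completed.returncode)
    return results


for script, status in run_scripts(TEST_SCRIPTS).items():
    print("{}: {}".format(script, status))
```

Run it from the repository root, or pass the repository path as `repo_dir`.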

Train

Todo

Owner

Wenyuan
