PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Last update: Oct 14, 2022

Overview

About this repository

This repo contains a PyTorch implementation of the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks. The code framework is based on TextBox.

Environment

python >= 3.8.11
torch >= 1.6.0

Run install.sh to install other requirements.

Dataset

The processed dataset can be downloaded from Google Drive. Once the download finishes, unzip the data files (train.src, train.tgt, ...) into ./data.

Dataset overview: train: 287113 cases, dev: 13368 cases, test: 11490 cases.
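
A quick way to check the unzipped files against the counts above is to count lines in a source/target pair. This is a minimal sketch that assumes each case occupies exactly one line in both files, which the repository does not state explicitly. The helper name `count_cases` is hypothetical and not part of the repo.

```python
from pathlib import Path


def count_cases(src_path, tgt_path):
    """Count cases in a parallel source/target file pair.

    Assumption: one case per line in both files. Raises ValueError
    if the two files disagree on the number of lines.
    """
    with Path(src_path).open(encoding="utf-8") as f:
        n_src = sum(1 for _ in f)
    with Path(tgt_path).open(encoding="utf-8") as f:
        n_tgt = sum(1 for _ in f)
    if n_src != n_tgt:
        raise ValueError(
            f"{src_path} has {n_src} lines but {tgt_path} has {n_tgt}"
        )
    return n_src
```

For example, `count_cases("data/train.src", "data/train.tgt")` should return 287113 if the one-line-per-case assumption holds.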

Parameters

# overall settings
data_path: 'data/'
checkpoint_dir: 'saved/'
generated_text_dir: 'generated/'
# dataset settings
max_vocab_size: 50000
src_len: 400
tgt_len: 100

# model settings
decoding_strategy: 'beam_search'
beam_size: 4
is_attention: True
is_pgen: True
is_coverage: True
cov_loss_lambda: 1.0

Log files are written to ./log. More details can be found in the YAML config files.
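
To make the `is_pgen` switch more concrete, here is a sketch of the pointer-generator mixing step described in the paper: the final word distribution is `p_gen` times the vocabulary distribution plus `(1 - p_gen)` times the attention mass on each source position holding that word. The code uses plain Python lists for readability. It is an illustration of the formula, not the repository's implementation, and the function and argument names are hypothetical.

```python
def final_distribution(p_gen, vocab_dist, attention, src_ids, extended_size):
    """Mix generation and copy distributions for one decoder step.

    p_gen:         scalar in [0, 1], probability of generating from the vocabulary
    vocab_dist:    probabilities over the fixed vocabulary
    attention:     attention weights, one per source token
    src_ids:       id of each source token in the extended vocabulary
                   (in-article OOV words get ids >= len(vocab_dist))
    extended_size: fixed vocabulary size plus the number of in-article OOVs
    """
    final = [0.0] * extended_size
    # Generation part: scale the vocabulary distribution by p_gen.
    for word_id, prob in enumerate(vocab_dist):
        final[word_id] = p_gen * prob
    # Copy part: add attention mass onto the ids of the source tokens.
    for weight, word_id in zip(attention, src_ids):
        final[word_id] += (1.0 - p_gen) * weight
    return final
```

With a two-word vocabulary and one in-article OOV word (id 2), the OOV word can only receive probability through the copy part, which is how the model is able to produce words outside `max_vocab_size`.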

Note: Distributed Data Parallel (DDP) is not supported yet.

Train & Evaluation

To train from scratch, run `fire.py`:

if __name__ == '__main__':
    config = Config(config_dict={'test_only': False,
                                 'load_experiment': None})
    train(config)

To resume from a checkpoint, set 'load_experiment' to './saved/$model_name$.pth'. Likewise, 'load_experiment' is required when 'test_only' is set to True.
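
The three combinations of these two options can be summarized in a small check. This is a hypothetical helper written for illustration only. It is not part of the repository, and the mode names it returns are made up here.

```python
def check_run_options(config_dict):
    """Return the run mode implied by 'test_only' and 'load_experiment'.

    Hypothetical helper: raises ValueError for the one invalid combination
    (evaluation requested without a checkpoint to load).
    """
    test_only = config_dict.get("test_only", False)
    checkpoint = config_dict.get("load_experiment")
    if test_only and checkpoint is None:
        raise ValueError(
            "'test_only': True requires 'load_experiment' to point to a checkpoint"
        )
    if test_only:
        return "evaluate"
    return "resume" if checkpoint is not None else "train_from_scratch"
```

Passing the dictionary from the snippet above (`'test_only': False, 'load_experiment': None`) yields the from-scratch mode.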

Results

The best model was trained on a TITAN Xp GPU (8GB usage).

Training loss

Ablation study

Model	Rouge-1	Rouge-2	Rouge-L
Seq2Seq	22.17	7.20	20.97
Seq2Seq+attn	29.35	12.58	27.38
Seq2Seq+attn+pgen	36.04	15.87	32.92
Seq2Seq+attn+pgen+coverage	39.52	17.85	36.40
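
For reference, the coverage row adds the coverage loss from the paper: at each decoder step, the coverage vector is the sum of all previous attention distributions, and the penalty is the sum over source positions of `min(attention, coverage)`. The sketch below computes it with plain Python lists and combines it with per-step negative log-likelihoods using `cov_loss_lambda`. It follows the paper's formula and is not copied from this repository; the function names are hypothetical.

```python
def coverage_loss(attention_steps):
    """Per-step coverage loss, sum_i min(a_i^t, c_i^t), for one example.

    attention_steps: one attention distribution per decoder step,
    each a list with one weight per source token.
    """
    src_len = len(attention_steps[0])
    coverage = [0.0] * src_len  # c^0 is all zeros: nothing attended yet
    losses = []
    for attn in attention_steps:
        # Penalize attention that overlaps with what was already covered.
        losses.append(sum(min(a, c) for a, c in zip(attn, coverage)))
        # Accumulate this step's attention into the coverage vector.
        coverage = [c + a for c, a in zip(coverage, attn)]
    return losses


def total_loss(nll_steps, attention_steps, cov_loss_lambda=1.0):
    """Average over decoder steps of NLL plus weighted coverage loss."""
    cov = coverage_loss(attention_steps)
    per_step = [n + cov_loss_lambda * c for n, c in zip(nll_steps, cov)]
    return sum(per_step) / len(per_step)
```

The first decoder step never incurs a penalty because the coverage vector starts at zero. Later steps are penalized only where they attend again to source positions that already received attention, which discourages repetition in the summary.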

Note: The Seq2Seq model is based on an LSTM. I hope to replace it with a Transformer in the future.

Owner

wxDai
