The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Last update: Dec 29, 2022

Related tags

Overview

FinQA

The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Format

"pre_text": the texts before the table;
"post_text": the text after the table;
"table": the table;
"id": unique example id. composed by the original report name plus example index for this report. 

"qa": {
  "question": the question;
  "program": the reasoning program;
  "gold_inds": the gold supporting facts;
  "exe_ans": the gold execution result;
  "program_re": the reasoning program in nested format;
}

Owner

Zhiyu Chen

Ph.D. student in ML/NLP

GitHub Repository

Super easy library for BERT based NLP models

Fast-Bert New - Learning Rate Finder for Text Classification Training (borrowed with thanks from https://github.com/davidtvs/pytorch-lr-finder) Suppor

1.8k Dec 27, 2022

This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs".

CrossSum This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summ

29 Nov 19, 2022

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

pkuseg：一个多领域中文分词工具包 (English Version) pkuseg 是基于论文[Luo et. al, 2019]的工具包。其简单易用，支持细分领域分词，有效提升了分词准确度。目录主要亮点编译和安装各类分词工具包的性能对比使用方式论文引用作者常见问题及解答主要

6k Dec 29, 2022

BERT score for text generation

BERTScore Automatic Evaluation Metric described in the paper BERTScore: Evaluating Text Generation with BERT (ICLR 2020). News: Features to appear in

1k Jan 08, 2023

结巴中文分词

jieba “结巴”中文分词：做最好的 Python 中文分词组件 "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best Python Chinese word segmentation

29.8k Jan 02, 2023

NLP tool to extract emotional phrase from tweets 🤩

Emotional phrase extractor Extract phrase in the given text that is used to express the sentiment. Capturing sentiment in language is important in the

38 Oct 17, 2022

Index different CKAN entities in Solr, not just datasets

ckanext-sitesearch Index different CKAN entities in Solr, not just datasets Requirements This extension requires CKAN 2.9 or higher and Python 3 Featu

3 Dec 02, 2022

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

LM-Critic: Language Models for Unsupervised Grammatical Error Correction This repo provides the source code & data of our paper: LM-Critic: Language M

98 Nov 24, 2022

中文問句產生器；使用台達電閱讀理解資料集(DRCD)

Transformer QG on DRCD The inputs of the model refers to we integrate C and A into a new C' in the following form. C' = [c1, c2, ..., [HL], a1, ..., a

1 Oct 22, 2021

Traditional Chinese Text Recognition Dataset: Synthetic Dataset and Labeled Data

Traditional Chinese Text Recognition Dataset: Synthetic Dataset and Labeled Data Authors: Yi-Chang Chen, Yu-Chuan Chang, Yen-Cheng Chang and Yi-Ren Ye

5 Dec 15, 2022

Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.

Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers. Cherche is meant to be used with small to medium sized corpora. C

224 Nov 29, 2022

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks Recently, researchers proposed pruning deep neural n

4 Feb 23, 2022

The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Related tags

Overview

FinQA

Format

Owner

Zhiyu Chen

Super easy library for BERT based NLP models

This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs".

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

BERT score for text generation

结巴中文分词

NLP tool to extract emotional phrase from tweets 🤩

Index different CKAN entities in Solr, not just datasets

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

中文問句產生器；使用台達電閱讀理解資料集(DRCD)

Traditional Chinese Text Recognition Dataset: Synthetic Dataset and Labeled Data

Cherche (search in French) allows you to create a neural search pipeline using retrievers and pre-trained language models as rankers.

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Open-source offline translation library written in Python. Uses OpenNMT for translations

Generate a cool README/About me page for your Github Profile

Treemap visualisation of Maya scene files

precise iris segmentation

ZUNIT - Toward Zero-Shot Unsupervised Image-to-Image Translation

Pre-Training with Whole Word Masking for Chinese BERT

A python framework to transform natural language questions to queries in a database query language.