Code and dataset for the EMNLP 2021 Findings paper "Can NLI Models Verify QA Systems’ Predictions?"

Last update: Oct 21, 2022

Can NLI Models Verify QA Systems' Predictions?

This repository contains the data and code for the following paper:

**Can NLI Models Verify QA Systems' Predictions?**
Jifan Chen, Eunsol Choi, Greg Durrett
EMNLP 2021 Findings

@article{chen2021can,
  title={Can NLI Models Verify QA Systems' Predictions?},
  author={Chen, Jifan and Choi, Eunsol and Durrett, Greg},
  journal={EMNLP Findings},
  year={2021}
}

Datasets

The NLI data, converted from QA datasets through the pipeline described in the paper, can be found here.

Data Format

The data files are in JSON Lines format (one JSON object per line); each example has the following fields:

| Field | Description |
| --- | --- |
| `example_id` | Example ID |
| `title_text` | Title of the Wikipedia page the example comes from; may be NONE |
| `paragraph_text` | Paragraph containing the answer |
| `question_text` | Question |
| `answer_text` | Answer to the question |
| `answer_sent_text` | Sentence containing the answer |
| `decontext_answer_sent_text` | Decontextualized sentence containing the answer |
| `question_statement_text` | Declarative version of the question, formed by combining it with the answer |
| `answer_scores` | Top-5 answer scores computed by the QA (BERT-joint) model |
| `is_correct` | Whether the answer is correct |
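
As a rough illustration of the format above, the following sketch parses JSON Lines input and pulls out one premise/hypothesis pair per example. It is not part of this repository: the helper names `parse_jsonl` and `to_nli_pair` are hypothetical, the sample record is made up, and treating `decontext_answer_sent_text` as the premise and `question_statement_text` as the hypothesis is an assumption based on the field descriptions, so check it against the paper before relying on it.

```python
import json
from typing import Dict, Iterable, Iterator, Tuple


def parse_jsonl(lines: Iterable[str]) -> Iterator[Dict]:
    """Yield one dict per non-empty line of JSON Lines input."""
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


def to_nli_pair(example: Dict) -> Tuple[str, str]:
    """Return (premise, hypothesis) for one example.

    Assumption: the decontextualized answer sentence serves as the premise
    and the declarative question statement as the hypothesis.
    """
    premise = example["decontext_answer_sent_text"]
    hypothesis = example["question_statement_text"]
    return premise, hypothesis


# Made-up record for illustration only; real records carry all listed fields.
sample_lines = [
    json.dumps({
        "example_id": "demo-0",
        "question_text": "who wrote hamlet",
        "answer_text": "William Shakespeare",
        "decontext_answer_sent_text": "Hamlet is a tragedy written by William Shakespeare.",
        "question_statement_text": "William Shakespeare wrote Hamlet.",
    })
]

examples = list(parse_jsonl(sample_lines))
premise, hypothesis = to_nli_pair(examples[0])
print("premise:   ", premise)
print("hypothesis:", hypothesis)
```

To read a downloaded data file instead of the in-memory sample, pass an open file handle, for example `with open(path, encoding="utf-8") as f: examples = list(parse_jsonl(f))`, where `path` points at one of the JSON Lines files.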

Models

Getting started

git clone https://github.com/jifan-chen/QA-Verification-Via-NLI.git

Install the dependencies by running `pip install -r requirements.txt`.

Question Converter & Decontextualizer

See the README in `seq2seq_converter`.

NQ-NLI

Coming soon.

Contact

If you have any questions, please contact [email protected].
