Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

Last update: Aug 05, 2022

Task

Large-scale unsupervised pre-training of deep neural networks has driven strong progress in Natural Language Processing (NLP). Fine-tuning these extensively pre-trained networks for specific NLP applications is the current state-of-the-art approach. In this project, we address the task of ranking candidate clarifying questions for a given query. We fine-tuned a pre-trained BERT model to rank the candidate questions by treating the ranking as a classification problem. The resulting model achieves a top-5 accuracy of 0.4565 on the provided benchmark dataset.
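The README does not spell out how the top-5 accuracy is computed. The sketch below shows one common reading, stated here as an assumption: a query counts as a hit if at least one of its correct clarifying questions is among the k highest-scored candidates. The function name `top_k_accuracy` and the dictionary layout are hypothetical and not taken from the project code.

```python
from typing import Dict, Set


def top_k_accuracy(
    scores: Dict[str, Dict[str, float]],
    relevant: Dict[str, Set[str]],
    k: int = 5,
) -> float:
    """Fraction of queries with at least one relevant question in the top k.

    scores:   query -> {candidate question -> model score} (assumed layout)
    relevant: query -> set of questions labeled as correct for that query
    """
    if not scores:
        return 0.0
    hits = 0
    for query, candidates in scores.items():
        # Sort candidate questions by descending score and keep the first k.
        ranked = sorted(candidates, key=candidates.get, reverse=True)[:k]
        if relevant.get(query, set()) & set(ranked):
            hits += 1
    return hits / len(scores)


# Toy scores, invented purely for illustration.
toy_scores = {
    "q1": {"a": 0.9, "b": 0.2, "c": 0.5},
    "q2": {"a": 0.1, "b": 0.8, "c": 0.3},
}
toy_relevant = {"q1": {"c"}, "q2": {"a"}}
print(top_k_accuracy(toy_scores, toy_relevant, k=2))  # 0.5
```

With k equal to the number of candidates, every query that has at least one relevant candidate becomes a hit, so the value is only informative for small k.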

Installation

This project was originally developed with Python 3.8, PyTorch 1.7, and CUDA 11.0. The training requires one NVIDIA GeForce RTX 1080 (11GB memory).

Create a conda environment:

conda create --name dl4nlp python=3.8
conda activate dl4nlp

Install the dependencies:

pip install -r requirements.txt

Run

We use the pre-trained BERT-Base model provided by Hugging Face and fine-tune it on the given training dataset. To run training, use the following command:

python main.py --train

For evaluation on the test set, please use the following command:

python main.py --test
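Ranking "in a classification manner" suggests that the model sees (query, question) pairs with a binary label. The following is a minimal sketch of how such pairs could be built, assuming every query is paired with every question in the question bank. The project may instead sample negatives or build pairs differently; `build_pairs` and the placeholder strings are hypothetical.

```python
from typing import Dict, List, Set, Tuple

TRUE_LABEL = 1   # mirrors the documented --true-label default
FALSE_LABEL = 0  # mirrors the documented --false-label default


def build_pairs(
    positives: Dict[str, Set[str]],
    question_bank: List[str],
) -> List[Tuple[str, str, int]]:
    """Pair every query with every bank question and attach a binary label."""
    pairs = []
    for query, good_questions in positives.items():
        for question in question_bank:
            # A pair is positive only if the question is annotated for the query.
            label = TRUE_LABEL if question in good_questions else FALSE_LABEL
            pairs.append((query, question, label))
    return pairs


# Placeholder data, not taken from the benchmark dataset.
bank = ["question 1", "question 2", "question 3"]
pairs = build_pairs({"query A": {"question 2"}}, bank)
for pair in pairs:
    print(pair)
```

At test time, the score the classifier assigns to the true label for each pair can serve as the ranking score for that question.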

Arguments for training and/or testing:

--train: Run training on training dataset. Default: True
--val: Run evaluation during training on validation dataset. Default: True
--test: Run evaluation on test dataset. Default: True
--cuda-devices: Set GPU index. Default: 0
--cpu: Run everything on CPU. Default: False
--data-parallel: Use DataParallel. Default: False
--data-root: Path to dataset folder. Default: data
--train-file-name: Name of the training file in data-root. Default: training.tsv
--test-file-name: Name of the test file in data-root. Default: test_set.tsv
--question-bank-name: Name of the question bank file in data-root. Default: question_bank.tsv
--checkpoints-root: Path to checkpoints folder. Default: checkpoints
--checkpoint-name: File name of checkpoint in checkpoints-root to start training or use for testing. Default: None
--runs-root: Path to output runs folder for tensorboard. Default: runs
--txt-root: Path to output txt folder for evaluation results. Default: txt
--lr: Learning rate. Default: 1e-5
--betas: Betas for optimization. Default: (0.9, 0.999)
--weight-decay: Weight decay. Default: 1e-2
--val-start: Epoch at which to start validation. Default: 0
--val-step: Validation interval in epochs. Default: 1
--val-split: Fraction of the training dataset to use for validation. Default: 0.005
--num-epochs: Number of epochs for training. Default: 10
--batch-size: Samples per batch. Default: 32
--num-workers: Number of workers. Default: 4
--top-k-accuracy: Value of k for the top-k accuracy evaluation metric. Default: 50
--true-label: True label in dataset. Default: 1
--false-label: False label in dataset. Default: 0
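To illustrate how options like these are typically declared, here is a hedged sketch using Python's standard `argparse` module for a small subset of the arguments. It is not the project's actual `main.py`; only the flag names and default values are taken from the list above, while `build_parser` and the help texts are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare a subset of the documented options with their listed defaults."""
    parser = argparse.ArgumentParser(
        description="Illustrative subset of the training options"
    )
    parser.add_argument("--data-root", type=str, default="data",
                        help="Path to dataset folder")
    parser.add_argument("--lr", type=float, default=1e-5,
                        help="Learning rate")
    # Two floats, e.g. "--betas 0.9 0.98"; parsed into a list of length 2.
    parser.add_argument("--betas", type=float, nargs=2, default=(0.9, 0.999),
                        help="Betas for optimization")
    parser.add_argument("--weight-decay", type=float, default=1e-2,
                        help="Weight decay")
    parser.add_argument("--num-epochs", type=int, default=10,
                        help="Number of epochs for training")
    parser.add_argument("--batch-size", type=int, default=32,
                        help="Samples per batch")
    return parser


# Parse an explicit argument list so the example does not depend on sys.argv.
args = build_parser().parse_args(["--lr", "3e-5", "--betas", "0.9", "0.98"])
print(args.lr, args.betas, args.num_epochs, args.batch_size)
```

argparse converts dashes in option names to underscores, so `--num-epochs` is read back as `args.num_epochs`.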

Example output

User query:

Tell me about Computers

Proposed clarifying questions:

do you like using computers
do you want to know how to do computer programming
do you want to see some closeup of a turbine
are you looking for information on different computer programming languages
are you referring to a software
