Grading tools for Advanced NLP (11-711)

Installation

You'll need Docker and unzip to use this repo. For Docker, follow the official guide to get started. On Ubuntu, unzip can be installed with sudo apt-get install unzip.

Install the Python package with:

git clone https://github.com/ProKil/anlp-grading-tools
cd anlp-grading-tools
pip install -e .

Usage

To evaluate your code, you'll need to change the environment variables in test.sh.

ANLP_TMP_DIR: create a new folder (e.g. mkdir tmp) and point this variable to the absolute path of that folder.

SUBMISSION_DIR: this should point to the folder containing your submission zip file. Note that the toolkit will automatically evaluate all zip files in the folder.

SCORES_DIR: this should point to an empty folder. Your score will be logged in a text file there.

DATA_DIR: this should point to the data folder of minnn-assignment. Please copy the original minnn-assignment/classifier.py to minnn-assignment/data/classifier_orig.py to test if your code can be executed with the original classifier.
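As a rough sketch, the four variables might be set as follows. This assumes that test.sh reads them as exported shell variables and that the tmp, submission, scores and data folders all live in the current working directory; both are assumptions, so check test.sh for the actual lines to edit.

```shell
# Assumed layout: tmp/, submission/, scores/ and data/ sit in the current directory.
# Building each path from $(pwd) keeps it absolute.
export ANLP_TMP_DIR="$(pwd)/tmp"
export SUBMISSION_DIR="$(pwd)/submission"
export SCORES_DIR="$(pwd)/scores"
export DATA_DIR="$(pwd)/data"
```

Deriving the paths from $(pwd) is only one way to avoid a relative path for ANLP_TMP_DIR; hard-coded absolute paths work just as well.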

Example code to prepare the folders:

mkdir tmp
mkdir scores
cp -r path/to/minnn-assignment/data ./
cp path/to/minnn-assignment/classifier.py data/classifier_orig.py
mkdir submission
cp your/submission.zip submission
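Before running the evaluation, it can help to confirm that the folders match the description above. The function below is a hypothetical helper, not part of the toolkit; it only checks the three conditions stated in this README (a zip file in the submission folder, an empty scores folder, and classifier_orig.py in the data folder).

```shell
# Hypothetical helper, not part of the toolkit: checks the folder layout described above.
# Usage: check_setup SUBMISSION_DIR SCORES_DIR DATA_DIR
check_setup() {
    submission_dir=$1
    scores_dir=$2
    data_dir=$3
    status=0

    # At least one zip file must be present in the submission folder.
    if ! ls "$submission_dir"/*.zip >/dev/null 2>&1; then
        echo "no zip file found in $submission_dir" >&2
        status=1
    fi

    # The scores folder should exist and be empty.
    if [ ! -d "$scores_dir" ] || [ -n "$(ls -A "$scores_dir")" ]; then
        echo "$scores_dir is missing or not empty" >&2
        status=1
    fi

    # The original classifier must have been copied into the data folder.
    if [ ! -f "$data_dir/classifier_orig.py" ]; then
        echo "$data_dir/classifier_orig.py not found" >&2
        status=1
    fi

    return "$status"
}
```

With the folders prepared as in the example, check_setup submission scores data should return 0 and print nothing; any problem is reported on stderr.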

Now you can evaluate your code with bash test.sh, after which your scores are written to SCORES_DIR/andrewid. It is normal to get 0s for the last two (the correct labels for the IMDb test set are not available), but you should get reasonable accuracies for the first two (~40).

Troubleshooting

Writing files inside ANLP_TMP_DIR and SCORES_DIR may require elevated permissions. You can either use sudo, or start a shell inside the container with docker run -v FOLDER_TO_WRITE:/mnt -it --entrypoint /bin/bash anlp and then cd /mnt to write those files.
You may experience other permission issues with Docker. Please refer to this page to use Docker without sudo.

Owner

Hao Zhu
