An example project using OpenPrompt under pytorch-lightning for a prompt-based SST2 sentiment analysis model

Last update: Oct 21, 2022

Overview

pl_prompt_sst

An example project that uses OpenPrompt within the pytorch-lightning framework to train a prompt-based text classification model on the SST2 sentiment analysis dataset. It leverages pytorch-lightning features such as logging, gradient accumulation, and early stopping, and can be used as a template for further development.

Run

Install the requirements

pip install -r requirements.txt

Set up the prompt to use in sst2/prompt_config.json

{
    "template_text": "{\"placeholder\": \"text_a\"} In summary, the film was {\"mask\"}.",
    "label_words": [["bad"], ["good"]]
}
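To make the two fields concrete, here is a minimal sketch of how the configuration reads once parsed. The `render` helper and the `[MASK]` token are hypothetical and only imitate the template with plain string substitution; they are not OpenPrompt's implementation. The mapping of list position to label (index 0 for negative, index 1 for positive) is also an assumption based on the order of `label_words`.

```python
import json

# Same content as sst2/prompt_config.json (raw string keeps the JSON escapes).
config_text = r'''
{
    "template_text": "{\"placeholder\": \"text_a\"} In summary, the film was {\"mask\"}.",
    "label_words": [["bad"], ["good"]]
}
'''
config = json.loads(config_text)


def render(template_text, text_a, mask_token="[MASK]"):
    """Hypothetical helper: naive substitution, for illustration only."""
    filled = template_text.replace('{"placeholder": "text_a"}', text_a)
    return filled.replace('{"mask"}', mask_token)


# The review text takes the placeholder slot; the model predicts the mask slot.
prompt = render(config["template_text"], "A charming and funny story.")

# Assumed mapping: class index -> words that stand for that class at the mask.
label_to_words = dict(enumerate(config["label_words"]))

print(prompt)
print(label_to_words)
```

In OpenPrompt, a template string of this form is normally passed to `ManualTemplate` (as `text`) and the word lists to `ManualVerbalizer` (as `label_words`); whether main.py wires them up exactly this way is not shown here.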

Adjust the arguments in run.sh or in the command below to suit your needs, then run it.

CUDA_VISIBLE_DEVICES=0 python -u main.py --input_dir ./sst2 \
                                         --prompt_config_dir ./sst2/prompt_config.json \
                                         --model_class bert \
                                         --model_name_or_path prajjwal1/bert-tiny \
                                         --lr 2e-4 \
                                         --bs 32 \
                                         --max_seq_length 64 \
                                         --patience 4 \
                                         --accumulation 2 \
                                         --seed 666
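For readers adapting the command, the sketch below shows one way these flags might be declared with `argparse`. The flag names are copied from the command above, but the types, defaults, and the `build_parser` function are assumptions rather than the actual contents of main.py. It also works out the effective batch size, assuming `--accumulation` is the number of batches accumulated before each optimizer step.

```python
import argparse


def build_parser():
    # Hypothetical parser: names mirror the command above; types and
    # defaults are assumptions, not taken from main.py.
    parser = argparse.ArgumentParser(description="Prompt-based SST2 training")
    parser.add_argument("--input_dir", type=str, default="./sst2")
    parser.add_argument("--prompt_config_dir", type=str,
                        default="./sst2/prompt_config.json")
    parser.add_argument("--model_class", type=str, default="bert")
    parser.add_argument("--model_name_or_path", type=str,
                        default="prajjwal1/bert-tiny")
    parser.add_argument("--lr", type=float, default=2e-4)
    parser.add_argument("--bs", type=int, default=32)
    parser.add_argument("--max_seq_length", type=int, default=64)
    parser.add_argument("--patience", type=int, default=4)
    parser.add_argument("--accumulation", type=int, default=2)
    parser.add_argument("--seed", type=int, default=666)
    return parser


# Parse the same values used in the command above.
args = build_parser().parse_args(
    ["--lr", "2e-4", "--bs", "32", "--accumulation", "2", "--patience", "4"]
)

# With gradient accumulation, each optimizer step sees bs * accumulation examples.
effective_batch_size = args.bs * args.accumulation
print(effective_batch_size)
```

Under these assumptions, the settings above give an effective batch size of 64 per optimizer step, which is worth keeping in mind when changing `--bs` or `--lr`.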

In my preliminary experiment with the settings above, the model achieved an F1 score of 0.822, compared to 0.820 without a prompt.

Note

The project can only be run after this fix on state_dict() has been applied.

Owner

Zhiling Zhang
