Black-Box-Tuning

Source code for the paper "Black-Box Tuning for Language-Model-as-a-Service".

Because I have been busy recently, the code in this repo and this tutorial are very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple, so you can read our code and reimplement it in your own environment. Alternatively, you can create a new environment to run our implementation, which is based on Nevergrad, Transformers, and FastNLP. We also optionally use fitlog to monitor experimental results; uncomment the fitlog-related lines in our code to enable it.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install scikit-learn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning
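
Before running anything, it can help to confirm that the packages installed above are visible to the interpreter. The following is a small sketch using only the standard library; the helper name `check_packages` is hypothetical and not part of this repo, and the import names in `REQUIRED` are assumed to match the packages listed above.

```python
import importlib.util

# Import names of the packages installed above (scikit-learn imports as "sklearn").
REQUIRED = ["transformers", "datasets", "fastNLP", "nevergrad", "sklearn"]


def check_packages(names):
    """Return {name: bool}, True when a top-level module with that name can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_packages(REQUIRED).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

Any package reported as MISSING can be installed again inside the activated `bbt` environment.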

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results are saved in a directory named results/. You should obtain results close to the following:

SST-2 split	Best Accuracy (%)
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments of bbt.py. For example:

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000
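
To give a feel for what `--intrinsic_dim` and `--budget` control, here is a minimal sketch of gradient-free prompt search. It is not the code in bbt.py: it uses only the Python standard library, replaces the language-model query with a toy quadratic loss, and swaps in a simple (1+1) evolution strategy for the Nevergrad optimizer. All function names (`make_projection`, `project`, `black_box_loss`, `optimize`) and default values are hypothetical and chosen for illustration.

```python
import random


def make_projection(prompt_dim, intrinsic_dim, rng):
    """Fixed random matrix A (prompt_dim x intrinsic_dim), drawn once and never trained."""
    scale = 1.0 / intrinsic_dim ** 0.5
    return [[rng.gauss(0.0, scale) for _ in range(intrinsic_dim)]
            for _ in range(prompt_dim)]


def project(A, z):
    """Map the low-dimensional vector z to a full prompt vector A @ z."""
    return [sum(a * x for a, x in zip(row, z)) for row in A]


def black_box_loss(prompt, target):
    """Toy stand-in for the model query: squared distance to a hidden target.
    A real service would return a loss computed from model outputs instead."""
    return sum((p - t) ** 2 for p, t in zip(prompt, target))


def optimize(intrinsic_dim=5, prompt_dim=20, budget=2000, sigma=0.3, seed=42):
    """(1+1) evolution strategy: perturb z, keep the candidate if the loss does not increase."""
    rng = random.Random(seed)
    A = make_projection(prompt_dim, intrinsic_dim, rng)
    # Hidden target chosen inside the range of A so that zero loss is reachable.
    target = project(A, [rng.uniform(-1, 1) for _ in range(intrinsic_dim)])
    z = [0.0] * intrinsic_dim
    best = black_box_loss(project(A, z), target)
    initial = best
    for _ in range(budget - 1):  # one evaluation was already spent on the start point
        candidate = [x + rng.gauss(0.0, sigma) for x in z]
        # Only loss values are observed; no gradients are computed anywhere.
        loss = black_box_loss(project(A, candidate), target)
        if loss <= best:
            z, best = candidate, loss
    return initial, best


if __name__ == "__main__":
    initial, best = optimize()
    print(f"loss before: {initial:.4f}, after: {best:.4f}")
```

In this sketch the search happens entirely in the small `intrinsic_dim`-sized space, and `budget` caps the number of loss evaluations, which is the role these two arguments are assumed to play in the command above.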

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-a-Service},
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}

Owner

Tianxiang Sun
