Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Last update: Feb 23, 2022

Overview

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks

Recently, researchers proposed pruning deep neural network weights (DNNs) using an $N:M$ fine-grained block sparsity mask. In this mask, for each block of M weights, we have at least N zeros. In contrast to unstructured sparsity, N:M fine-grained block sparsity allows acceleration in actual modern hardware. Previously suggested solutions enabled DNN acceleration at the inference phase. To also allow such acceleration in the training phase, we suggest a novel transposable-fine-grained sparsity mask where the same mask can be used for both forward and backward passes. Our transposable mask ensures that both the weight matrix and its transpose follow the same sparsity pattern; thus the matrix multiplication required for passing the error backward can also be accelerated. We discuss the transposable constraint and devise a new measure for mask constraints, called mask-diversity (MD), which correlates with their expected accuracy. Lastly, we formulate the problem of finding the optimal transposable mask as a minimum-cost-flow problem and suggest a fast linear approximation that can be used when the masks dynamically change while training. Our experiments suggest 2x speed-up with no accuracy degradation over vision and language models. A reference implementation is available in the supplementary material.

Reproducing the results

This repository is partially based on convNet.pytorch repo. please ensure that you are using pytorch 1.7+. Reproducing AdaPrune results

cd AdaPrune
sh scripts/adaprune_dense_bnt.sh
sh scripts/adaprune_sparse.sh

Reproducing static NM-transposable starting from dense pre-trained model:

cd static_TNM
sh scripts/prune_pretrained_R50.sh

Reproducing dynamic NM-transposable from scratch:

cd dynamic_TNM
sh scripts/clone_and_copy.sh
sh scripts/run_R18.sh
sh scripts/run_R50.sh

Code to reprudece NeurIPS paper: Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks

Related tags

Overview

Accelerated Sparse Neural Training: A Provable and Efficient Method to FindN:M Transposable Masks

Reproducing the results

Owner

itay hubara

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

A Paper List for Speech Translation

Neural text generators like the GPT models promise a general-purpose means of manipulating texts.

Code repository for "It's About Time: Analog clock Reading in the Wild"

Natural language processing summarizer using 3 state of the art Transformer models: BERT, GPT2, and T5

Sploitus - Command line search tool for sploitus.com. Think searchsploit, but with more POCs

List of GSoC organisations with number of times they have been selected.

Addon for adding subtitle files to blender VSE as Text sequences. Using pysub2 python module.

Ongoing research training transformer language models at scale, including: BERT & GPT-2

The entmax mapping and its loss, a family of sparse softmax alternatives.

Script and models for clustering LAION-400m CLIP embeddings.

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Tensorflow implementation of paper: Learning to Diagnose with LSTM Recurrent Neural Networks.

CredData is a set of files including credentials in open source projects

Knowledge Oriented Programming Language

This script just scrapes the most recent Nepali news from Kathmandu Post and notifies the user about current events at regular intervals.It sends out the most recent news at random!

Sample data associated with the Aurora-BP study

Library for Russian imprecise rhymes generation

StarGAN - Official PyTorch Implementation