Anuvada: Interpretable Models for NLP using PyTorch

So, you want to know why your classifier arrived at a particular decision or why your flashy new deep learning classification model is not performing in the way which you would want it to perform? Or there could be bias in your dataset towards a particular class and you want to understand if there are any such edge cases.

One of the common criticisms of deep learning has been it's black box nature (life itself is a big black box, not at all interpretable, don't even ask me about love). To address this issue, researchers have developed many ways to visualise and explain the inference. It is not necessary that a model has to be explainable, but when important decisions like which jobs to recommend to a person or whether to give a person loan are being made, it would be helpful to cross-check the model's claims. In such domains, self-explainable models are necessary.

This library is an ongoing effort to provide a high-level access to such models by building on top of PyTorch.

Here is what you can expect to visualize from a trained model.

Note: This model is a convolutional neural network trained on IMDB sentiment analysis dataset. I trained the model using SGD till validation loss stopped improving. Here is sensitivity analysis on some sample inputs. You can find more details about training the model in the Jupyter notebooks from the examples directory.

Positive review

Negative review

Installing

Clone this repo and add it to your python library path.

Requirements

PyTorch
NumPy
Pandas
Spacy
Gensim
tqdm

To do list

Acknowledgments

https://github.com/henryre/pytorch-fitmodule

Anuvada: Interpretable Models for NLP using PyTorch

Related tags

Overview

Anuvada: Interpretable Models for NLP using PyTorch

Positive review

Negative review

Installing

Requirements

To do list

Acknowledgments

Owner

EDGE

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

A minimal Conformer ASR implementation adapted from ESPnet.

Understand Text Summarization and create your own summarizer in python

Library for Russian imprecise rhymes generation

NVDA, the free and open source Screen Reader for Microsoft Windows

ElasticBERT: A pre-trained model with multi-exit transformer architecture.

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers

Maha is a text processing library specially developed to deal with Arabic text.

GraphNLI: A Graph-based Natural Language Inference Model for Polarity Prediction in Online Debates

TaCL: Improve BERT Pre-training with Token-aware Contrastive Learning

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model

Natural Language Processing for Adverse Drug Reaction (ADR) Detection

AutoGluon: AutoML for Text, Image, and Tabular Data

Lumped-element impedance calculator and frequency-domain plotter.

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

[Preprint] Escaping the Big Data Paradigm with Compact Transformers, 2021

LeBenchmark: a reproducible framework for assessing SSL from speech

An extension for asreview implements a version of the tf-idf feature extractor that saves the matrix and the vocabulary.