Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Last update: Oct 25, 2022

Overview

Interpreting Language Models Through Knowledge Graph Extraction

Idea: How do we interpret what a language model learns at various stages of training? Language models have been recently described as open knowledge bases. We can generate knowledge graphs by extracting relation triples from masked language models at sequential epochs or architecture variants to examine the knowledge acquisition process.

Dataset: Squad, Google-RE (3 flavors)

Models: BERT, RoBeRTa, DistilBert, training RoBERTa from scratch

Authors: Vinitra Swamy, Angelika Romanou, Martin Jaggi

This repository is the official implementation of the NeurIPS 2021 XAI4Debugging paper titled "Interpreting Language Models Through Knowledge Graph Extraction". Found this work useful? Please cite our paper.

Quick Start Guide

Pretrained Model (BERT, DistilBERT, RoBERTa) -> Knowlege Graph

Install requirements and clone repository

git clone https://github.com/epfml/interpret-lm-knowledge.git
pip install git+https://github.com/huggingface/transformers   
pip install textacy
cd interpret-lm-knowledge/scripts

Generate knowledge graphs and dataframes python run_knowledge_graph_experiments.py <dataset> <model> <use_spacy>
e.g. squad Bert spacy
e.g. re-place-birth Roberta

options:

dataset=squad - "squad", "re-place-birth", "re-date-birth", "re-place-death"  
model=Roberta - "Bert", "Roberta", "DistilBert"  
extractor=spacy - "spacy", "textacy", "custom"

See run_lm_experiments notebook for examples.

Train LM model from scratch -> Knowledge Graph

Install requirements and clone repository

!pip install git+https://github.com/huggingface/transformers
!pip list | grep -E 'transformers|tokenizers'
!pip install textacy

Run wikipedia_train_from_scratch_lm.ipynb.
As included in the last cell of the notebook, you can run the KG generation experiments by:

from run_training_kg_experiments import *
run_experiments(tokenizer, model, unmasker, "Roberta3e")

Citations

@inproceedings{swamy2021interpreting,
 author = {Swamy, Vinitra and Romanou, Angelika and Jaggi, Martin},
 booktitle = {Advances in Neural Information Processing Systems, Workshop on eXplainable AI Approaches for Debugging and Diagnosis},
 title = {Interpreting Language Models Through Knowledge Graph Extraction},
 volume = {35},
 year = {2021}
}

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Related tags

Overview

Interpreting Language Models Through Knowledge Graph Extraction

Quick Start Guide

Pretrained Model (BERT, DistilBERT, RoBERTa) -> Knowlege Graph

Train LM model from scratch -> Knowledge Graph

Citations

Owner

EPFL Machine Learning and Optimization Laboratory

Implementation of Uformer, Attention-based Unet, in Pytorch

Code accompanying the paper "ProxyFL: Decentralized Federated Learning through Proxy Model Sharing"

A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

A annotation of yolov5-5.0

The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"

Source-to-Source Debuggable Derivatives in Pure Python

you can add any codes in any language by creating its respective folder (if already not available).

From Canonical Correlation Analysis to Self-supervised Graph Neural Networks

tf2onnx - Convert TensorFlow, Keras and Tflite models to ONNX.

The trained model and denoising example for paper : Cardiopulmonary Auscultation Enhancement with a Two-Stage Noise Cancellation Approach

A Python implementation of global optimization with gaussian processes.

A repo to show how to use custom dataset to train s2anet, and change backbone to resnext101

🏎️ Accelerate training and inference of 🤗 Transformers with easy to use hardware optimization tools

A package, and script, to perform imaging transcriptomics on a neuroimaging scan.

Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

A simple image/video to Desmos graph converter run locally

EgoNN: Egocentric Neural Network for Point Cloud Based 6DoF Relocalization at the City Scale

A Large-Scale Dataset for Spinal Vertebrae Segmentation in Computed Tomography

PyTorch implementation of our ICCV 2021 paper Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer.

Python scripts form performing stereo depth estimation using the CoEx model in ONNX.