End-To-End Memory Network using Tensorflow

Last update: Oct 27, 2022

Overview

MemN2N

Implementation of End-To-End Memory Networks with a sklearn-like interface using TensorFlow. Tasks are from the bAbI dataset.
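For readers new to the model, a single memory "hop" can be sketched in plain Python. This is a minimal illustration of the attention step described in the End-To-End Memory Networks paper, not this repository's TensorFlow code. The names `softmax` and `memory_hop` are hypothetical, and the vectors stand in for already-embedded sentences.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def memory_hop(u, input_memories, output_memories):
    """One hop: attend over memories with query u, return (next query, attention).

    u               -- query embedding, list of d floats
    input_memories  -- one d-dimensional vector per sentence (used for matching)
    output_memories -- one d-dimensional vector per sentence (used for the response)
    """
    # Match the query against every input memory with a dot product.
    scores = [sum(ui * mi for ui, mi in zip(u, m)) for m in input_memories]
    p = softmax(scores)
    # The response is the attention-weighted sum of the output memories.
    o = [sum(pi * c[d] for pi, c in zip(p, output_memories)) for d in range(len(u))]
    # The next hop's query is the sum of the current query and the response.
    next_u = [ui + oi for ui, oi in zip(u, o)]
    return next_u, p


# Toy vectors, chosen only for illustration.
u = [1.0, 0.0]
next_u, attention = memory_hop(u, [[10.0, 0.0], [0.0, 10.0]], [[1.0, 1.0], [2.0, 2.0]])
```

With `hops: 3`, as in the parameters listed further down, this step would be applied three times, feeding `next_u` back in as the query.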

Get Started

git clone git@github.com:domluna/memn2n.git

mkdir ./memn2n/data/
cd ./memn2n/data/
wget http://www.thespermwhale.com/jaseweston/babi/tasks_1-20_v1-2.tar.gz
tar xzvf ./tasks_1-20_v1-2.tar.gz

cd ../
python single.py
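The downloaded task files are plain text in the bAbI format: each line starts with an ID that resets to 1 at the start of a new story, and question lines carry the answer and supporting fact IDs after tab characters. The following is a minimal sketch of a parser for that format, assuming an in-memory list of lines; `parse_babi` is a hypothetical helper, not a function from this repository.

```python
def parse_babi(lines):
    """Parse bAbI-format lines into (story, question, answer) triples."""
    examples = []
    story = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        idx, text = line.split(" ", 1)
        if int(idx) == 1:
            story = []  # an ID of 1 marks the start of a new story
        if "\t" in text:
            # Question line: question, answer, then supporting fact IDs.
            parts = text.split("\t")
            question, answer = parts[0].strip(), parts[1]
            examples.append((list(story), question, answer))
        else:
            story.append(text)
    return examples


sample = [
    "1 Mary moved to the bathroom.",
    "2 John went to the hallway.",
    "3 Where is Mary? \tbathroom\t1",
    "4 Daniel went back to the hallway.",
    "5 Sandra moved to the garden.",
    "6 Where is Daniel? \thallway\t4",
]
examples = parse_babi(sample)
for story, question, answer in examples:
    print(len(story), question, answer)
```

Each question sees every statement that came before it in the same story, which is the content the network stores as memories.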

Examples

Running a single bAbI task

Running a joint model on all bAbI tasks

These files are also a good example of usage.

Requirements

tensorflow 1.0
scikit-learn 0.17.1
six 1.10.0

Single Task Results

For a task to pass, it must reach at least 95% testing accuracy. Results are measured on single tasks using the 1k data.

Pass: 1,4,12,15,20

Several other tasks have 80%+ testing accuracy.
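The pass criterion can be checked directly against the testing accuracies in the single-task table below. This small sketch copies those values into a dictionary (the variable names are illustrative) and applies the 95% threshold.

```python
# Single-task testing accuracy, copied from the results table in this README.
single_test_acc = {
    1: 1.0, 2: 0.83, 3: 0.54, 4: 0.98, 5: 0.87,
    6: 0.92, 7: 0.84, 8: 0.86, 9: 0.90, 10: 0.78,
    11: 0.84, 12: 1.0, 13: 0.90, 14: 0.93, 15: 1.0,
    16: 0.44, 17: 0.52, 18: 0.88, 19: 0.13, 20: 1.0,
}

PASS_THRESHOLD = 0.95

passed = sorted(t for t, acc in single_test_acc.items() if acc >= PASS_THRESHOLD)
near = sorted(t for t, acc in single_test_acc.items() if 0.80 <= acc < PASS_THRESHOLD)

print(passed)  # [1, 4, 12, 15, 20]
print(near)    # [2, 5, 6, 7, 8, 9, 11, 13, 14, 18]
```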

A stochastic gradient descent optimizer was used with an annealed learning rate schedule, as specified in Section 4.2 of End-To-End Memory Networks.
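As a rough sketch of that schedule: the paper describes an initial learning rate of 0.01 that is halved every 25 epochs until 100 epochs are reached. The function below is a hypothetical helper, not the repository's training code, and the exact epoch at which each halving takes effect is an assumption.

```python
def annealed_learning_rate(epoch, initial_lr=0.01, anneal_every=25, anneal_factor=0.5):
    """Return the learning rate for a 1-indexed epoch.

    The rate is multiplied by anneal_factor once per completed block of
    anneal_every epochs (assumed boundary: the halving applies from
    epoch anneal_every + 1 onward).
    """
    if epoch < 1:
        raise ValueError("epoch is 1-indexed")
    completed_blocks = (epoch - 1) // anneal_every
    return initial_lr * anneal_factor ** completed_blocks


for epoch in (1, 25, 26, 51, 76, 100):
    print(epoch, annealed_learning_rate(epoch))
```

Under these assumptions the rate steps through 0.01, 0.005, 0.0025, and 0.00125 over 100 epochs. A shorter run would use a smaller `anneal_every`; which value the joint model used is not stated here.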

The following params were used:

epochs: 100
hops: 3
embedding_size: 20

Task	Training Accuracy	Validation Accuracy	Testing Accuracy
1	1.0	1.0	1.0
2	1.0	0.86	0.83
3	1.0	0.64	0.54
4	1.0	0.99	0.98
5	1.0	0.94	0.87
6	1.0	0.97	0.92
7	1.0	0.89	0.84
8	1.0	0.93	0.86
9	1.0	0.86	0.90
10	1.0	0.80	0.78
11	1.0	0.92	0.84
12	1.0	1.0	1.0
13	0.99	0.94	0.90
14	1.0	0.97	0.93
15	1.0	1.0	1.0
16	0.81	0.47	0.44
17	0.76	0.65	0.52
18	0.97	0.96	0.88
19	0.40	0.17	0.13
20	1.0	1.0	1.0

Joint Training Results

Pass: 1,6,9,10,12,13,15,20

Again, a stochastic gradient descent optimizer was used with an annealed learning rate schedule, as specified in Section 4.2 of End-To-End Memory Networks.

The following params were used:

epochs: 60
hops: 3
embedding_size: 40

Task	Training Accuracy	Validation Accuracy	Testing Accuracy
1	1.0	0.99	0.999
2	1.0	0.84	0.849
3	0.99	0.72	0.715
4	0.96	0.86	0.851
5	1.0	0.92	0.865
6	1.0	0.97	0.964
7	0.96	0.87	0.851
8	0.99	0.89	0.898
9	0.99	0.96	0.96
10	1.0	0.96	0.928
11	1.0	0.98	0.93
12	1.0	0.98	0.982
13	0.99	0.98	0.976
14	1.0	0.81	0.877
15	1.0	1.0	0.983
16	0.64	0.45	0.44
17	0.77	0.64	0.547
18	0.85	0.71	0.586
19	0.24	0.07	0.104
20	1.0	1.0	0.996

Notes

Single task results are from 10 repeated trials of the single task model across all 20 tasks, each with a different random initialization. The performance of the model with the lowest validation accuracy for each task is shown in the table above.

Joint training results are from 10 repeated trials of the joint model across all tasks. The performance of the single model whose validation accuracy passed the most tasks (>= 0.95) is shown in the table above (joint_scores_run2.csv). The scores from all 10 runs are located in the results/ directory.
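The selection rule for the joint model can be sketched as follows. This is an illustration under stated assumptions: the per-run scores are held in an in-memory dictionary rather than read from the CSV files, the accuracy values are made-up placeholders, and `count_passed` and `best_run` are hypothetical names.

```python
def count_passed(val_acc_by_task, threshold=0.95):
    """Number of tasks whose validation accuracy meets the threshold."""
    return sum(1 for acc in val_acc_by_task.values() if acc >= threshold)


def best_run(runs, threshold=0.95):
    """Name of the run that passes the most tasks (first one wins on ties)."""
    return max(runs, key=lambda name: count_passed(runs[name], threshold))


# Placeholder validation accuracies for three runs over three tasks;
# these numbers are invented for the example and are not real results.
runs = {
    "run1": {1: 0.99, 2: 0.80, 3: 0.96},
    "run2": {1: 1.00, 2: 0.95, 3: 0.97},
    "run3": {1: 0.94, 2: 0.96, 3: 0.90},
}

print(best_run(runs))  # run2
```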

Owner

Dominique Luna
