Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Overview

E(n)-Equivariant Transformer (wip)

Implementation of E(n)-Equivariant Transformer, which extends the ideas from Welling's E(n)-Equivariant Graph Neural Network with attention.

Install

$ pip install En-transformer

Usage

import torch
from en_transformer import EnTransformer

model = EnTransformer(
    dim = 512,               # node feature dimension
    depth = 4,               # number of attention layers
    dim_head = 64,
    heads = 8,
    edge_dim = 4,            # dimension of optional pairwise edge features
    fourier_features = 2     # fourier encoding of relative distances
)

feats = torch.randn(1, 16, 512)     # per-node features
coors = torch.randn(1, 16, 3)       # per-node 3D coordinates
edges = torch.randn(1, 16, 16, 4)   # pairwise edge features

feats, coors = model(feats, coors, edges)  # (1, 16, 512), (1, 16, 3)
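
As a quick sanity check of the claimed E(n)-equivariance, you can rotate and translate the input coordinates and verify that the output features are unchanged while the output coordinates transform the same way. The snippet below is a sketch, not part of the official README; it reuses the model, feats, coors and edges from above, and the tolerance is loose to allow for float32 accumulation.

R = torch.linalg.qr(torch.randn(3, 3)).Q   # random orthogonal matrix (rotation / reflection)
t = torch.randn(1, 1, 3)                   # random translation

feats_out, coors_out = model(feats, coors, edges)
feats_rot, coors_rot = model(feats, coors @ R + t, edges)

assert torch.allclose(feats_out, feats_rot, atol = 1e-4)          # features are invariant
assert torch.allclose(coors_out @ R + t, coors_rot, atol = 1e-4)  # coordinates are equivariant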

Todo

  • masking
  • neighborhoods by radius

Citations

@misc{satorras2021en,
    title         = {E(n) Equivariant Graph Neural Networks},
    author        = {Victor Garcia Satorras and Emiel Hoogeboom and Max Welling},
    year          = {2021},
    eprint        = {2102.09844},
    archivePrefix = {arXiv},
    primaryClass  = {cs.LG}
}
Comments
  • Checkpoint sequential segments should equal number of layers instead of 1?

    https://github.com/lucidrains/En-transformer/blob/a37e635d93a322cafdaaf829397c601350b23e5b/en_transformer/en_transformer.py#L527

    Looking at the source code here: https://pytorch.org/docs/stable/_modules/torch/utils/checkpoint.html#checkpoint_sequential (a minimal sketch of the difference appears after this comment list)

    opened by aced125 2
  • On rotary embeddings

    Hi @lucidrains, thank you for your amazing work; big fan! I had a quick question on the usage of this repository.

    Based on my understanding, rotary embeddings are a drop-in replacement for the original sinusoidal or learnt PEs in Transformers for sequential data, as in NLP or other temporal applications. If my application is not on sequential data, is there a reason why I should still use rotary embeddings?

    E.g. for molecular datasets such as QM9 (from the En-GNNs paper), would it make sense to have rotary embeddings?

    opened by chaitjo 1
  • Is this line required?

    https://github.com/lucidrains/En-transformer/blob/7247e258fab953b2a8b5a73b8dfdfb72910711f8/en_transformer/en_transformer.py#L159

    Is this line required? Does line 157, two lines above, make this line redundant?

    opened by aced125 1
  • Performance drop with checkpointing update

    I see a drop in performance (higher loss) when I update checkpointing from checkpoint_sequential(self.layers, 1, inp) to checkpoint_sequential(self.layers, len(self.layers), inp). Is this expected?

    opened by heiidii 0
  • varying number of nodes

    @lucidrains Thank you for your efficient implementation. I was wondering how to use this implementation for datasets where the number of nodes in each graph is not the same, for example datasets of small molecules. (A padding sketch appears after this comment list.)

    opened by mohaiminul2810 1
  • Edge model/rep

    Hi,

    Thank you for providing this version of the EnGNN model. This is not really an issue, just a query. The original model as implemented here (https://github.com/vgsatorras/egnn) has 3 main steps per layer:

    edge_feat = self.edge_model(h[row], h[col], radial, edge_attr)
    coord = self.coord_model(coord, edge_index, coord_diff, edge_feat)
    h, agg = self.node_model(h, edge_index, edge_feat, node_attr)

    I am interested in the edge_feat and was wondering what would be an equivalent edge representation in your implementation. Line 335 in EnTransformer.py: qk = self.edge_mlp(qk) seems like the best candidate. Thanks, Pooja

    opened by heiidii 1
  • efficient implementation

    Hi, I wonder if relative distances and coordinates can be handled more efficiently using memory-efficient attention as in "Self-attention Does Not Need O(n^2) Memory". It is straightforward for the scalar part. (A query-chunked sketch for the scalar part appears after this comment list.)

    opened by amrhamedp 2
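
Regarding the two checkpointing comments above, the snippet below is a minimal, self-contained sketch (not the repository's code) of how the segments argument of torch.utils.checkpoint.checkpoint_sequential changes behaviour, shown on a toy stack of linear layers.

import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint_sequential

layers = nn.Sequential(*[nn.Linear(512, 512) for _ in range(4)])
x = torch.randn(2, 512, requires_grad = True)

# segments = 1: the loop over checkpointed segments is empty, so the whole
# stack runs as the final, non-checkpointed segment and no activation memory is saved
out_no_ckpt = checkpoint_sequential(layers, 1, x)

# segments = len(layers): every segment except the last is recomputed during
# backward, which is what actually lowers peak activation memory
out_ckpt = checkpoint_sequential(layers, len(layers), x)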
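
On the varying-node-count comment above: a common approach is to zero-pad each graph's features and coordinates to the size of the largest graph in the batch and keep a boolean mask of the real nodes. The sketch below only builds the padded tensors and the mask; masking itself is still listed under Todo above, so until it lands you can also simply run one graph per forward pass with batch size 1.

import torch

graphs = [
    (torch.randn(12, 512), torch.randn(12, 3)),   # 12-node molecule
    (torch.randn(16, 512), torch.randn(16, 3)),   # 16-node molecule
]

max_nodes = max(f.shape[0] for f, _ in graphs)

feats = torch.zeros(len(graphs), max_nodes, 512)
coors = torch.zeros(len(graphs), max_nodes, 3)
mask  = torch.zeros(len(graphs), max_nodes, dtype = torch.bool)

for i, (f, c) in enumerate(graphs):
    n = f.shape[0]
    feats[i, :n] = f
    coors[i, :n] = c
    mask[i, :n]  = True

# hypothetical call once masking is supported:
# feats_out, coors_out = model(feats, coors, mask = mask)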
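
On the memory-efficient attention comment above: for the scalar (feature) branch, simply iterating over query chunks already shrinks the largest live attention tensor from (n, n) to (chunk, n). The sketch below is generic chunked attention in the spirit of "Self-attention Does Not Need O(n^2) Memory" (whose full scheme also chunks over keys with a running softmax); it is not the repository's implementation, and extending it to the coordinate / relative-distance branch is the part the comment leaves open.

import torch

def chunked_attention(q, k, v, chunk_size = 256):
    # q, k, v: (batch, heads, n, dim_head)
    scale = q.shape[-1] ** -0.5
    out_chunks = []
    for qc in (q * scale).split(chunk_size, dim = -2):
        sim = torch.einsum('b h i d, b h j d -> b h i j', qc, k)   # (batch, heads, chunk, n)
        attn = sim.softmax(dim = -1)
        out_chunks.append(torch.einsum('b h i j, b h j d -> b h i d', attn, v))
    return torch.cat(out_chunks, dim = -2)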
Releases (1.0.2)
Owner
Phil Wang
Working with Attention. It's all we need.