Official implementation of "Generating 3D Molecules for Target Protein Binding"

Last update: Dec 07, 2022

Related tags

Overview

Generating 3D Molecules for Target Protein Binding

This is the official implementation of the GraphBP method proposed in the following paper.

Meng Liu, Youzhi Luo, Kanji Uchino, Koji Maruhashi, and Shuiwang Ji. "Generating 3D Molecules for Target Protein Binding".

Requirements

We include key dependencies below. The versions we used are in the parentheses. Our detailed environmental setup is available in environment.yml.

PyTorch (1.9.0)
PyTorch Geometric (1.7.2)
rdkit-pypi (2021.9.3)
biopython (1.79)
openbabel (3.3.1)

Preparing Data

Download and extract the CrossDocked2020 dataset:

wget https://bits.csb.pitt.edu/files/crossdock2020/CrossDocked2020_v1.1.tgz -P data/crossdock2020/
tar -C data/crossdock2020/ -xzf data/crossdock2020/CrossDocked2020_v1.1.tgz
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_train0_fixed.types -P data/crossdock2020/
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_test0_fixed.types -P data/crossdock2020/

Note: (1) The unzipping process could take a lot of time. Unzipping on SSD is much faster!!! (2) Several samples in the training set cannot be processed by our code. Hence, we recommend replacing the it2_tt_0_lowrmsd_mols_train0_fixed.types file with a new one, where these samples are deleted. The new one is available here.

Split data files:

python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_train0_fixed.types data/crossdock2020
python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_test0_fixed.types data/crossdock2020

Run

Train GraphBP from scratch:

CUDA_VISIBLE_DEVICES=${you_gpu_id} python main.py

Note: GraphBP can be trained on a 48GB GPU with batchsize=16. Our trained model is avaliable here.

Generate atoms in the 3D space with the trained model:

CUDA_VISIBLE_DEVICES=${you_gpu_id} python main_gen.py

Postprocess and then save the generated molecules:

CUDA_VISIBLE_DEVICES=${you_gpu_id} python main_eval.py

Reference

@article{liu2022graphbp,
      title={Generating 3D Molecules for Target Protein Binding},
      author={Meng Liu and Youzhi Luo and Kanji Uchino and Koji Maruhashi and Shuiwang Ji},
      journal={arXiv preprint arXiv:2204.09410},
      year={2022},
}

Official implementation of "Generating 3D Molecules for Target Protein Binding"

Related tags

Overview

Generating 3D Molecules for Target Protein Binding

Requirements

Preparing Data

Run

Reference

Owner

DIVE Lab, Texas A&M University

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

A 3D sparse LBM solver implemented using Taichi

Spectralformer: Rethinking hyperspectral image classification with transformers

Pytorch implementation of Bert and Pals: Projected Attention Layers for Efficient Adaptation in Multi-Task Learning

The Instructed Glacier Model (IGM)

Code for the paper: Sketch Your Own GAN

Code for project: "Learning to Minimize Remainder in Supervised Learning".

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Code for 2021 NeurIPS --- Towards Multi-Grained Explainability for Graph Neural Networks

TSIT: A Simple and Versatile Framework for Image-to-Image Translation

OMNIVORE is a single vision model for many different visual modalities

Fit Fast, Explain Fast

Data, notebooks, and articles associated with the RSNA AI Deep Learning Lab at RSNA 2021

The official implementation of the Hybrid Self-Attention NEAT algorithm

Object detection using yolo-tiny model and opencv used as backend

Some pre-commit hooks for OpenMMLab projects

QuALITY: Question Answering with Long Input Texts, Yes!