Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Last update: Jan 07, 2023

Related tags

Overview

Decision Transformer

Lili Chen*, Kevin Lu*, Aravind Rajeswaran, Kimin Lee, Aditya Grover, Michael Laskin, Pieter Abbeel, Aravind Srinivas†, and Igor Mordatch†

*equal contribution, †equal advising

A link to our paper can be found on arXiv.

Overview

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling. Contains scripts to reproduce experiments.

Instructions

We provide code in two sub-directories: atari containing code for Atari experiments and gym containing code for OpenAI Gym experiments. See corresponding READMEs in each folder for instructions; scripts should be run from the respective directories. It may be necessary to add the respective directories to your PYTHONPATH.

Citation

Please cite our paper as:

@article{chen2021decisiontransformer,
  title={Decision Transformer: Reinforcement Learning via Sequence Modeling},
  author={Lili Chen and Kevin Lu and Aravind Rajeswaran and Kimin Lee and Aditya Grover and Michael Laskin and Pieter Abbeel and Aravind Srinivas and Igor Mordatch},
  journal={arXiv preprint arXiv:2106.01345},
  year={2021}
}

Note: this is not an official Google or Facebook product.

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Related tags

Overview

Decision Transformer

Overview

Instructions

Citation

Owner

Kevin Lu

Code for visualizing the loss landscape of neural nets

Compute descriptors for 3D point cloud registration using a multi scale sparse voxel architecture

RIM: Reliable Influence-based Active Learning on Graphs.

Video Instance Segmentation with a Propose-Reduce Paradigm (ICCV 2021)

Unsupervised Discovery of Object Radiance Fields

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Semantic Segmentation with Pytorch-Lightning

The Codebase for Causal Distillation for Language Models.

The CLRS Algorithmic Reasoning Benchmark

Study of human inductive biases in CNNs and Transformers.

Tensorflow2 Keras-based Semantic Segmentation Models Implementation

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks

This is the source code for generating the ASL-Skeleton3D and ASL-Phono datasets. Check out the README.md for more details.

Code to go with the paper "Decentralized Bayesian Learning with Metropolis-Adjusted Hamiltonian Monte Carlo"

Release of the ConditionalQA dataset

OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion.

Official codebase for running the small, filtered-data GLIDE model from GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models.

Code repository for our paper regarding the L3D dataset.

[AAAI 2021] MVFNet: Multi-View Fusion Network for Efficient Video Recognition

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.