Multi Task RL Baselines

Last update: Jan 09, 2023

Related tags

Deep Learning mtrl

Overview

MTRL

Multi Task RL Algorithms

Introduction
Setup
Usage
Documentation
Contributing to MTRL
Community
Acknowledgements

Introduction

MTRL is a library of multi-task reinforcement learning algorithms. It has two main components:

Building blocks and agents that implement the multi-task RL algorithms.
Experiment setups that enable training/evaluation on different setups.

Together, these two components enable use of MTRL across different environments and setups.

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

License

Citing MTRL

If you use MTRL in your research, please use the following BibTeX entry:

@Misc{Sodhani2021MTRL,
  author =       {Shagun Sodhani and Amy Zhang},
  title =        {MTRL - Multi Task RL Algorithms},
  howpublished = {Github},
  year =         {2021},
  url =          {https://github.com/facebookresearch/mtrl}
}

Setup

Clone the repository: git clone [email protected]:facebookresearch/mtrl.git.
Install dependencies: pip install -r requirements/dev.txt

Usage

MTRL supports 8 different multi-task RL algorithms as described here.
MTRL supports multi-task environments using MTEnv. These environments include MetaWorld and multi-task variants of DMControl Suite
Refer the tutorial to get started with MTRL.

Documentation

https://mtrl.readthedocs.io

Contributing to MTRL

There are several ways to contribute to MTRL.

Use MTRL in your research.
Contribute a new algorithm. We currently support 8 multi-task RL algorithms and are looking forward to adding more environments.
Check out the good-first-issues on GitHub and contribute to fixing those issues.
Check out additional details here.

Community

Ask questions in the chat or github issues:

Chat
Issues

Acknowledgements

Our implementation of SAC is inspired by Denis Yarats' implementation of SAC.
Project file pre-commit, mypy config, towncrier config, circleci etc are based on same files from Hydra.

Multi Task RL Baselines

Related tags

Overview

MTRL

Contents

Introduction

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

License

Citing MTRL

Setup

Usage

Documentation

Contributing to MTRL

Community

Acknowledgements

Owner

Facebook Research

Semi-supervised learning for object detection

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

Multi-Objective Reinforced Active Learning

PolyTrack: Tracking with Bounding Polygons

Points2Surf: Learning Implicit Surfaces from Point Clouds (ECCV 2020 Spotlight)

Config files for my GitHub profile.

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)

Parallel Latent Tree-Induction for Faster Sequence Encoding

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)

This repository contains part of the code used to make the images visible in the article "How does an AI Imagine the Universe?" published on Towards Data Science.

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

ROS-UGV-Control-Interface - Control interface which can be used in any UGV

Generative Art Using Neural Visual Grammars and Dual Encoders

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

This is the repository for Learning to Generate Piano Music With Sustain Pedals

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.

Benchmark VAE - Library for Variational Autoencoder benchmarking

Multi Task RL Baselines

Related tags

Overview

MTRL

Contents

Introduction

List of publications & submissions using MTRL (please create a pull request to add the missing entries):

License

Citing MTRL

Setup

Usage

Documentation

Contributing to MTRL

Community

Acknowledgements

Owner

Facebook Research

Semi-supervised learning for object detection

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

Multi-Objective Reinforced Active Learning

PolyTrack: Tracking with Bounding Polygons

Points2Surf: Learning Implicit Surfaces from Point Clouds (ECCV 2020 Spotlight)

Config files for my GitHub profile.

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)

Parallel Latent Tree-Induction for Faster Sequence Encoding

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)

This repository contains part of the code used to make the images visible in the article "How does an AI Imagine the Universe?" published on Towards Data Science.

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

ROS-UGV-Control-Interface - Control interface which can be used in any UGV

Generative Art Using Neural Visual Grammars and Dual Encoders

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

This is the repository for Learning to Generate Piano Music With Sustain Pedals

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren*, Raymond A. Yeh*, Alexander G. Schwing.

Benchmark VAE - Library for Variational Autoencoder benchmarking

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.