DrQ-v2: Improved Data-Augmented Reinforcement Learning

Last update: Jan 01, 2023

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

DrQ-v2 is a model-free off-policy algorithm for image-based continuous control. DrQ-v2 builds on DrQ, an actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements including:

Switch the base RL learner from SAC to DDPG.
Incorporate n-step returns to estimate TD error.
Introduce a decaying schedule for exploration noise.
Make implementation 3.5 times faster.
Find better hyper-parameters.

These changes allow us to significantly improve sample efficiency and wall-clock training time on a set of challening tasks from the DeepMind Control Suite compared to prior methods. Furthermore, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from pixel observations, previously unattained by model-free RL.

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{yarats2021drqv2,
  title={Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning},
  author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
  journal={arXiv preprint arXiv:},
  year={2021}
}

Instructions

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent:

python train.py task=quadruped_walk

Monitor results:

tensorboard --logdir exp_local

License

The majority of DrQ-v2 is licensed under the MIT license, however portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

DrQ-v2: Improved Data-Augmented Reinforcement Learning

Related tags

Overview

DrQ-v2: Improved Data-Augmented RL Agent

Method

Citation

Instructions

License

Owner

Facebook Research

Yolov5 + Deep Sort with PyTorch

Implementation for Homogeneous Unbalanced Regularized Optimal Transport

AAAI 2022: Stationary diffusion state neural estimation

MADE (Masked Autoencoder Density Estimation) implementation in PyTorch

A C implementation for creating 2D voronoi diagrams

Code accompanying the paper "How Tight Can PAC-Bayes be in the Small Data Regime?"

A simple approach to emable dense segmentation with ViT.

Neural Network Libraries

Reinforcement Learning for the Blackjack

A non-linear, non-parametric Machine Learning method capable of modeling complex datasets

Fight Recognition from Still Images in the Wild @ WACVW2022, Real-world Surveillance Workshop

This initial strategy was developed specifically for larger pools and is based on taking a moving average and deriving Bollinger Bands to create a projected active liquidity range.

Mesh TensorFlow: Model Parallelism Made Easier

Algorithmic trading using machine learning.

A general-purpose encoder-decoder framework for Tensorflow

Raster Vision is an open source Python framework for building computer vision models on satellite, aerial, and other large imagery sets

Scenarios, tutorials and demos for Autonomous Driving

ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Official code for our CVPR '22 paper "Dataset Distillation by Matching Training Trajectories"

Auxiliary Raw Net (ARawNet) is a ASVSpoof detection model taking both raw waveform and handcrafted features as inputs, to balance the trade-off between performance and model complexity.