Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but getting them to work in video games (RL competitions and whatnot) takes some nifty little tricks that need a bit of domain expertise. These include reward shaping, curriculum learning, splitting the task into subtasks by hand, and guiding the agent's actions. We took some of these tricks and tried them on three environments with DQN. With the right setup, you get more out of DQN.
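To give a rough idea of what one of these tricks looks like in code, here is a minimal sketch of reward shaping. It is not the code from the paper or from any of the branches: the ToyCorridor environment, the PotentialShaping wrapper and the potential function are all made up for illustration, and the shaping used is the generic potential-based form (adding gamma * phi(s') - phi(s) to the reward), which may differ from what the experiments actually do.

```python
class ToyCorridor:
    """Hypothetical 1-D corridor: start at 0, sparse reward of 1.0 at `goal`."""

    def __init__(self, goal=5):
        self.goal = goal
        self.pos = 0

    def reset(self):
        self.pos = 0
        return self.pos

    def step(self, action):
        # action: +1 moves right, -1 moves left (position never drops below 0)
        self.pos = max(0, self.pos + action)
        done = self.pos == self.goal
        reward = 1.0 if done else 0.0
        return self.pos, reward, done


class PotentialShaping:
    """Illustrative wrapper: adds gamma * phi(s') - phi(s) to the env reward."""

    def __init__(self, env, potential, gamma=0.99):
        self.env = env
        self.potential = potential  # phi: observation -> float
        self.gamma = gamma
        self._last_obs = None

    def reset(self):
        self._last_obs = self.env.reset()
        return self._last_obs

    def step(self, action):
        obs, reward, done = self.env.step(action)
        bonus = self.gamma * self.potential(obs) - self.potential(self._last_obs)
        self._last_obs = obs
        return obs, reward + bonus, done


# Potential grows as the agent gets closer to the goal (an assumed heuristic).
env = PotentialShaping(ToyCorridor(goal=5), potential=lambda pos: pos / 5, gamma=1.0)
env.reset()
_, r_right, _ = env.step(+1)  # moving towards the goal gets a positive bonus
_, r_left, _ = env.step(-1)   # moving away gets a negative one
print(r_right, r_left)
```

The agent now gets a small signal on every step instead of only at the goal, which is the general point of reward shaping. Picking a useful potential is exactly the kind of domain expertise mentioned above.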

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, check out the branch you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described in the respective branches. You should have three directories:

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.
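Before plotting, it can help to check that all three result directories are in place. The snippet below is a small sketch for that and is not part of the repository: the missing_run_dirs helper is hypothetical, and it assumes the directories sit in the directory you run plot_paper.py from, which this README does not state.

```python
from pathlib import Path

# Directory names as listed in this README.
EXPECTED_RUN_DIRS = ("vizdoom-runs", "minerl-runs", "football-runs")


def missing_run_dirs(root="."):
    """Return the expected result directories that do not exist under `root`."""
    root = Path(root)
    return [name for name in EXPECTED_RUN_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_run_dirs()
    if missing:
        print("Missing result directories:", ", ".join(missing))
    else:
        print("All result directories found.")
```

If anything is reported missing, go back to the corresponding branch and collect its results first.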
