Deeprl - Standard DQN and dueling network for simple games

Last update: Apr 12, 2020

Overview

DeepRL

This code implements the standard deep Q-learning and dueling network with experience replay (memory buffer) for playing simple games.

DQN algorithm implemented in this code is from the Google DeepMind's paper Playing Atari with Deep Reinforcement Learning[link].

Dueling network is from the paper Dueling Network Architectures for Deep Reinforcement Learning [link]

Requirement

DeepRL is implemented with Torch and the packages of its ecosystem. This code is well worked on my Mac Pro with CPU (I haven't tested it on Linux and GPU). Install Torch7 firstly, then you should install the following packages by luarocks

luarocks install nn
luarocks install image
luarocks install qt
luarocks install optim

Running

You can run this code by tapping the command in the project dir.

qlua main.lua

The result looks like

DQN: I got the accuracy of 93.2% (932 success of 1000 epochs).

Dueling: I got the accuracy of 99.2% (992 success of 1000 epochs).

Code

The envir.lua indicates the environment in reinforcement learning stage, which receives the action and produces the states and a reward for agent.

The agent.lua is the implementation of agent which receives the states and reward to produce the action directed by the policy network.

The learner.lua is the learning algorithm of DQN with experience replay as the following.

MISC

I completed this code when I was an intern at Horizon Robotics. I will greatly thank the article of Andrej Karpathy and other implementations:SeanNaren's code and EderSantana's gist.

LICENSE

MIT

Deeprl - Standard DQN and dueling network for simple games

Related tags

Overview

DeepRL

Requirement

Running

Code

MISC

LICENSE

Owner

Yao Zhou

Implement of homography net by pytorch

Multi-query Video Retreival

BridgeGAN - Tensorflow implementation of Bridging the Gap between Label- and Reference-based Synthesis in Multi-attribute Image-to-Image Translation.

Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition - NeurIPS2021

Machine Learning automation and tracking

CTRMs: Learning to Construct Cooperative Timed Roadmaps for Multi-agent Path Planning in Continuous Spaces

The official implementation of paper "Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks" (IJCV under review).

The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"

Editing a classifier by rewriting its prediction rules

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

PINN Burgers - 1D Burgers equation simulated by PINN

PyTorch code for the paper "FIERY: Future Instance Segmentation in Bird's-Eye view from Surround Monocular Cameras"

ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data

MetaBalance: High-Performance Neural Networks for Class-Imbalanced Data

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

Real-time object detection on Android using the YOLO network with TensorFlow

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

RP-GAN: Stable GAN Training with Random Projections

This repository is an open-source implementation of the ICRA 2021 paper: Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling.

Multi-Glimpse Network With Python