This is a clean and robust Pytorch implementation of DQN and Double DQN.

Last update: Dec 27, 2022

Related tags

Deep Learning DQN-DDQN-Pytorch

Overview

DQN/DDQN-Pytorch

This is a clean and robust Pytorch implementation of DQN and Double DQN. Here is the training curve:

All the experiments are trained with same hyperparameters.

A quick render here:

Dependencies

gym==0.18.3
numpy==1.21.2
pytorch==1.8.1

How to use my code

Train from scratch

run 'python main.py', where the default enviroment is CartPole-v1.

Play with trained model

run 'python main.py --write False --render True --Loadmodel True --ModelIdex 50000'

Change Enviroment

If you want to train on different enviroments, just run 'python main.py --EnvIdex 1'.
The --EnvIdex can be set to be 0 and 1, where
'--EnvIdex 0' for 'CartPole-v1'
'--EnvIdex 1' for 'LunarLander-v2'

Visualize the training curve

You can use the tensorboard to visualize the training curve. History training curve is saved at '\runs'

Hyperparameter Setting

For more details of Hyperparameter Setting, please check 'main.py'

References

DQN: Mnih V , Kavukcuoglu K , Silver D , et al. Playing Atari with Deep Reinforcement Learning[J]. Computer Science, 2013.

Double DQN: Hasselt H V , Guez A , Silver D . Deep Reinforcement Learning with Double Q-learning[J]. Computer ence, 2015.

This is a clean and robust Pytorch implementation of DQN and Double DQN.

Related tags

Overview

DQN/DDQN-Pytorch

Dependencies

How to use my code

Train from scratch

Play with trained model

Change Enviroment

Visualize the training curve

Hyperparameter Setting

References

Other RL algorithms by Pytorch can be found here.

Owner

XinJingHao

Classical OCR DCNN reproduction based on PaddlePaddle framework.

Logsig-RNN: a novel network for robust and efficient skeleton-based action recognition

ICCV2021 Oral SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks

Learning Logic Rules for Document-Level Relation Extraction

Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.

Few-NERD: Not Only a Few-shot NER Dataset

Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"

ML models implementation practice

Deep Hedging Demo - An Example of Using Machine Learning for Derivative Pricing.

Personal implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Language-Driven Semantic Segmentation

[cvpr22] Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

A lightweight python AUTOmatic-arRAY library.

RoMA: Robust Model Adaptation for Offline Model-based Optimization

Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022

Code for the paper "Adversarial Generator-Encoder Networks"

Continual Learning of Electronic Health Records (EHR).

OpenDelta - An Open-Source Framework for Paramter Efficient Tuning.

Official Pytorch implementation of Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations

AutoML library for deep learning