A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Last update: Dec 28, 2022

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website.

These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Disclaimer: The code is a little messy. I'd written this when I was not a Pythonista. If you would like to clean them up and want to make it into a nice interface, feel free to contact me. I will be very pleased to collaborate. If you use them then please cite the source and also mention the credits as listed below. Also, email me with ways to improve, let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Related tags

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Owner

Pulkit Khandelwal

MEDS: Enhancing Memory Error Detection for Large-Scale Applications

A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware.

Code for the SIGGRAPH 2021 paper "Consistent Depth of Moving Objects in Video".

Code for classifying international patents based on the text of their titles/abstracts

Quantum-enhanced transformer neural network

A sketch extractor for anime/illustration.

scikit-learn inspired API for CRFsuite

PyTorch implementation of GLOM

"Inductive Entity Representations from Text via Link Prediction" @ The Web Conference 2021

Official implementation of Densely connected normalizing flows

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Official implementation of NeurIPS 2021 paper "Contextual Similarity Aggregation with Self-attention for Visual Re-ranking"

Light-Head R-CNN

pix2pix in tensorflow.js

🍷 Gracefully claim weekly free games and monthly content from Epic Store.

implicit displacement field

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

Colab notebook and additional materials for Python-driven analysis of redlining data in Philadelphia

kullanışlı ve işinizi kolaylaştıracak bir araç