RL Algorithms with examples in Python / Pytorch / Unity ML agents

Last update: Aug 19, 2022

Overview

Reinforcement Learning Project

This project was created to make it easier to get started with Reinforcement Learning. It now contains:

An implementation of the DDPG Algorithm in Python, which works for both single-agent environments and multi-agent environments.
Single and parallel environments in Unity ML agents using the Python API.
Two Jupyter notebooks:
- 3DBall.ipynb: This is a simple example to get started with Unity ML Agents & the DDPG Algorithm.
- 3DBall_parallel_environment.ipynb: The same, but now for an environment run in parallel.

Getting Started

Install Basic Dependencies

To set up your python environment to run the code in the notebooks, follow the instructions below.

If you're on Windows I recommend installing Miniforge. It's a minimal installer for Conda. I also recommend using the Mamba package manager instead of Conda. It works almost the same as Conda, but only faster. There's a cheatsheet of Conda commands which also work in Mamba. To install Mamba, use this command:

conda install mamba -n base -c conda-forge

Create (and activate) a new environment with Python 3.6 or later. I recommend using Python 3.9:

Linux or Mac:

mamba create --name rl39 python=3.9 numpy
source activate rl39

Windows:

mamba create --name rl39 python=3.9 numpy
activate rl39

Install PyTorch by following instructions on Pytorch.org. For example, to install PyTorch on Windows with GPU support, use this command:

mamba install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch

Install additional packages:

mamba install jupyter notebook matplotlib

Create an IPython kernel for the rl39 environment in Jupyter.

python -m ipykernel install --user --name rl39 --display-name "rl39"

Change the kernel to match the rl39 environment by using the drop-down menu Kernel -> Change kernel inside Jupyter Notebook.

Install Unity Machine Learning Agents

Note: In order to run the notebooks on Windows, it's not necessary to install the Unity Editor, because I have provided the standalone executables of the environments for you.

Unity ML Agents is the software that we use for the environments. The agents that we create in Python can interact with these environments. Unity ML Agents consists of several parts:

The Unity Editor is used for creating environments. To install:
- Install Unity Hub.
- Install the latest version of Unity by clicking on the green button Unity Hub on the download page.
To start the Unity editor you must first have a project:
- Start the Unity Hub.
- Click on "Projects"
- Create a new dummy project.
- Click on the project you've just added in the Unity Hub. The Unity Editor should start now.
The Unity ML-Agents Toolkit. Download the latest release of the source code or use the Git command: git clone --branch release_18 https://github.com/Unity-Technologies/ml-agents.git.
The Unity ML Agents package is used inside the Unity Editor. Please read the instructions for installation.
The mlagents Python package is used as a bridge between Python and the Unity editor (or standalone executable). To install, use this command: python -m pip install mlagents==0.27.0. Please note that there's no conda package available for this.

Install an IDE for Python

For Windows, I would recommend using PyCharm (my choice), or Visual Studio Code. Inside those IDEs you can use the Conda environment you have just created.

Creating a custom Unity executable

Load the examples project

The Unity ML-Agents Toolkit contains several example environments. Here we will load them all inside the Unity editor:

Start the Unity Hub.
Click on "Projects"
Add a project by navigating to the Project folder inside the toolkit.
Click on the project you've just added in the Unity Hub. The Unity Editor should start now.

Create a 3D Ball executable

The 3D Ball example contains 12 environments in one, but this doesn't work very well in the Python API. The main problem is that there's no way to reset each environment individually. Therefore, we will remove the other 11 environments in the editor:

Load the 3D Ball scene, by going to the project window and navigating to Examples -> 3DBall -> Scenes-> 3DBall
In the Hierarchy window select the other 11 3DBall objects and delete them, so that only the 3DBall object remains.

Next, we will build the executable:

Go to File -> Build Settings
In the Build Settings window, click Build
Navigate to notebooks folder and add 3DBall to the folder name that is used for the build.

Instructions for running the notebooks

Download the Unity executables for Windows. In case you're not on Windows, you have to build the executables yourself by following the instructions above.
Place the Unity executable folders in the same folder as the notebooks.
Load a notebook with Jupyter notebook. (The command to start Jupyter notebook is jupyter notebook)
Follow further instructions in the notebook.

Releases(v1.0.0)

v1.0.0(Jan 14, 2022)

initial release
Source code(tar.gz)
Source code(zip)

An example project demonstrating how the Autonomous Learning Library can be used to build new reinforcement learning agents.

About This repository shows how Autonomous Learning Library can be used to build new reinforcement learning agents. In particular, it contains a model

5 Aug 30, 2022

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

TextWorld A text-based game generator and extensible sandbox learning environment for training and testing reinforcement learning (RL) agents. Also ch

983 Dec 23, 2022

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

Pacman AI Jussi Doherty CAP 4601 - Introduction to Artificial Intelligence - Fall 2020 Python version 3.0+ Source of this project This repo contains a

1 Jan 3, 2022

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Algo-ScriptML Python implementations of some of the fundamental Machine Learning models and algorithms from scratch. The goal of this project is not t

81 Nov 26, 2022

Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

PyTorch Implementation of Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers 1 Using Colab Please notic

489 Jan 7, 2023

Causal-Adversarial-Instruments - PyTorch Implementation for Developing Library of Investigating Adversarial Examples on A Causal View by Instruments

Causal-Adversarial-Instruments This is a PyTorch Implementation code for develop

26 Dec 28, 2022

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

RL Algorithms with examples in Python / Pytorch / Unity ML agents

Related tags

Overview

Reinforcement Learning Project

Getting Started

Install Basic Dependencies

Install Unity Machine Learning Agents

Install an IDE for Python

Creating a custom Unity executable

Load the examples project

Create a 3D Ball executable

Instructions for running the notebooks

You might also like...

An example project demonstrating how the Autonomous Learning Library can be used to build new reinforcement learning agents.

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.

Causal-Adversarial-Instruments - PyTorch Implementation for Developing Library of Investigating Adversarial Examples on A Causal View by Instruments

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

TensorRT examples (Jetson, Python/C++)(object detection)

Hi Guys, here I am providing examples, which will help you in Lerarning Python

Releases(v1.0.0)

v1.0.0(Jan 14, 2022)

Owner

Rogier Wachters

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

Streamlit tool to explore coco datasets

Python implementation of a live deep learning based age/gender/expression recognizer

A simple, clean TensorFlow implementation of Generative Adversarial Networks with a focus on modeling illustrations.

Adversarial Attacks on Probabilistic Autoregressive Forecasting Models.

[CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution

Google AI Open Images - Object Detection Track: Open Solution

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

Designing a Practical Degradation Model for Deep Blind Image Super-Resolution (ICCV, 2021) (PyTorch) - We released the training code!

Official implementation for Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder at NeurIPS 2020

Supporting code for "Autoregressive neural-network wavefunctions for ab initio quantum chemistry".

上海交通大学全自动抢课脚本，支持准点开抢与抢课后持续捡漏两种模式。2021/06/08更新。

Official implementation for (Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching, AAAI-2021)

Self-Adaptable Point Processes with Nonparametric Time Decays

Official PyTorch implementation of "Synthesis of Screentone Patterns of Manga Characters"

End-to-end speech secognition toolkit

(JMLR'19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

Fine-grained Control of Image Caption Generation with Abstract Scene Graphs

Federated learning on graph, especially on graph neural networks (GNNs), knowledge graph, and private GNN.

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.