Multi agent DDPG algorithm written in Python + Pytorch

Last update: Feb 26, 2022

Related tags

Overview

Project 3: Collaboration and Competition

Project Details

For this project, you will work with the Tennis environment.

In this environment, two agents control rackets to bounce a ball over a net. If an agent hits the ball over the net, it receives a reward of +0.1. If an agent lets a ball hit the ground or hits the ball out of bounds, it receives a reward of -0.01. Thus, the goal of each agent is to keep the ball in play.

The observation space consists of 8 variables corresponding to the position and velocity of the ball and racket. Each agent receives its own, local observation. Two continuous actions are available, corresponding to movement toward (or away from) the net, and jumping.

The task is episodic, and in order to solve the environment, your agents must get an average score of +0.5 (over 100 consecutive episodes, after taking the maximum over both agents). Specifically,

After each episode, we add up the rewards that each agent received (without discounting), to get a score for each agent. This yields 2 (potentially different) scores. We then take the maximum of these 2 scores.
This yields a single score for each episode.

The environment is considered solved, when the average (over 100 episodes) of those scores is at least +0.5.

Getting Started

Dependencies

To set up your python environment to run the code in the notebook, follow the instructions below.

Create (and activate) a new environment with Python 3.6.

Linux or Mac:

conda create --name drlnd python=3.6
source activate drlnd

Windows:

conda create --name drlnd python=3.6 
activate drlnd

Clone the repository, and navigate to the python/ folder. Then, install several dependencies.

git clone https://github.com/udacity/deep-reinforcement-learning.git
cd deep-reinforcement-learning/python
pip install .

Note: You may encounter issues with installing Pytorch 0.4.0. In that case, please replace the file python/requirements.txt with the file requirements.txt inside this project.

Create an IPython kernel for the drlnd environment.

python -m ipykernel install --user --name drlnd --display-name "drlnd"

Before running code in a notebook, change the kernel to match the drlnd environment by using the drop-down Kernel menu.

Instructions

Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
(For Windows users) Check out this link if you need help with determining if your computer is running a 32-bit version or 64-bit version of the Windows operating system.

(For AWS) If you'd like to train the agent on AWS (and have not enabled a virtual screen), then please use this link to obtain the "headless" version of the environment. You will not be able to watch the agent without enabling a virtual screen, but you will be able to train the agent. (To watch the agent, you should follow the instructions to enable a virtual screen, and then download the environment for the Linux operating system above.)
Place the extracted files in the same folder as the notebook Tennis.ipynb.
Load the notebook with Jupyter notebook. (The command to start Jupyter notebook is jupyter notebook)
Follow further instructions in the notebook.

You might also like...

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Language Emergence in Multi Agent Dialog Code for the Paper Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog Satwik Kottur, José M.

105 Nov 25, 2022

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Language Emergence in Multi Agent Dialog Code for the Paper Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog Satwik Kottur, José M.

105 Nov 25, 2022

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

WideLinears Pytorch parallel Neural Networks A package of pytorch modules for fast paralellization of separate deep neural networks. Ideal for agent-b

1 Dec 17, 2021

A lightweight Python-based 3D network multi-agent simulator. Uses a cell-based congestion model. Calculates risk, loudness and battery capacities of the agents. Suitable for 3D network optimization tasks.

AMAZ3DSim AMAZ3DSim is a lightweight python-based 3D network multi-agent simulator. It uses a cell-based congestion model. It calculates risk, battery

13 Nov 4, 2022

Spatial Intention Maps for Multi-Agent Mobile Manipulation (ICRA 2021)

Multi agent DDPG algorithm written in Python + Pytorch

Related tags

Overview

Project 3: Collaboration and Competition

Project Details

Getting Started

Dependencies

Instructions

You might also like...

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

Pytorch modules for paralel models with same architecture. Ideal for multi agent-based systems

A lightweight Python-based 3D network multi-agent simulator. Uses a cell-based congestion model. Calculates risk, loudness and battery capacities of the agents. Suitable for 3D network optimization tasks.

Spatial Intention Maps for Multi-Agent Mobile Manipulation (ICRA 2021)

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"

Multi Agent Path Finding Algorithms

A parallel framework for population-based multi-agent reinforcement learning.

Releases(v1.0.0)

v1.0.0(Dec 29, 2021)

Owner

Rogier Wachters

Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS.

FCN (Fully Convolutional Network) is deep fully convolutional neural network architecture for semantic pixel-wise segmentation

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

DAT4 - General Assembly's Data Science course in Washington, DC

Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL

PyTorch implementation of MulMON

A cool little repl-based simulation written in Python

Code & Data for Enhancing Photorealism Enhancement

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

BuildingNet: Learning to Label 3D Buildings

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch

PyTorch implementation of MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

Simple Python application to transform Serial data into OSC messages

pytorch bert intent classification and slot filling

Brain Tumor Detection with Tensorflow Neural Networks.

PyImpetus is a Markov Blanket based feature subset selection algorithm that considers features both separately and together as a group in order to provide not just the best set of features but also the best combination of features

ivadomed is an integrated framework for medical image analysis with deep learning.