Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

Last update: Jan 07, 2023

Overview

DRL-robot-navigation

Deep Reinforcement Learning for mobile robot navigation in the ROS Gazebo simulator. Using a Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles. Obstacles are detected by laser readings, and the goal is given to the robot in polar coordinates. The network is trained in the ROS Gazebo simulator with PyTorch. Tested with ROS Melodic on Ubuntu 18.04 with Python 3.6.9 and PyTorch 1.10.
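As an illustration of what a goal "in polar coordinates" means, the sketch below converts a goal position into a distance and a heading relative to the robot. The function name `goal_in_polar`, its arguments, and the [-pi, pi) wrapping convention are assumptions made for this example; it is not code taken from the repository.

```python
import math


def goal_in_polar(robot_x, robot_y, robot_yaw, goal_x, goal_y):
    """Return (distance, heading) of the goal relative to the robot.

    heading is the signed angle in radians, wrapped to [-pi, pi), between
    the robot's forward direction and the direction to the goal.
    All names here are hypothetical, chosen for this illustration only.
    """
    dx = goal_x - robot_x
    dy = goal_y - robot_y
    distance = math.hypot(dx, dy)      # straight-line distance to the goal
    bearing = math.atan2(dy, dx)       # direction to the goal in the world frame
    heading = bearing - robot_yaw      # same direction, relative to the robot
    # Wrap the relative angle into [-pi, pi)
    heading = (heading + math.pi) % (2 * math.pi) - math.pi
    return distance, heading
```

For a robot at the origin facing along +x (yaw 0), a goal at (0, 2) gives a distance of 2 and a heading of pi/2, that is, 90 degrees counterclockwise from the robot's forward direction.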

[Figure: training example]

Pre-print of the article:

More information is given in the article at: https://arxiv.org/abs/2103.07119

Please cite as:
@misc{cimurs2021goaldriven,
title={Goal-Driven Autonomous Exploration Through Deep Reinforcement Learning},
author={Reinis Cimurs and Il Hong Suh and Jin Han Lee},
year={2021},
eprint={2103.07119},
archivePrefix={arXiv},
primaryClass={cs.RO}
}

Main dependencies: ROS Melodic and PyTorch.

Clone the repository:

$ cd ~
### Clone this repo
$ git clone https://github.com/reiniscimurs/DRL-robot-navigation

The network can be run with a standard 2D laser, but this implementation uses a simulated 3D Velodyne sensor.
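One plausible way to feed a 3D sensor into a network that expects 2D-laser-like input is to collapse the point cloud into a fixed number of angular sectors and keep the closest return in each. The sketch below is a minimal illustration under that assumption. The function name, the number of sectors, the front 180-degree field of view, the height threshold, and the maximum range are all hypothetical and not taken from the repository.

```python
import math


def points_to_sector_ranges(points, num_sectors=20, max_range=10.0, min_z=-0.2):
    """Collapse 3D points (x, y, z) in the sensor frame into per-sector minimum distances.

    Sectors split the front half-plane (-pi/2 to pi/2) evenly. Sectors with
    no return keep max_range. All defaults are illustrative assumptions.
    """
    ranges = [max_range] * num_sectors
    sector_width = math.pi / num_sectors
    for x, y, z in points:
        if z < min_z:
            continue  # skip returns below the height threshold (e.g. the ground)
        angle = math.atan2(y, x)
        if not -math.pi / 2 <= angle < math.pi / 2:
            continue  # skip points behind the sensor
        # Map the angle to a sector index, guarding against rounding at the edge
        idx = min(int((angle + math.pi / 2) / sector_width), num_sectors - 1)
        dist = math.hypot(x, y)  # planar distance, as a 2D laser would report
        ranges[idx] = min(ranges[idx], dist)
    return ranges
```

The resulting list has a fixed length regardless of how many points the sensor returns, which is what makes it usable as a network input in place of a 2D laser scan.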

Compile the workspace:

$ cd ~/DRL-robot-navigation/catkin_ws
### Compile
$ catkin_make_isolated

Open a terminal and set up sources:

$ export ROS_HOSTNAME=localhost
$ export ROS_MASTER_URI=http://localhost:11311
$ export ROS_PORT_SIM=11311
$ export GAZEBO_RESOURCE_PATH=~/DRL-robot-navigation/catkin_ws/src/multi_robot_scenario/launch
$ source ~/.bashrc
$ cd ~/DRL-robot-navigation/catkin_ws
$ source devel_isolated/setup.bash
### Run the training
$ cd ~/DRL-robot-navigation/TD3
$ python3 velodyne_td3.py
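
For orientation, the critic target that the TD3 algorithm computes can be sketched in plain Python. This is a generic, single-transition illustration of TD3's clipped double-Q target with target policy smoothing. The function `td3_target`, its callable arguments, and the hyperparameter defaults are assumptions for this example and are not taken from `velodyne_td3.py`.

```python
import random


def td3_target(reward, done, next_state, target_actor, target_critic_1,
               target_critic_2, gamma=0.99, policy_noise=0.2, noise_clip=0.5,
               max_action=1.0, rng=random):
    """Return the TD3 critic target for one transition.

    target_actor(state) -> list of action values
    target_critic_i(state, action) -> scalar Q estimate
    These callables are hypothetical stand-ins for the target networks.
    """
    def clip(value, low, high):
        return max(low, min(high, value))

    # Target policy smoothing: add clipped Gaussian noise to the target action,
    # then clip the result to the valid action range
    next_action = [
        clip(a + clip(rng.gauss(0.0, policy_noise), -noise_clip, noise_clip),
             -max_action, max_action)
        for a in target_actor(next_state)
    ]
    # Clipped double Q-learning: use the smaller of the two target estimates
    q_next = min(target_critic_1(next_state, next_action),
                 target_critic_2(next_state, next_action))
    # No bootstrapping from the next state when the episode has ended
    return reward + (1.0 - float(done)) * gamma * q_next
```

Taking the minimum of two critics counteracts the overestimation that a single critic tends to accumulate, and the noise on the target action smooths the value estimate over nearby actions.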

To kill the training process:

$ killall -9 rosout roslaunch rosmaster gzserver nodelet robot_state_publisher gzclient python python3

[Figure: Gazebo environment]

[Figure: Rviz]
