Flexible Option Learning - NeurIPS 2021

Last update: Nov 09, 2022

Flexible Option Learning

This repository contains the code for the paper Flexible Option Learning, presented as a Spotlight at NeurIPS 2021. The implementation is based on gym-miniworld, OpenAI's baselines, and the tabular implementation of Option-Critic.

Contents:

Tabular Experiments (Four-Rooms)
Continuous Control (MuJoCo)
Maze Navigation (MiniWorld)
Cite

Tabular Experiments (Four-Rooms)

Installation and Launch code

pip install gym==0.12.1
cd diagnostic_experiments/
python main_fixpol.py --multi_option # for experiments with fixed options
python main.py --multi_option # for experiments with learned options

Continuous Control (MuJoCo)

Installation

virtualenv moc_cc --python=python3
source moc_cc/bin/activate
pip install tensorflow==1.12.0 
cd continuous_control
pip install -e . 
pip install gym==0.9.3
pip install mujoco-py==0.5.1

Launch

cd baselines/ppoc_int
python run_mujoco.py --switch --nointfc --env AntWalls --eta 0.9 --mainlr 8e-5 --intlr 8e-5 --piolr 8e-5
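
To compare several values of --eta, the launch command above can be wrapped in a small loop. This is a minimal sketch and not part of the repository: the helper name ppoc_cmd, the DRY_RUN switch, and the eta values other than 0.9 are assumptions chosen for illustration. All flags are copied unchanged from the command above, and the script is meant to be run from baselines/ppoc_int.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): builds the launch
# command shown above for a given eta value.
ppoc_cmd() {
    echo "python run_mujoco.py --switch --nointfc --env AntWalls --eta $1 --mainlr 8e-5 --intlr 8e-5 --piolr 8e-5"
}

# DRY_RUN=1 (the default in this sketch) only prints each command;
# set DRY_RUN=0 to actually execute the runs one after another.
DRY_RUN="${DRY_RUN:-1}"

# Illustrative eta values; only 0.9 appears in the original command.
for eta in 0.5 0.7 0.9; do
    cmd="$(ppoc_cmd "$eta")"
    if [ "$DRY_RUN" = "1" ]; then
        echo "$cmd"
    else
        $cmd
    fi
done
```

Printing the commands first makes it easy to check the flags before committing to long training runs.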

Maze Navigation (MiniWorld)

Installation

virtualenv moc_vision --python=python3
source moc_vision/bin/activate
pip install tensorflow==1.13.1
cd vision_miniworld
pip install -e .
pip install gym==0.15.4

Launch

cd baselines/
# Run agent in first task
python run.py --alg=ppo2_options --env=MiniWorld-WallGap-v0 --num_timesteps 2500000 --save_interval 1000  --num_env 8 --noptions 4 --eta 0.7

# Load and run agent in transfer task
python run.py --alg=ppo2_options --env=MiniWorld-WallGapTransfer-v0 --load_path path/to/model --num_timesteps 2500000 --save_interval 1000  --num_env 8 --noptions 4 --eta 0.7
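
The two commands above share every flag except the environment and the load path, so they can be generated from one set of shared settings to keep the first task and the transfer task consistent. The following is a minimal sketch, not part of the repository: the names COMMON_FLAGS, MODEL_PATH, train_cmd and transfer_cmd are hypothetical, and path/to/model remains the README's placeholder, to be replaced with wherever the first run saved its model.

```shell
#!/bin/sh
# Flags shared by both runs, copied from the commands above.
COMMON_FLAGS="--alg=ppo2_options --num_timesteps 2500000 --save_interval 1000 --num_env 8 --noptions 4 --eta 0.7"

# Placeholder kept from the README; override with the real saved-model location.
MODEL_PATH="${MODEL_PATH:-path/to/model}"

# Hypothetical helpers: print the command for the first task and the transfer task.
train_cmd() {
    echo "python run.py $COMMON_FLAGS --env=MiniWorld-WallGap-v0"
}

transfer_cmd() {
    echo "python run.py $COMMON_FLAGS --env=MiniWorld-WallGapTransfer-v0 --load_path $MODEL_PATH"
}

# Print both commands for inspection; copy them into the shell to run them.
train_cmd
transfer_cmd
```

Setting MODEL_PATH in the environment before sourcing the script changes only the transfer command, which keeps the remaining hyperparameters identical across the two runs.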

Cite

If you find this work useful, please consider adding it to your references.

@inproceedings{
klissarov2021flexible,
title={Flexible Option Learning},
author={Martin Klissarov and Doina Precup},
booktitle={Thirty-Fifth Conference on Neural Information Processing Systems},
year={2021},
url={https://openreview.net/forum?id=L5vbEVIePyb}
}

Owner

Martin Klissarov
