A PyTorch implementation of Implicit Q-Learning

Last update: Dec 12, 2022

Related tags

Overview

IQL-PyTorch

This repository houses a minimal PyTorch implementation of Implicit Q-Learning (IQL), an offline reinforcement learning algorithm, along with a script to run IQL on tasks from the D4RL benchmark.

Note that the paper's authors have published their official implementation, which is based on JAX. My implementation is intended to be an alternative for PyTorch users, and my general recommendation is to use the authors' code unless you specifically want/need PyTorch for some reason. I am planning to validate my implementation against the results stated in the paper once I have some spare compute.

Owner

Garrett Thomas

CS PhD student at Stanford

GitHub Repository

UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus

UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus General info This is

71 Oct 25, 2022

A PyTorch implementation of Implicit Q-Learning

Related tags

Overview

IQL-PyTorch

Owner

Garrett Thomas

UmlsBERT: Clinical Domain Knowledge Augmentation of Contextual Embeddings Using the Unified Medical Language System Metathesaurus

Geometry-Free View Synthesis: Transformers and no 3D Priors

《Rethinking Sptil Dimensions of Vision Trnsformers》(2021)

Flexible-CLmser: Regularized Feedback Connections for Biomedical Image Segmentation

Multi-Scale Geometric Consistency Guided Multi-View Stereo

A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.

用强化学习DQN算法，训练AI模型来玩合成大西瓜游戏，提供Keras版本和PARL（paddle）版本

使用yolov5训练自己数据集(详细过程)并通过flask部署

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Efficient face emotion recognition in photos and videos

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

Relaxed-machines - explorations in neuro-symbolic differentiable interpreters

Perception-aware multi-sensor fusion for 3D LiDAR semantic segmentation (ICCV 2021)

dualFace: Two-Stage Drawing Guidance for Freehand Portrait Sketching (CVMJ)

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2021)

Face Recognition Attendance Project

Second Order Optimization and Curvature Estimation with K-FAC in JAX.

A repository for benchmarking neural vocoders by their quality and speed.

DeepMoCap: Deep Optical Motion Capture using multiple Depth Sensors and Retro-reflectors

An unofficial implementation of "Unpaired Image Super-Resolution using Pseudo-Supervision." CVPR2020