Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

Last update: Dec 20, 2022

Related tags

Overview

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

This is the code associated with the paper Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks, published at CVPR 2020.

Data generation

In our work we report results on two large-scale autonomous driving datasets: NuScenes and Argoverse. The birds-eye-view ground truth labels we use to train and evaluate our networks are generated by combining map information provided by the two datasets with 3D bounding box annotations, which we rasterise to produce a set of one-hot binary labels. We also make use of LiDAR point clouds to infer regions of the birds-eye-view which are completely occluded by buildings or other objects.

NuScenes

To train our method on NuScenes you will first need to

Download the NuScenes dataset which can be found at https://www.nuscenes.org/download. Only the metadata, keyframe and lidar blobs are necessary.
Download the map expansion pack. Note that to replicate our original results you should use the original version of the expansion (v1.0). The later versions fixed some bugs with the original maps so we would expect even better performance!
Install the NuScenes devkit from https://github.com/nutonomy/nuscenes-devkit
Cd to mono-semantic-maps
Edit the configs/datasets/nuscenes.yml file, setting the dataroot and label_root entries to the location of the NuScenes dataset and the desired ground truth folder respectively.
Run our data generation script: python scripts/make_nuscenes_labels.py. Bewarned there's a lot of data so this will take a few hours to run!

Argoverse

To train on the Argoverse dataset:

Download the Argoverse tracking data from https://www.argoverse.org/data.html#tracking-link. Our models were trained on version 1.1, you will need to download the four training blobs, validation blob, and the HD map data.
Install the Argoverse devkit from https://github.com/argoai/argoverse-api
Cd to mono-semantic-maps
Edit the configs/datasets/argoverse.yml file, setting the dataroot and label_root entries to the location of the install Argoverse data and the desired ground truth folder respectively.
Run our data generation script: python scripts/make_argoverse_labels.py. This script will also take a while to run!

Training

Once ground truth labels have been generated, you can train our method by running the train.py script in the root directory:

python train.py --dataset nuscenes --model pyramid

The --dataset flag allows you to specify the dataset to train on, either 'argoverse' or 'nuscenes'. The model flag allows training of the proposed method 'pyramid', or one of the baseline methods ('vpn' or 'ved'). Additional command line options can be specified by passing a list of key-value pairs to the --options flag. The full list of configurable options can be found in the configs/defaults.yml file.

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

Related tags

Overview

Predicting Semantic Map Representations from Images with Pyramid Occupancy Networks

Data generation

NuScenes

Argoverse

Training

Owner

Thomas Roddick

DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.

Non-Attentive-Tacotron - This is Pytorch Implementation of Google's Non-attentive Tacotron.

Reinforcement Learning for Portfolio Management

CTRL-C: Camera calibration TRansformer with Line-Classification

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

Self-Supervised Methods for Noise-Removal

Codes to calculate solar-sensor zenith and azimuth angles directly from hyperspectral images collected by UAV. Works only for UAVs that have high resolution GNSS/IMU unit.

An implementation of the efficient attention module.

PyTorch code of my WACV 2022 paper Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

A python comtrade load library accelerated by go

Koopman operator identification library in Python

CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition (NeurIPS 2019)

Wanli Li and Tieyun Qian: Exploit a Multi-head Reference Graph for Semi-supervised Relation Extraction, IJCNN 2021

DCGAN-tensorflow - A tensorflow implementation of Deep Convolutional Generative Adversarial Networks

Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021

The project was to detect traffic signs, based on the Megengine framework.

Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness