[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Last update: Dec 16, 2022

Related tags

Overview

Panoptic NeRF

Project Page | Paper | Dataset

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation
Xiao Fu*, Shangzhan zhang*, Tianrun Chen, Yichong Lu, Lanyun Zhu, Xiaowei Zhou, Andreas Geiger, Yiyi Liao
arXiv 2022

Installation

Create a virtual environment via conda.

conda create -n panopticnerf python=3.7
conda activate panopticnerf

Install torch and torchvision.

conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch

Install requirements.
```
pip install -r requirements.txt
```

Data Preparation

We evaluate our model on KITTI-360. Here we show the structure of a test dataset as follow. You can download it from here and then put it into $ROOT (RGBs should query the KITTI-360 website).

├── KITTI-360
  ├── 2013_05_28_drive_0000_sync
    ├── image_00
    ├── image_01
  ├── bbx_intersection
    ├── *_00.npz
    ├── *_01.npz
  ├── calibration
    ├── calib_cam_to_pose.txt
    ├── perspective.txt
  ├── data_3d_bboxes
  ├── data_poses
    ├── cam0_to_world.txt
    ├── poses.txt
  ├── pspnet
  ├── sgm
  ├── visible_id

file	Intro
`image_00/01`	stereo RGB images
`pspnet`	2D pseudo ground truth
`sgm`	weak stereo depth supervision
`visible_id`	per-frame bounding primitive IDs
`data_poses`	system poses in a global Euclidean coordinate
`calibration`	extrinsics and intrinsics of the perspective cameras
`bbx_intersection`	ray-mesh intersections, containing depths between hitting points and camera origin, semantic label IDs and bounding primitive IDs

Generate ray-mesh intersections (bbx_intersection/*.npz). The red dots and blue dots indicate where the rays hit into and out of the meshes, respectively. For the given test scene, START=3353, NUM=64.

# image_00
python mesh_intersection.py intersection_start_frame ${START} intersection_frames ${NUM} use_stereo False
# image_01
python mesh_intersection.py intersection_start_frame ${START} intersection_frames ${NUM} use_stereo True

Evaluate the origin of a scene (center_pose) and the distance from the origin to the furthest bounding primitive (dist_min). Then accordingly modify the .yaml file.
```
python recenter_pose.py recenter_start_frame ${START} recenter_frames ${NUM}
```

Training and Visualization

We provide the training code. Replace resume False with resume True to load the pretained model.

python train_net.py --cfg_file configs/panopticnerf_test.yaml pretrain nerf gpus '1,' use_stereo True use_pspnet True use_depth True pseudo_filter True weight_th 0.05 resume False

Render semantic map, panoptic map and depth map in a single forward pass, which takes around 10s per-frame on a single 3090 GPU. Please make sure to maximize the GPU memory utilization by increasing the size of the chunk to reduce inference time. Replace use_stereo False with use_stereo True to render the right views.
```
python run.py --type visualize --cfg_file configs/panopticnerf_test.yaml use_stereo False
```
Visualize novel view appearance & label synthesis. Before rendering, select a frame and generate corresponding ray-mesh intersections with respect to its novel spiral poses by enabling spiral poses==True in lib.datasets.kitti360.panopticnerf.py.

Evaluation

├── KITTI-360
  ├── gt_2d_semantics
  ├── gt_2d_panoptics
  ├── lidar_depth

Download the corresponding pretrained model and put it to $ROOT/data/trained_model/panopticnerf/panopticnerf_test/latest.pth.
We provide some semantic & panoptic GTs and LiDAR point clouds for evaluation. The details of evaluation metrics can be found in the paper.
Eval mean intersection-over-union (mIoU)

python run.py --type eval_miou --cfg_file configs/panopticnerf_test.yaml use_stereo False

Eval panoptic quality (PQ)

sh eval_pq_test.sh

Eval depth with 0-100m LiDAR point clouds, where the far depth can be adjusted to evaluate the closer scene.

python run.py --type eval_depth --cfg_file configs/panopticnerf_test.yaml use_stereo False max_depth 100.

Eval Multi-view Consistency (MC)

python eval_consistency.py --cfg_file configs/panopticnerf_test.yaml use_stereo False consistency_thres 0.1

News

12/04/2022 Code released.
29/03/2022 Repo created. Code will come soon.

Citation

@article{fu2022panoptic,
  title={Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation},
  author={Fu, Xiao and Zhang, Shangzhan and Chen, Tianrun and Lu, Yichong and Zhu, Lanyun and Zhou, Xiaowei and Geiger, Andreas and Liao, Yiyi},
  journal={arXiv preprint arXiv:2203.15224},
  year={2022}
}

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

Related tags

Overview

Panoptic NeRF

Project Page | Paper | Dataset

Installation

Data Preparation

Training and Visualization

Evaluation

News

Citation

Owner

Xiao Fu

Code for the paper "There is no Double-Descent in Random Forests"

Tool for working with Y-chromosome data from YFull and FTDNA

CTC segmentation python package

AFLNet: A Greybox Fuzzer for Network Protocols

Video Instance Segmentation using Inter-Frame Communication Transformers (NeurIPS 2021)

mmdetection version of TinyBenchmark.

[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

An AI made using artificial intelligence (AI) and machine learning algorithms (ML) .

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"

Code for our NeurIPS 2021 paper Mining the Benefits of Two-stage and One-stage HOI Detection

Blender Add-on that sets a Material's Base Color to one of Pantone's Colors of the Year

Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

Public repository containing materials used for Feed Forward (FF) Neural Networks article.

Simulate genealogical trees and genomic sequence data using population genetic models

Free Book about Deep-Learning approaches for Chess (like AlphaZero, Leela Chess Zero and Stockfish NNUE)

A tutorial on training a DarkNet YOLOv4 model for the CrowdHuman dataset

Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

Repo for CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

Python scripts using the Mediapipe models for Halloween.