Deep Two-View Structure-from-Motion Revisited

Last update: Jan 06, 2023

Overview

Deep Two-View Structure-from-Motion Revisited

This repository provides the code for our CVPR 2021 paper Deep Two-View Structure-from-Motion Revisited.

We have provided the functions for training, validating, and visualization.

Note: some config flags are designed for ablation study, and we have a plan to re-org the codes later. Please feel free to submit issues if you feel confused about some parts.

Requirements

Python = 3.6.x
Pytorch >= 1.6.0
CUDA >= 10.1

and the others could be installed by

pip install -r requirements.txt

Pytorch from 1.1.0 to 1.6.0 should also work well, but it will disenable mixed precision training, and we have not tested it.

To use the RANSAC five-point algorithm, you also need to

cd RANSAC_FiveP

python setup.py install --user

The CUDA extension would be installed as 'essential_matrix'. Tested under Ubuntu and CUDA 10.1.

Models

Pretrained models are provided here.

KITTI Depth

To reproduce our results, please first download the KITTI dataset RAW data and 14GB official depth maps. You should also download the split files provided by us, and unzip them into the root of the KITTI raw data. Then, modify the gt_depth_dir (KITTI_loader.py, L278) to the address of KITTI official depth maps.

For training,

python main.py -b 32 --lr 0.0005 --nlabel 128 --fix_flownet \
--data PATH/TO/YOUR/KITTI/DATASET --cfg cfgs/kitti.yml \
--pretrained-depth depth_init.pth.tar --pretrained-flow flow_init.pth.tar

For evaluation,

python main.py -v -b 1 -p 1 --nlabel 128 \
--data PATH/TO/YOUR/KITTI/DATASET --cfg cfgs/kitti.yml \
--pretrained kitti.pth.tar"

The default evaluation split is Eigen, where the metric abs_rel should be around 0.053 and rmse should be close to 2.22. If you would like to use the Eigen SfM split, please set cfg.EIGEN_SFM = True and cfg.KITTI_697 = False.

KITTI Pose

For fair comparison, we use a KITTI odometry evaluation toolbox as provided here. Please generate poses by sequence, and evaluate the results correspondingly.

Acknowledgment:

Thanks Shihao Jiang and Dylan Campbell for sharing the implementation of the GPU-accelerated RANSAC Five-point algorithm. We really appreciate the valuable feedback from our area chairs and reviewers. We would like to thank Charles Loop for helpful discussions and Ke Chen for providing field test images from NVIDIA AV cars.

BibTex:

@article{wang2021deep,
  title={Deep Two-View Structure-from-Motion Revisited},
  author={Wang, Jianyuan and Zhong, Yiran and Dai, Yuchao and Birchfield, Stan and Zhang, Kaihao and Smolyanskiy, Nikolai and Li, Hongdong},
  journal={CVPR},
  year={2021}
}

Deep Two-View Structure-from-Motion Revisited

Related tags

Overview

Deep Two-View Structure-from-Motion Revisited

Requirements

Models

KITTI Depth

KITTI Pose

Acknowledgment:

BibTex:

Owner

Jianyuan Wang

Start-to-finish tutorial for interactive music co-creation in PyTorch and Tensorflow.js

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

The official PyTorch implementation of the paper: Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." .

Unsupervised Image to Image Translation with Generative Adversarial Networks

Download files from DSpace systems (because for some reason DSpace won't let you)

Code for MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

Real-time VIBE: Frame by Frame Inference of VIBE (Video Inference for Human Body Pose and Shape Estimation)

Simple Baselines for Human Pose Estimation and Tracking

Command-line tool for downloading and extending the RedCaps dataset.

pcnaDeep integrates cutting-edge detection techniques with tracking and cell cycle resolving models.

Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer

Dungeons and Dragons randomized content generator

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Code and datasets for TPAMI 2021

Codes for NAACL 2021 Paper "Unsupervised Multi-hop Question Answering by Question Generation"

Regulatory Instruments for Fair Personalized Pricing.

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

AISTATS 2019: Confidence-based Graph Convolutional Networks for Semi-Supervised Learning

A Lighting Pytorch Framework for Recommendation System, Easy-to-use and Easy-to-extend.

Deep Two-View Structure-from-Motion Revisited

Related tags

Overview

Deep Two-View Structure-from-Motion Revisited

Requirements

Models

KITTI Depth

KITTI Pose

Acknowledgment:

BibTex:

Owner

Jianyuan Wang

Start-to-finish tutorial for interactive music co-creation in PyTorch and Tensorflow.js

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

The official PyTorch implementation of the paper: *Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." *.

Unsupervised Image to Image Translation with Generative Adversarial Networks

Download files from DSpace systems (because for some reason DSpace won't let you)

Code for MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

Real-time VIBE: Frame by Frame Inference of VIBE (Video Inference for Human Body Pose and Shape Estimation)

Simple Baselines for Human Pose Estimation and Tracking

Command-line tool for downloading and extending the RedCaps dataset.

pcnaDeep integrates cutting-edge detection techniques with tracking and cell cycle resolving models.

Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer

Dungeons and Dragons randomized content generator

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Code and datasets for TPAMI 2021

Codes for NAACL 2021 Paper "Unsupervised Multi-hop Question Answering by Question Generation"

Regulatory Instruments for Fair Personalized Pricing.

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

AISTATS 2019: Confidence-based Graph Convolutional Networks for Semi-Supervised Learning

A Lighting Pytorch Framework for Recommendation System, Easy-to-use and Easy-to-extend.

The official PyTorch implementation of the paper: Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." .