PyTorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

Last update: Nov 25, 2021

Overview

This is a PyTorch Lightning version of PSMNet, based on JiaRenChang/PSMNet.

Run python main.py to start training.

PSM-Net

PyTorch reimplementation of the PSM-Net paper, "Pyramid Stereo Matching Network" (CVPR 2018), by Jia-Ren Chang and Yong-Sheng Chen.

Official repository: JiaRenChang/PSMNet

Usage

1) Requirements

Python 3.5+
PyTorch 0.4
opencv-python
Matplotlib
tensorboardX
TensorBoard

All dependencies are listed in requirements.txt. Run the command below to install them.

pip install -r requirements.txt

2) Train

usage: train.py [-h] [--maxdisp MAXDISP] [--logdir LOGDIR] [--datadir DATADIR]
                [--cuda CUDA] [--batch-size BATCH_SIZE]
                [--validate-batch-size VALIDATE_BATCH_SIZE]
                [--log-per-step LOG_PER_STEP]
                [--save-per-epoch SAVE_PER_EPOCH] [--model-dir MODEL_DIR]
                [--lr LR] [--num-epochs NUM_EPOCHS]
                [--num-workers NUM_WORKERS]

PSMNet

optional arguments:
  -h, --help            show this help message and exit
  --maxdisp MAXDISP     max diparity
  --logdir LOGDIR       log directory
  --datadir DATADIR     data directory
  --cuda CUDA           gpu number
  --batch-size BATCH_SIZE
                        batch size
  --validate-batch-size VALIDATE_BATCH_SIZE
                        batch size
  --log-per-step LOG_PER_STEP
                        log per step
  --save-per-epoch SAVE_PER_EPOCH
                        save model per epoch
  --model-dir MODEL_DIR
                        directory where save model checkpoint
  --lr LR               learning rate
  --num-epochs NUM_EPOCHS
                        number of training epochs
  --num-workers NUM_WORKERS
                        num workers in loading data

For example:

python train.py --batch-size 16 \
                --logdir log/example \
                --num-epochs 500
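
The flags in the usage message above could be declared with argparse roughly as follows. This is a minimal sketch, not the repository's actual train.py: the function name build_parser is hypothetical, and every type and default value is an assumption, because the help output does not show them.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the flags listed in the train.py usage message.

    All types and defaults are illustrative assumptions, not the repository's values.
    """
    parser = argparse.ArgumentParser(prog="train.py", description="PSMNet")
    parser.add_argument("--maxdisp", type=int, default=192, help="max disparity")
    parser.add_argument("--logdir", default="log", help="log directory")
    parser.add_argument("--datadir", default="data", help="data directory")
    parser.add_argument("--cuda", type=int, default=0, help="gpu number")
    parser.add_argument("--batch-size", type=int, default=1, help="batch size")
    parser.add_argument("--validate-batch-size", type=int, default=1,
                        help="validation batch size")
    parser.add_argument("--log-per-step", type=int, default=1, help="log per step")
    parser.add_argument("--save-per-epoch", type=int, default=1,
                        help="save model per epoch")
    parser.add_argument("--model-dir", default="checkpoint",
                        help="directory where model checkpoints are saved")
    parser.add_argument("--lr", type=float, default=1e-3, help="learning rate")
    parser.add_argument("--num-epochs", type=int, default=300,
                        help="number of training epochs")
    parser.add_argument("--num-workers", type=int, default=4,
                        help="number of data-loading workers")
    return parser


# Parse the same flags as the example command instead of reading sys.argv.
args = build_parser().parse_args(
    ["--batch-size", "16", "--logdir", "log/example", "--num-epochs", "500"]
)
print(args.batch_size, args.logdir, args.num_epochs)
```

Note that argparse exposes dashed flags with underscores, so --batch-size is read as args.batch_size. Flags that are not passed fall back to their (assumed) defaults.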

3) Visualize result

This repository uses tensorboardX to visualize training results. Find your log directory and launch TensorBoard to inspect them. The default log directory is /log.

tensorboard --logdir <your_log_dir>

Here are some of my training results (after training for 1000 epochs on KITTI2015):

4) Inference

usage: inference.py [-h] [--maxdisp MAXDISP] [--left LEFT] [--right RIGHT]
                    [--model-path MODEL_PATH] [--save-path SAVE_PATH]

PSMNet inference

optional arguments:
  -h, --help            show this help message and exit
  --maxdisp MAXDISP     max diparity
  --left LEFT           path to the left image
  --right RIGHT         path to the right image
  --model-path MODEL_PATH
                        path to the model
  --save-path SAVE_PATH
                        path to save the disp image

For example:

python inference.py --left test/left.png \
                    --right test/right.png \
                    --model-path checkpoint/08/best_model.ckpt \
                    --save-path test/disp.png

5) Pretrained model

A model trained for 1000 epochs on the KITTI2015 dataset can be downloaded here. (I chose the best model among the 1000 epochs.)

state {
    'epoch': 857,
    '3px-error': 3.466
}
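
The '3px-error' value above is presumably the percentage of pixels whose predicted disparity is off by more than 3 pixels. The following is a minimal NumPy sketch of such a metric, not the repository's code: the function name three_pixel_error is hypothetical, and it assumes that ground-truth pixels with disparity 0 are invalid, as is common for sparse KITTI ground truth. The repository's exact definition may differ; the official KITTI 2015 D1 metric, for instance, additionally requires the error to exceed 5% of the true disparity.

```python
import numpy as np


def three_pixel_error(pred, gt, threshold=3.0):
    """Return the percentage of valid pixels with |pred - gt| > threshold.

    Assumption: a ground-truth disparity of 0 marks a pixel without ground truth.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    valid = gt > 0  # mask of pixels that have ground truth
    if not valid.any():
        return 0.0  # nothing to evaluate
    abs_err = np.abs(pred[valid] - gt[valid])
    return float((abs_err > threshold).mean() * 100.0)


# Toy 2x2 disparity maps: one pixel has no ground truth, one is off by 5 px.
gt = np.array([[10.0, 20.0], [0.0, 40.0]])
pred = np.array([[11.0, 25.0], [7.0, 40.5]])
error = three_pixel_error(pred, gt)
print(f"3px-error: {error:.3f}%")
```

In the toy example, three pixels are valid and one of them exceeds the threshold, so the metric reports one third of the valid pixels as erroneous.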

Contact

Email: [email protected]

Discussion is welcome!


Owner

XIAOTIAN LIU
