TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Last update: Dec 29, 2022

Overview

Keras implementation of PSPNet(caffe)

Implemented Architecture of Pyramid Scene Parsing Network in Keras.

For the best compability please use Python3.5

Setup

Install dependencies:
- Tensorflow (-gpu)
- Keras
- numpy
- scipy
- pycaffe(PSPNet)(optional for converting the weights)
```
pip install -r requirements.txt --upgrade
```
Converted trained weights are needed to run the network. Weights(in .h5 .json format) have to be downloaded and placed into directory weights/keras

Already converted weights can be downloaded here:

Convert weights by yourself(optional)

(Note: this is not required if you use .h5/.json weights)

Running this needs the compiled original PSPNet caffe code and pycaffe.

python weight_converter.py <path to .prototxt> <path to .caffemodel>

Usage:

python pspnet.py -m <model> -i <input_image>  -o <output_path>
python pspnet.py -m pspnet101_cityscapes -i example_images/cityscapes.png -o example_results/cityscapes.jpg
python pspnet.py -m pspnet101_voc2012 -i example_images/pascal_voc.jpg -o example_results/pascal_voc.jpg

List of arguments:

 -m --model        - which model to use: 'pspnet50_ade20k', 'pspnet101_cityscapes', 'pspnet101_voc2012'
    --id           - (int) GPU Device id. Default 0
 -s --sliding      - Use sliding window
 -f --flip         - Additional prediction of flipped image
 -ms --multi_scale - Predict on multiscale images

Keras results:

Implementation details

The interpolation layer is implemented as custom layer "Interp"
Forward step takes about ~1 sec on single image

Memory usage can be optimized with:

config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.3 
sess = tf.Session(config=config)

ndimage.zoom can take a long time

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

Related tags

Overview

Keras implementation of PSPNet(caffe)

Setup

Convert weights by yourself(optional)

Usage:

Keras results:

Implementation details

Owner

VladKry

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

A library for uncertainty representation and training in neural networks.

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features

PartImageNet is a large, high-quality dataset with part segmentation annotations

Pytorch implementation of the DeepDream computer vision algorithm

CHERRY is a python library for predicting the interactions between viral and prokaryotic genomes

Simple streamlit app to demonstrate HERE Tour Planning

Fast, differentiable sorting and ranking in PyTorch

Multimodal Co-Attention Transformer (MCAT) for Survival Prediction in Gigapixel Whole Slide Images

Fast, Attemptable Route Planner for Navigation in Known and Unknown Environments

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network

Contains code for the paper "Vision Transformers are Robust Learners".

Segmentation-Aware Convolutional Networks Using Local Attention Masks

Repo for "Physion: Evaluating Physical Prediction from Vision in Humans and Machines" submission to NeurIPS 2021 (Datasets & Benchmarks track)

KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

This repo contains the code for paper Inverse Weighted Survival Games

The Noise Contrastive Estimation for softmax output written in Pytorch

[ICCV 2021] Target Adaptive Context Aggregation for Video Scene Graph Generation

The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.

PyTorch implementation of Tacotron speech synthesis model.