TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation

Introduction

TernausNet is a modification of the celebrated UNet architecture that is widely used for binary Image Segmentation. For more details, please refer to our arXiv paper.

Pre-trained encoder speeds up convergence even on the datasets with a different semantic features. Above curve shows validation Jaccard Index (IOU) as a function of epochs for Aerial Imagery

This architecture was a part of the winning solutiuon (1st out of 735 teams) in the Carvana Image Masking Challenge.

Installation

pip install ternausnet

Citing TernausNet

Please cite TernausNet in your publications if it helps your research:

@ARTICLE{arXiv:1801.05746,
         author = {V. Iglovikov and A. Shvets},
          title = {TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation},
        journal = {ArXiv e-prints},
         eprint = {1801.05746},
           year = 2018
        }

Example of the train and test pipeline

https://github.com/ternaus/robot-surgery-segmentation

UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset

Related tags

Overview

TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation

Introduction

Installation

Citing TernausNet

Example of the train and test pipeline

Owner

Vladimir Iglovikov

Code for our NeurIPS 2021 paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains

Prototypical Networks for Few shot Learning in PyTorch

Pytorch implementation of AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks

Hysterese plugin with two temperature offset areas

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》

Implementation of paper "Graph Condensation for Graph Neural Networks"

A denoising diffusion probabilistic model (DDPM) tailored for conditional generation of protein distograms

Learning cell communication from spatial graphs of cells

A LiDAR point cloud cluster for panoptic segmentation

Recurrent Scale Approximation (RSA) for Object Detection

A knowledge base construction engine for richly formatted data

Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving

NeurIPS 2021, "Fine Samples for Learning with Noisy Labels"

PyTorch implementation for our AAAI 2022 Paper "Graph-wise Common Latent Factor Extraction for Unsupervised Graph Representation Learning"

Official implementation for paper: A Latent Transformer for Disentangled Face Editing in Images and Videos.

Distance Encoding for GNN Design

KAPAO is an efficient multi-person human pose estimation model that detects keypoints and poses as objects and fuses the detections to predict human poses.