Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Last update: Aug 30, 2022

Overview

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation:

Work in progress: the results from the paper can't be replicated yet with the models here.

UPDATE (April 28th): skip connections added thanks to the reviewers; see the model in model-tiramasu-67-func-api.py.

Feel free to open issues with suggestions :)

Keras 2 + TensorFlow were used for the recent updates, which might cause some conflicts with the previous versions I had in here.

What is The One Hundred Layers Tiramisu?

A state-of-the-art (as of Jan 2017) pixel-wise semantic image segmentation model. It consists of deep, fully convolutional blocks arranged as a downsampling path, skip connections, and an upsampling path.
An extension of DenseNets to deal with the problem of semantic segmentation.

Fully Convolutional DenseNet = (Dense Blocks + Transition Down Blocks) + (Bottleneck Blocks) + (Dense Blocks + Transition Up Blocks) + Pixel-Wise Classification layer
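As a rough illustration of how the block formula above adds up to the model names, the sketch below counts convolutional layers for the three variants. The per-block layer counts are taken from the paper, not read from this repo's model files, and `CONFIGS` and `count_layers` are hypothetical names used only for this example.

```python
# Dense-block sizes as listed in the paper (assumption: the repo's
# model files follow the same configuration).
CONFIGS = {
    "FC-DenseNet56": {"down": [4, 4, 4, 4, 4], "bottleneck": 4},
    "FC-DenseNet67": {"down": [5, 5, 5, 5, 5], "bottleneck": 5},
    "FC-DenseNet103": {"down": [4, 5, 7, 10, 12], "bottleneck": 15},
}


def count_layers(down, bottleneck):
    """Count convolutional layers in a Fully Convolutional DenseNet."""
    first_conv = 1
    # Dense-block layers plus one convolution per Transition Down.
    down_path = sum(down) + len(down)
    # Mirrored dense blocks plus one transposed convolution per Transition Up.
    up_path = sum(down) + len(down)
    classifier = 1  # final 1x1 convolution for pixel-wise classification
    return first_conv + down_path + bottleneck + up_path + classifier


for name, cfg in CONFIGS.items():
    print(name, count_layers(cfg["down"], cfg["bottleneck"]))
```

Under these assumptions the totals come out to 56, 67 and 103, which is where the variant names come from.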

The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio) arXiv:1611.09326 cs.CV

Requirements:

Keras==2.0.2
tensorflow-gpu==1.0.1
or just go ahead and do: pip install -r requirements.txt

Model Structure:

DenseBlock: BatchNormalization + Activation [ Relu ] + Convolution2D + Dropout
TransitionDown: BatchNormalization + Activation [ Relu ] + Convolution2D + Dropout + MaxPooling2D
TransitionUp: Deconvolution2D (transposed convolution)
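To see how these blocks interact, it can help to trace the number of feature maps through the network. The sketch below does that in plain Python under assumptions taken from the paper rather than from this repo: a first convolution with 48 filters, a growth rate of 16, dense-block outputs concatenated with their inputs on the way down, and Transition Up applied only to the feature maps newly created by the preceding block. `trace_channels` is a hypothetical helper.

```python
def trace_channels(down, bottleneck, growth_rate=16, first_conv_filters=48):
    """Return the feature-map count after each dense block.

    The list covers the down path, then the bottleneck, then the up path.
    """
    channels = first_conv_filters
    skips = []
    trace = []

    # Down path: each dense layer adds `growth_rate` maps, and the block
    # output is concatenated with the block input. Transition Down keeps
    # the channel count and only halves the spatial resolution.
    for n_layers in down:
        channels += n_layers * growth_rate
        skips.append(channels)
        trace.append(channels)

    # Bottleneck: only the newly created maps are passed upward.
    new_maps = bottleneck * growth_rate
    trace.append(channels + new_maps)

    # Up path: upsampled new maps are concatenated with the skip
    # connection of matching resolution, then a dense block follows.
    for n_layers, skip in zip(reversed(down), reversed(skips)):
        block_input = new_maps + skip
        new_maps = n_layers * growth_rate
        trace.append(block_input + new_maps)

    return trace


print(trace_channels([4, 5, 7, 10, 12], bottleneck=15))
```

Passing only the new feature maps upward is what keeps the channel count from growing without bound in the upsampling path.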

Model Params:

RMSprop is used with a learning rate of 0.001 and a decay of 0.995 (in the paper, an exponential learning-rate decay applied after each epoch).
- However, using those got me nowhere, so I switched to SGD and started tweaking the learning rate and decay myself.
No details are given about the BatchNorm parameters; again, I have gone with what the original DenseNet paper suggested.
Things to keep in mind, perhaps:
- the weight init: he_uniform (maybe change it around?)
- the regularization: too aggressive?
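For reference when tweaking the learning rate, here is a minimal sketch of an exponential schedule, assuming the 0.995 figure is a multiplicative learning-rate decay applied once per epoch (the paper's setup, not necessarily what the training script here does). `exponential_decay_lr` is a hypothetical helper.

```python
def exponential_decay_lr(epoch, initial_lr=0.001, decay=0.995):
    """Learning rate after `epoch` epochs of per-epoch exponential decay."""
    return initial_lr * decay ** epoch


# Print the schedule at a few epochs to see how slowly it decays.
for epoch in (0, 1, 50, 150):
    print(epoch, round(exponential_decay_lr(epoch), 6))
```

A function of this shape could be passed to Keras's `LearningRateScheduler` callback, though whether that matches the intended training setup is untested here.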

Repo (explanation):

Download the CamVid dataset as explained below:
- Use data_loader.py to crop the images to 224 x 224, as in the paper's implementation.
Run python model-tiramasu-67-func-api.py or python model-tirmasu-56.py for now to generate each model's file.
Run python train-tirmasu.py to start training:
- Saves the best checkpoints for the model; a data loader for the CamVid dataset is included.
helper.py contains two methods, normalized and one_hot_it, currently for the CamVid task.
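The cropping and one-hot steps mentioned above might look roughly like the following NumPy sketch. It is not the code from data_loader.py or helper.py: the function names, the choice of a center crop, the 360 x 480 input size, and the class count of 12 (11 CamVid classes plus a void class) are all assumptions made for illustration.

```python
import numpy as np

NUM_CLASSES = 12  # assumption: 11 CamVid classes plus "void"


def center_crop(image, size=224):
    """Crop the central size x size window from an (H, W, ...) array."""
    height, width = image.shape[:2]
    top = (height - size) // 2
    left = (width - size) // 2
    return image[top:top + size, left:left + size]


def one_hot_labels(label_map, num_classes=NUM_CLASSES):
    """Turn an (H, W) array of class ids into (H, W, num_classes) one-hot."""
    label_map = np.asarray(label_map, dtype=np.int64)
    # Indexing an identity matrix by class id yields one row per pixel.
    return np.eye(num_classes, dtype=np.float32)[label_map]


# Random labels stand in for a real annotation image.
labels = np.random.randint(0, NUM_CLASSES, size=(360, 480))
cropped = center_crop(labels)
encoded = one_hot_labels(cropped)
print(cropped.shape, encoded.shape)
```

The one-hot output has one channel per class, which is the shape a pixel-wise softmax layer is usually trained against.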

Dataset:

In a different directory, run this to download the dataset from the original implementation:
- git clone git@github.com:alexgkendall/SegNet-Tutorial.git
- Copy the /CamVid directory to here, or change the DataPath in data_loader.py to the above directory.
Then run python data_loader.py to generate these two files:
- /data/train_data.npz and /data/train_label.npz
- This makes it easy to run the model over and over, rather than waiting for the data to be loaded into memory each time.
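The caching idea can be sketched with NumPy's `.npz` format as below. The file names follow the ones listed above, but the array key `"data"`, the helper names, and the dummy array shapes are assumptions and may not match what data_loader.py actually writes.

```python
import os
import tempfile

import numpy as np


def save_arrays(directory, images, labels):
    """Write images and labels to train_data.npz and train_label.npz."""
    os.makedirs(directory, exist_ok=True)
    data_path = os.path.join(directory, "train_data.npz")
    label_path = os.path.join(directory, "train_label.npz")
    # The key "data" is a hypothetical choice for this sketch.
    np.savez_compressed(data_path, data=images)
    np.savez_compressed(label_path, data=labels)
    return data_path, label_path


def load_array(path):
    """Load the single array stored under the "data" key."""
    with np.load(path) as archive:
        return archive["data"]


# Dummy arrays stand in for the preprocessed CamVid images and labels.
images = np.zeros((4, 224, 224, 3), dtype=np.float32)
labels = np.zeros((4, 224, 224), dtype=np.uint8)

with tempfile.TemporaryDirectory() as tmp:
    data_path, label_path = save_arrays(os.path.join(tmp, "data"), images, labels)
    restored_images = load_array(data_path)
    restored_labels = load_array(label_path)

print(restored_images.shape, restored_labels.shape)
```

Loading the cached arrays skips decoding and cropping every image again on each training run.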

Experiments:

Models	Acc	Loss	Notes
FC-DenseNet 67			150 Epochs, RMSPROP

To Do:

[x] FC-DenseNet 103
[x] FC-DenseNet 56
[x] FC-DenseNet 67
[ ] Replicate Test Accuracy CamVid Task
[ ] Replicate Test Accuracy GaTech Dataset Task
[ ] Requirements

Original Results Table:


Owner

Yad Konrad
