Implementation of U-Net and SegNet for building segmentation

Last update: Dec 07, 2022

Overview

Specialized project

Created by Katrine Nguyen and Martin Wangen-Eriksen as part of our specialized project at the Norwegian University of Science and Technology (NTNU).

Models

Most of our code, including the U-Net model, is heavily inspired by the Unet-for-Person-Segmentation project. We wrote the SegNet model ourselves, based on other implementations of SegNet in TensorFlow.

Data

The model is trained and tested on the Massachusetts Buildings Dataset from Kaggle. The original images were 1500x1500 pixels, each covering an area of 1500x1500 meters (1 m x 1 m per pixel). The 137 original images were cropped into 64x64-pixel tiles, and tiles without buildings were filtered out.
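The cropping and filtering step can be sketched as follows. This is a minimal illustration, not the authors' script: the function name `crop_tiles` is hypothetical, the arrays are synthetic stand-ins for a real image and mask, and the sketch assumes non-overlapping tiles. Since 1500 is not a multiple of 64 (23 x 64 = 1472), it simply skips the 28-pixel strip left over at the right and bottom edges; how the original preprocessing treated those edges is not stated here.

```python
import numpy as np

def crop_tiles(image, mask, tile=64):
    """Yield (image_tile, mask_tile) pairs that contain at least one building pixel."""
    height, width = mask.shape[:2]
    # Step over the image in non-overlapping windows; partial tiles at the
    # right and bottom edges are skipped in this sketch (an assumption).
    for top in range(0, height - tile + 1, tile):
        for left in range(0, width - tile + 1, tile):
            mask_tile = mask[top:top + tile, left:left + tile]
            if mask_tile.any():  # keep only tiles with building pixels
                yield image[top:top + tile, left:left + tile], mask_tile

# Synthetic stand-ins for one 1500x1500 image and its mask
image = np.zeros((1500, 1500, 3), dtype=np.uint8)
mask = np.zeros((1500, 1500), dtype=np.uint8)
mask[10:20, 10:20] = 255  # one small "building" inside the top-left tile

tiles = list(crop_tiles(image, mask))
print(len(tiles))  # 1
```

Under these assumptions each image yields at most 23 x 23 = 529 tiles before filtering.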

To make the masks compatible with our model, the masks were converted from white (255, 255, 255) labels to greyscale with value 1. This is done in image_fix.py, found in the repo.
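A conversion of this kind might look like the sketch below. It is not the contents of image_fix.py; the function name `binarize_mask` is hypothetical, and it assumes the mask is already loaded as an RGB NumPy array of shape (height, width, 3).

```python
import numpy as np

def binarize_mask(mask_rgb):
    """Map a white-on-black RGB mask (H, W, 3) to a single-channel 0/1 mask (H, W)."""
    # A pixel counts as building only if all three channels are 255 (pure white).
    building = np.all(mask_rgb == 255, axis=-1)
    return building.astype(np.uint8)

# Tiny synthetic mask: one white pixel, three black pixels
mask_rgb = np.zeros((2, 2, 3), dtype=np.uint8)
mask_rgb[0, 1] = (255, 255, 255)

binary = binarize_mask(mask_rgb)
print(binary.tolist())  # [[0, 1], [0, 0]]
```

If the masks are stored as .jpg, lossy compression can leave near-white values instead of exactly 255. In that case a threshold (for example, treating values above 127 as building) may be more robust than the exact comparison used here; that threshold is an assumption, not a value taken from the project.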

Folder structure

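Images and masks are stored in local directories and read by data.py and test.py. You can change the paths, but to use the code exactly as it is, follow the folder structure below. Note that the code refers to the directories in lowercase (images, masks, test), so the names must match on case-sensitive file systems.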


.
├── ...
├── building-segmentation                # Directory for all images
│   ├── Images                           # Directory for raw images
│   │   ├── cropped_images_train_64      # Directory for cropped images where number specifies resolution, containing .jpg
│   │   ├── cropped_images_train_128     # Directory for cropped images where number specifies resolution, containing .jpg
│   │   └── ...                          # More directories with other resolutions
│   ├── Masks                            # Directory for all masks
│   │   ├── cropped_masks_train_64       # Directory for cropped masks where number specifies resolution, containing .jpg
│   │   ├── cropped_masks_train_128      # Directory for cropped masks where number specifies resolution, containing .jpg
│   │   └── ...                          # More directories with other resolutions
│   └── Test                             # Directory for test images
│       ├── test_64                      # Directory for images where number specifies resolution, containing .jpg
│       └── ...                          # More directories with other resolutions
└── ...

# data.py
import os
from glob import glob

# In main:
dataset_path = "building-segmentation"

# Collect the 64x64 training crops and their masks
images = glob(os.path.join(dataset_path, "images/cropped_images_train_64/*"))
masks = glob(os.path.join(dataset_path, "masks/cropped_masks_train_64/*"))

# test.py
from glob import glob

test_images = glob("building-segmentation/test/test_64/*")
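One thing to watch with the paths above: `glob` does not guarantee any particular ordering, so the image list and the mask list are not guaranteed to line up. The sketch below sorts both lists before pairing them. The helper `load_pairs` is hypothetical and not part of data.py, and it assumes each image and its mask share the same file name. The demo builds a throwaway directory tree so it can run without the dataset.

```python
import os
import tempfile
from glob import glob

def load_pairs(dataset_path, size=64):
    """Return sorted (image_path, mask_path) pairs for one crop resolution."""
    images = sorted(glob(os.path.join(dataset_path, "images", f"cropped_images_train_{size}", "*")))
    masks = sorted(glob(os.path.join(dataset_path, "masks", f"cropped_masks_train_{size}", "*")))
    if len(images) != len(masks):
        raise ValueError(f"{len(images)} images but {len(masks)} masks")
    # Sorting both lists keeps each image aligned with the mask of the same name.
    return list(zip(images, masks))

# Demo on a temporary directory with empty placeholder files
with tempfile.TemporaryDirectory() as root:
    for sub in ("images/cropped_images_train_64", "masks/cropped_masks_train_64"):
        os.makedirs(os.path.join(root, sub))
        for name in ("b.jpg", "a.jpg"):
            open(os.path.join(root, sub, name), "w").close()
    pairs = load_pairs(root)
    names = [(os.path.basename(i), os.path.basename(m)) for i, m in pairs]

print(names)  # [('a.jpg', 'a.jpg'), ('b.jpg', 'b.jpg')]
```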
