[PAMI 2020] Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Last update: Nov 25, 2022

Related tags

Overview

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

This repository contains the source code for the paper Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation.

Abstract

We present an approach for jointly matching and segmenting object instances of the same category within a collection of images. In contrast to existing algorithms that tackle the tasks of semantic matching and object co-segmentation in isolation, our method exploits the complementary nature of the two tasks. The key insights of our method are two-fold. First, the estimated dense correspondence fields from semantic matching provide supervision for object co-segmentation by enforcing consistency between the predicted masks from a pair of images. Second, the predicted object masks from object co-segmentation in turn allow us to reduce the adverse effects due to background clutters for improving semantic matching. Our model is end-to-end trainable and does not require supervision from manually annotated correspondences and object masks. We validate the efficacy of our approach on five benchmark datasets: TSS, Internet, PF-PASCAL, PF-WILLOW, and SPair-71k, and show that our algorithm performs favorably against the state-of-the-art methods on both semantic matching and object co-segmentation tasks.

Citation

If you find our code useful, please consider citing our work using the following bibtex:

@article{MaCoSNet,
    title={Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation},
    author={Chen, Yun-Chun and Lin, Yen-Yu and Yang, Ming-Hsuan and Huang, Jia-Bin},
    journal={IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)},
    year={2020}
}

@inproceedings{WeakMatchNet,
  title={Deep Semantic Matching with Foreground Detection and Cycle-Consistency},
  author={Chen, Yun-Chun and Huang, Po-Hsiang and Yu, Li-Yu and Huang, Jia-Bin and Yang, Ming-Hsuan and Lin, Yen-Yu},
  booktitle={Asian Conference on Computer Vision (ACCV)},
  year={2018}
}

Environment

Install Anaconda Python3.7
This code is tested on NVIDIA V100 GPU with 16GB memory

pip install -r requirements.txt

Dataset

Please download the PF-PASCAL, PF-WILLOW, SPair-71k, TSS, and Internet datasets
Please modify the variable DATASET_DIR in config.py
Please modify the variable CSV_DIR in config.py

Training

You may determine which dataset to be the training set by changing the $DATASET variable in train.sh
You may change the $BATCH_SIZE variable in train.sh to a suitable value based on the GPU memory
The trained model will be saved under the trained_models folder

sh train.sh

Evaluation

You may determine which dataset to be evaluated by changing the $DATASET variable in eval.sh
You may change the $BATCH_SIZE variable in eval.sh to a suitable value based on the GPU memory

sh eval.sh

Acknowledgement

This code is heavily borrowed from Rocco et al.

[PAMI 2020] Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Related tags

Overview

Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Abstract

Citation

Environment

Dataset

Training

Evaluation

Acknowledgement

Owner

Yun-Chun Chen

Autonomous racing with the Anki Overdrive

Python package for multiple object tracking research with focus on laboratory animals tracking.

A flexible framework of neural networks for deep learning

Modular Gaussian Processes

DeLag: Detecting Latency Degradation Patterns in Service-based Systems

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

OpenVINO黑客松比赛项目

Python Library for Signal/Image Data Analysis with Transport Methods

CVAT is free, online, interactive video and image annotation tool for computer vision

Voice assistant - Voice assistant with python

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

Most popular metrics used to evaluate object detection algorithms.

TorchGeo is a PyTorch domain library, similar to torchvision, that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data.

A collection of resources on GAN Inversion.

Code for our paper A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization,

Deep Learning tutorials in jupyter notebooks.

Run PowerShell command without invoking powershell.exe

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region