[ICCV 2021] Official PyTorch implementation of Discriminative Region-based Multi-Label Zero-Shot Learning. SOTA results on NUS-WIDE and OpenImages.

Last update: Nov 21, 2022

Overview

Discriminative Region-based Multi-Label Zero-Shot Learning (ICCV 2021)

[arXiv][Project page >> coming soon]

Sanath Narayan^, Akshita Gupta^, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah

(^ denotes equal contribution)

Installation

The codebase is built on PyTorch 1.1.0 and was tested on Ubuntu 16.04 (Python 3.6, CUDA 9.0, cuDNN 7.5).

To install, follow these instructions:

conda create -n mlzsl python=3.6
conda activate mlzsl
conda install pytorch=1.1 torchvision=0.3 cudatoolkit=9.0 -c pytorch
pip install matplotlib scikit-image scikit-learn opencv-python yacs joblib natsort h5py tqdm pandas

Install the warmup scheduler:

cd pytorch-gradual-warmup-lr; python setup.py install; cd ..
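
Once the environment is set up, a quick import check can confirm that the packages are visible to the interpreter. The following is a minimal sketch and not part of the repository: the helper name `check_environment` is hypothetical, and the module names (for example `cv2` for opencv-python, `skimage` for scikit-image, `sklearn` for scikit-learn) are the usual import names for the pip packages listed above.

```python
import importlib

# Import names (not pip names) for the packages installed above.
REQUIRED_MODULES = ["torch", "torchvision", "matplotlib", "skimage", "sklearn",
                    "cv2", "yacs", "joblib", "natsort", "h5py", "tqdm", "pandas"]


def check_environment(modules=REQUIRED_MODULES):
    """Return {module_name: version string, or None if the import fails}."""
    report = {}
    for name in modules:
        try:
            module = importlib.import_module(name)
        except ImportError:
            report[name] = None
            continue
        # Not every package exposes __version__, so fall back to "unknown".
        report[name] = str(getattr(module, "__version__", "unknown"))
    return report


if __name__ == "__main__":
    for name, version in check_environment().items():
        print(f"{name:12s} {version if version is not None else 'MISSING'}")
```

Any module reported as MISSING points at an install step that did not complete in the active conda environment.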

Attention Visualization

Results


[Figures: our approach on the NUS-WIDE dataset; our approach on the OpenImages dataset.]

Training and Evaluation

NUS-WIDE

Step 1: Data preparation

Download the pre-computed features from here and store them in the features folder inside the BiAM/datasets/NUS-WIDE directory.
[Optional] You can extract the features yourself from the original NUS-WIDE dataset, available here, by running the script below:

python feature_extraction/extract_nus_wide.py
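
Before training, it can help to confirm that the features ended up where the scripts expect them. The snippet below is a small sketch under assumptions: `list_feature_files` is a hypothetical helper, the folder path is the one named in this step taken relative to the current directory, and no particular file name or extension is assumed for the feature files.

```python
from pathlib import Path


def list_feature_files(features_dir):
    """Return (relative path, size in bytes) for every file under features_dir."""
    root = Path(features_dir)
    if not root.is_dir():
        raise FileNotFoundError(f"Feature folder not found: {root}")
    files = sorted(p for p in root.rglob("*") if p.is_file())
    return [(str(p.relative_to(root)), p.stat().st_size) for p in files]


if __name__ == "__main__":
    # Path taken from the step above; adjust if the repository lives elsewhere.
    try:
        for name, size in list_feature_files("BiAM/datasets/NUS-WIDE/features"):
            print(f"{size / 1e6:10.1f} MB  {name}")
    except FileNotFoundError as err:
        print(err)
```

An empty listing or a file of only a few bytes usually indicates an interrupted download.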

Step 2: Training from scratch

To train and evaluate the multi-label zero-shot learning model on the full NUS-WIDE dataset, run:

sh scripts/train_nus.sh

Step 3: Evaluation using pretrained weights

To evaluate the multi-label zero-shot model on NUS-WIDE, download the pretrained weights from here, store them in the NUS-WIDE folder inside the pretrained_weights directory, and run:

sh scripts/evaluate_nus.sh

OPEN-IMAGES

Step 1: Data preparation

Download the annotations for training, validation, and testing into this folder.
Store the annotations inside BiAM/datasets/OpenImages.
To extract features for the OpenImages-v4 dataset, run the scripts below to crawl the images and then extract their features:

## Crawl the images from the web
python ./datasets/OpenImages/download_imgs.py  # with `data_set` == `train`: downloads images into `./image_data/train/`
python ./datasets/OpenImages/download_imgs.py  # with `data_set` == `validation`: downloads images into `./image_data/validation/`
python ./datasets/OpenImages/download_imgs.py  # with `data_set` == `test`: downloads images into `./image_data/test/`

## Run the feature extraction scripts for all three splits
python feature_extraction/extract_openimages_train.py
python feature_extraction/extract_openimages_test.py
python feature_extraction/extract_openimages_val.py
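
Crawling images from the web can fail partway, so it may be worth counting what was downloaded for each split before running feature extraction. This is an illustrative sketch only: `count_images` is a hypothetical helper, the split folder names come from the comments above, and the image extensions and the location of `image_data` relative to the working directory are assumptions.

```python
from pathlib import Path

SPLITS = ("train", "validation", "test")  # split names from the comments above


def count_images(image_root, splits=SPLITS, extensions=(".jpg", ".jpeg", ".png")):
    """Return {split: number of image files found directly in image_root/split}."""
    counts = {}
    for split in splits:
        split_dir = Path(image_root) / split
        if not split_dir.is_dir():
            counts[split] = 0  # split folder does not exist yet
            continue
        counts[split] = sum(
            1 for p in split_dir.iterdir()
            if p.is_file() and p.suffix.lower() in extensions
        )
    return counts


if __name__ == "__main__":
    for split, n in count_images("./image_data").items():
        print(f"{split:12s} {n} images")
```

A split that reports zero images has not been downloaded, or was downloaded to a different location.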

Step 2: Training from scratch

To train and evaluate the multi-label zero-shot learning model on the full OpenImages-v4 dataset, run:

sh scripts/train_openimages.sh
sh scripts/evaluate_openimages.sh
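
When the two scripts are launched unattended, evaluation should only start if training exited cleanly. The sketch below shows one way to chain them from Python; `run_in_order` is a hypothetical helper, not something the repository provides, and it relies only on `subprocess.run` returning an object with a `returncode`.

```python
import subprocess


def run_in_order(commands, runner=subprocess.run):
    """Run each command in turn; return the first failing command, or None."""
    for command in commands:
        result = runner(command)
        if result.returncode != 0:
            return command  # stop here so later steps do not run
    return None


# Example call, using the script names from the step above:
# failed = run_in_order([
#     ["sh", "scripts/train_openimages.sh"],
#     ["sh", "scripts/evaluate_openimages.sh"],
# ])
# if failed is not None:
#     print("Stopped at:", " ".join(failed))
```

The `runner` parameter exists only so the helper can be exercised without launching real processes.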

Step 3: Evaluation using pretrained weights

To evaluate the multi-label zero-shot model on OpenImages, download the pretrained weights from here, store them in the OPENIMAGES folder inside the pretrained_weights directory, and run:

sh scripts/evaluate_openimages.sh
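
The evaluation steps for both datasets expect the weights in a dataset-specific folder under pretrained_weights. As a rough sketch, the check below reports which of those folders is absent or empty; `missing_weight_folders` is a hypothetical helper, and since the checkpoint file names are not stated here, it only looks for the presence of any file.

```python
from pathlib import Path

# Folder names as given in the evaluation steps for each dataset.
WEIGHT_FOLDERS = {"NUS-WIDE": "NUS-WIDE", "OpenImages": "OPENIMAGES"}


def missing_weight_folders(weights_root="pretrained_weights", folders=WEIGHT_FOLDERS):
    """Return dataset names whose weight folder is absent or contains no files."""
    missing = []
    for dataset, folder in folders.items():
        path = Path(weights_root) / folder
        has_files = path.is_dir() and any(p.is_file() for p in path.iterdir())
        if not has_files:
            missing.append(dataset)
    return missing


if __name__ == "__main__":
    for dataset in missing_weight_folders():
        print(f"No pretrained weights found for {dataset}")
```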

License

This repository is released under the Apache 2.0 license as found in the LICENSE file.

Citation

If you find this repository useful, please consider giving a star ⭐ and citation 🎊 :

@article{narayan2021discriminative,
title={Discriminative Region-based Multi-Label Zero-Shot Learning},
author={Narayan, Sanath and Gupta, Akshita and Khan, Salman and  Khan, Fahad Shahbaz and Shao, Ling and Shah, Mubarak},
journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
publisher = {IEEE},
year={2021}
}

Contact

Should you have any questions, please contact 📧 [email protected]

