Learning Open-World Object Proposals without Learning to Classify

Last update: Dec 22, 2022

Overview

Learning Open-World Object Proposals without Learning to Classify

Pytorch implementation for "Learning Open-World Object Proposals without Learning to Classify" (arXiv 2021)

Dahun Kim, Tsung-Yi Lin, Anelia Angelova, In So Kweon, and Weicheng Kuo.

@article{kim2021oln,
  title={Learning Open-World Object Proposals without Learning to Classify},
  author={Kim, Dahun and Lin, Tsung-Yi and Angelova, Anelia and Kweon, In So and Kuo, Weicheng},
  journal={arXiv preprint arXiv:2108.06753},
  year={2021}
}

Introduction

Humans can recognize novel objects in this image despite having never seen them before. “Is it possible to learn open-world (novel) object proposals?” In this paper we propose Object Localization Network (OLN) that learns localization cues instead of foreground vs background classification. Only trained on COCO, OLN is able to propose many novel objects (top) missed by Mask R-CNN (bottom) on an out-of-sample frame in an ego-centric video.

Cross-category generalization on COCO

We train OLN on COCO VOC categories, and test on non-VOC categories. Note our [email protected] evaluation does not count those proposals on the 'seen' classes into the budget (k), to avoid evaluating recall on see-class objects.

Method	AUC	[email protected]	[email protected]	[email protected]	[email protected]	[email protected]	Download
OLN-Box	24.8	18.0	26.4	33.4	39.0	45.0	model

Disclaimer

This repo is tested under Python 3.7, PyTorch 1.7.0, Cuda 11.0, and mmcv==1.2.5.

Installation

This repo is built based on mmdetection.

You can use following commands to create conda env with related dependencies.

conda create -n oln python=3.7 -y
conda activate oln
conda install pytorch=1.7.0 torchvision cudatoolkit=11.0 -c pytorch -y
pip install mmcv-full
pip install -r requirements.txt
pip install -v -e .

Please also refer to get_started.md for more details of installation.

Prepare datasets

COCO dataset is available from official websites. It is recommended to download and extract the dataset somewhere outside the project directory and symlink the dataset root to $OLN/data as below.

object_localization_network
├── mmdet
├── tools
├── configs
├── data
│   ├── coco
│   │   ├── annotations
│   │   ├── train2017
│   │   ├── val2017
│   │   ├── test2017

Testing

Our trained models are available for download here. Place it under trained_weights/latest.pth and run the following commands to test OLN on COCO dataset.

# Multi-GPU distributed testing
bash tools/dist_test_bbox.sh configs/oln_box/oln_box.py \
trained_weights/latest.pth ${NUM_GPUS}
# OR
python tools/test.py configs/oln_box/oln_box.py work_dirs/oln_box/latest.pth --eval bbox

Training

# Multi-GPU distributed training
bash tools/dist_train.sh configs/oln_box/oln_box.py ${NUM_GPUS}

Contact

If you have any questions regarding the repo, please contact Dahun Kim ([email protected]) or create an issue.

Learning Open-World Object Proposals without Learning to Classify

Related tags

Overview

Learning Open-World Object Proposals without Learning to Classify

Pytorch implementation for "Learning Open-World Object Proposals without Learning to Classify" (arXiv 2021)

Introduction

Cross-category generalization on COCO

Disclaimer

Installation

Prepare datasets

Testing

Training

Contact

Owner

Dahun Kim

This repo is customed for VisDrone.

Official Implementation of LARGE: Latent-Based Regression through GAN Semantics

Kaggle DSTL Satellite Imagery Feature Detection

Code for Paper "Evidential Softmax for Sparse MultimodalDistributions in Deep Generative Models"

Self-driving car env with PPO algorithm from stable baseline3

Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds (CVPR 2022)

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities.

Poplar implementation of "Bundle Adjustment on a Graph Processor" (CVPR 2020)

Out-of-Distribution Generalization of Chest X-ray Using Risk Extrapolation

Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.

Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)

Dungeons and Dragons randomized content generator

An pytorch implementation of Masked Autoencoders Are Scalable Vision Learners

A deep neural networks for images using CNN algorithm.

source code for https://arxiv.org/abs/2005.11248 "Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics"

Kaggle Lyft Motion Prediction for Autonomous Vehicles 4th place solution

The Dual Memory is build from a simple CNN for the deep memory and Linear Regression fro the fast Memory

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

🔥3D-RecGAN in Tensorflow (ICCV Workshops 2017)