Large-Scale Unsupervised Object Discovery

Last update: Sep 19, 2022

Huy V. Vo, Elena Sizikova, Cordelia Schmid, Patrick Pérez, Jean Ponce [PDF]

We propose a novel ranking-based large-scale unsupervised object discovery algorithm that scales up to 1.7M images.
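This README does not describe the ranking step itself, so the following is only a generic illustration of what "ranking-based" can mean on a graph of candidate regions: a PageRank-style power iteration over a toy similarity matrix. The function name `pagerank`, the damping value, and the similarity numbers are all assumptions made for this sketch; it is not the repository's implementation and makes no claim about how LOD builds its graph or scores.

```python
def pagerank(weights, damping=0.85, tol=1e-10, max_iter=1000):
    """Power-iteration PageRank on a dense, non-negative weight matrix.

    weights[i][j] is a hypothetical similarity score on the edge i -> j.
    Returns a list of scores that sums to 1.
    """
    n = len(weights)
    out_sums = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(max_iter):
        # Score mass held by nodes without outgoing edges is spread uniformly.
        dangling = sum(scores[i] for i in range(n) if out_sums[i] == 0)
        new_scores = []
        for j in range(n):
            # Each node i passes its score to j in proportion to the edge weight.
            incoming = sum(
                scores[i] * weights[i][j] / out_sums[i]
                for i in range(n)
                if out_sums[i] > 0
            )
            new_scores.append(
                (1 - damping) / n + damping * (incoming + dangling / n)
            )
        delta = sum(abs(a - b) for a, b in zip(new_scores, scores))
        scores = new_scores
        if delta < tol:
            break
    return scores


# Toy symmetric similarity graph over four candidate regions (made-up numbers).
similarity = [
    [0.0, 0.9, 0.8, 0.0],
    [0.9, 0.0, 0.7, 0.1],
    [0.8, 0.7, 0.0, 0.0],
    [0.0, 0.1, 0.0, 0.0],
]
scores = pagerank(similarity)
# Indices of the regions sorted from highest to lowest score.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In this toy graph the fourth region is only weakly connected to the others, so it ends up last in `ranking`; the strongly connected regions receive most of the score mass.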

This repository contains code used in the paper.

Quantitative Results

Installation

Follow INSTALL.md and DATA.md to install LOD and prepare data for running it.

Run LOD on a small toy dataset

Follow GETTING_STARTED_small_dataset.md to run LOD with VGG16 features on a small subset of 60 images from the Pascal VOC2007 dataset.

Getting Started

Follow GETTING_STARTED.md to run LOD with VGG16 features on the C20K dataset, or GETTING_STARTED_OBOW.md to run it with VGG16-based OBoW features on the same dataset.

Citations

@inproceedings{Vo21LOD,
  title     = {Large-Scale Unsupervised Object Discovery},
  author    = {Vo, Huy V. and Sizikova, Elena and Schmid, Cordelia and
               P{\'e}rez, Patrick and Ponce, Jean},
  booktitle = {Advances in Neural Information Processing Systems 34 (NeurIPS 2021)},
  year      = {2021},
}

Acknowledgments

This work was supported in part by the Inria/NYU collaboration, the Louis Vuitton/ENS chair on artificial intelligence, and the French government under the management of Agence Nationale de la Recherche as part of the “Investissements d’avenir” program, reference ANR-19-P3IA-0001 (PRAIRIE 3IA Institute). Elena Sizikova was supported by the Moore-Sloan Data Science Environment initiative (funded by the Alfred P. Sloan Foundation and the Gordon and Betty Moore Foundation) through the NYU Center for Data Science. Huy V. Vo was supported in part by a Valeo/Prairie CIFRE PhD Fellowship.

License

This project is licensed under the MIT License - see the LICENSE file for details.
