Active Learning at the ImageNet Scale

This repo contains code for the paper Active Learning at the ImageNet Scale by Zeyad Emam*, Hong-Min Chu*, Ping-Yeh Chiang*, Wojtek Czaja, Richard Leapman, Micah Goldblum, and Tom Goldstein.

Requirements

pip install -r requirements.txt

Comet and Logging

This project uses Comet ML to log all experiments, you must install comet_ml (included in requirements.txt), however, the code does not require the user to have a Comet ML account or to enable comet logging at all. If you choose to use comet ML, then you should include your API key in your home directory ~/.comet.config (more on this in the Comet ML documentation). To use comet make sure the use the flag --enable_comet.

Logs and network weights are stored according to the command line arguments --log_dir and --ckpt_path.

Loading SSP checkpoints

Self-supervised pretrained checkpoints must be obtained separately and specified in ./src/arg_pools for each argpool, under the key "init_pretrained_ckpt_path". To access the checkpoints used in our experiments, please use the following links:

Sample Commands to Reproduce the Results in the Paper

Each Imagenet experiment was conducted on a cluster node with a single V100-SXM2 GPU (32GB VRAM), 64gb of RAM, and 16 2.3 GHz Intel Gold 6140 cpus. If more than one gpu are available on the node, the code will automatically distribute batches across all gpus using DistributedDataParallel training.

Below is a sample command for running an experiment. The full list of command line arguments can be found in src/utils/parser.py.

python main_al.py --dataset_dir 
   
     --exp_name RandomSampler_arg_ssp_linear_evaluation_imagenet_b10000 --dataset imagenet --arg_pool ssp_linear_evaluation --model SSLResNet50 --strategy RandomSampler --rounds 8 --round_budget 10000 --init_pool_size 30000 --subset_labeled 50000 --subset_unlabeled 80000 --freeze_feature --partitions 10 --init_pool_type random

The full list of commands to reproduce all plots in the paper can be obtained by running python src/gen_jobs.py.

Code for Active Learning at The ImageNet Scale.

Related tags

Overview

Active Learning at the ImageNet Scale

Requirements

Comet and Logging

Loading SSP checkpoints

Sample Commands to Reproduce the Results in the Paper

Owner

Zeyad Emam

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.

PyG (PyTorch Geometric) - A library built upon PyTorch to easily write and train Graph Neural Networks (GNNs)

An inofficial PyTorch implementation of PREDATOR based on KPConv.

The source code for Adaptive Kernel Graph Neural Network at AAAI2022

Latex code for making neural networks diagrams

git《Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser》(2021) GitHub: [fig5]

《Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching》(CVPR 2020)

Type4Py: Deep Similarity Learning-Based Type Inference for Python

Fiddle is a Python-first configuration library particularly well suited to ML applications.

Asynchronous Advantage Actor-Critic in PyTorch

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

Repositorio de los Laboratorios de Análisis Numérico / Análisis Numérico I de FAMAF, UNC.

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Reading Group @mila-iqia on Computational Optimal Transport for Machine Learning Applications

Style transfer between images was performed using the VGG19 model

Software for Multimodalty 2D+3D Facial Expression Recognition (FER) UI

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.