Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Last update: Jan 15, 2022


This is the code for our NeurIPS 2021 paper "Understanding the Generalization Benefit of Model Invariance from a Data Perspective". The code has two major parts: sample covering number estimation and generalization benefit evaluation.

Requirements

Python 3.8
PyTorch
torchvision
scikit-learn-extra
scipy
robustness package (already included in our code)

Our code is based on the robustness package.

Dataset

CIFAR-10: Download and extract the data into /data/cifar10.
R2N2: Download the ShapeNet rendered images and put the data into /data/r2n2.

The randomly sampled R2N2 images used for computing sample covering numbers, and the indices of the examples used for each sample size, can be found here.

Estimation of sample covering numbers

To estimate the sample covering numbers of different data transformations, run the following script in /scn.

CUDA_VISIBLE_DEVICES=0 python run_scn.py  --epsilon 3 --transformation crop --cover_number_method fast --data-path /path/to/dataset

Note that the input is an N x C x H x W tensor, where N is the sample size.
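For intuition about what a covering number estimate measures, here is a minimal sketch in plain Python. It is not the method implemented in run_scn.py: it assumes a simple greedy cover, a Euclidean base distance, and a hypothetical `orbit_distance` that treats two points as close when some transformed copy of one lies near the other. All function names here are illustrative.

```python
import math

def euclidean(a, b):
    """Plain Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def orbit_distance(a, b, transforms):
    """Smallest distance between b and any transformed copy of a (assumed metric)."""
    return min(euclidean(t(a), b) for t in transforms)

def greedy_cover_size(points, epsilon, dist):
    """Greedily pick centers so every point is within epsilon of some center.

    The number of centers is an upper bound on the epsilon-covering number
    of the sample when centers are restricted to sample points.
    """
    centers = []
    for p in points:
        # p becomes a new center only if no existing center covers it.
        if all(dist(c, p) > epsilon for c in centers):
            centers.append(p)
    return len(centers)

def identity(v):
    return v

def flip(v):
    # Toy stand-in for a horizontal flip: reverse the vector.
    return v[::-1]

points = [(0.0, 1.0), (1.0, 0.0), (5.0, 5.0), (0.1, 1.0)]

plain = greedy_cover_size(points, 0.5, euclidean)
with_flip = greedy_cover_size(
    points, 0.5, lambda a, b: orbit_distance(a, b, [identity, flip])
)
print(plain, with_flip)
```

In this toy sample, allowing the flip merges (0.0, 1.0) and (1.0, 0.0) into one cell, so fewer centers are needed. That is the sense in which a transformation can shrink the covering number of a sample.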

Evaluation of generalization benefit

To train the model with the data augmentation method, run the following script in /learn_invariance for the R2N2 dataset:

CUDA_VISIBLE_DEVICES=0 python main.py \
    --dataset r2n2 \
    --data ../data/r2n2/ShapeNetRendering \
    --metainfo-path ../data/r2n2/metainfo_all.json \
    --transforms view  \
    --inv-method aug \
    --out-dir /path/to/out_dir \
    --arch resnet18 --epoch 110 --lr 1e-2 --step-lr 50 \
    --workers 30 --batch-size 128 --exp-name view

or the following script for the CIFAR-10 dataset:

CUDA_VISIBLE_DEVICES=0 python main.py \
    --dataset cifar \
    --data ../data/cifar10 \
    --n-per-class all \
    --transforms crop  \
    --inv-method aug \
    --out-dir /path/to/out_dir \
    --arch resnet18 --epoch 110 --lr 1e-2 --step-lr 50 \
    --workers 30 --batch-size 128 --exp-name crop

Set --transforms to one of {none, flip, crop, rotate, view} to select the transformation to use.

To train the model with the regularization method, run the following script. Currently, the code only supports the 3D-view transformation on the R2N2 dataset.

CUDA_VISIBLE_DEVICES=0 python main.py \
    --dataset r2n2 \
    --data ../data/r2n2/ShapeNetRendering \
    --metainfo-path ../data/r2n2/metainfo_all.json \
    --transforms view  \
    --inv-method reg \
    --inv-method-beta 1 \
    --out-dir /path/to/out_dir \
    --arch resnet18 --epoch 110 --lr 1e-2 --step-lr 50 \
    --workers 30 --batch-size 128 --exp-name reg_view
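This README does not spell out the penalty used by --inv-method reg, so the following is only a sketch of one common form of invariance regularization. It assumes the objective is a cross-entropy term plus a KL divergence between the predictions on an image and on a transformed view of it, with a weight `beta` playing the role of --inv-method-beta. The function names are hypothetical and are not taken from the repository.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    """Negative log-probability assigned to the true class."""
    return -math.log(probs[label])

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with q > 0 everywhere."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def regularized_loss(logits_orig, logits_view, label, beta):
    """Classification loss plus beta times an (assumed) invariance penalty."""
    p = softmax(logits_orig)   # prediction on the original image
    q = softmax(logits_view)   # prediction on the transformed view
    return cross_entropy(p, label) + beta * kl_divergence(p, q)

# Identical predictions on both views: the penalty term vanishes.
same = regularized_loss([2.0, 0.5, -1.0], [2.0, 0.5, -1.0], label=0, beta=1.0)

# Predictions that change under the transformation are penalized.
diff = regularized_loss([2.0, 0.5, -1.0], [-1.0, 0.5, 2.0], label=0, beta=1.0)

print(same, diff)
```

With `beta` set to 0 the penalty vanishes and only the classification term remains; larger values trade classification fit for consistency across views.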

To evaluate the model's invariance loss and worst-case consistency accuracy, run the following script.

CUDA_VISIBLE_DEVICES=0 python main.py  \
    --dataset r2n2 \
    --data ../data/r2n2/ShapeNetRendering \
    --metainfo-path ../data/r2n2/metainfo_all.json \
    --inv-method reg \
    --arch resnet18 \
    --resume /path/to/checkpoint.pt.best \
    --eval-only 1 \
    --transforms view  \
    --adv-eval 0 \
    --batch-size 2  \
    --no-store

Note that computing the worst-case consistency accuracy requires loading 24 view images in the R2N2RenderingsTorch class in dataset_3d.py.
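To illustrate why every view has to be loaded, here is a minimal sketch of a worst-case consistency metric. It assumes an example counts as correct only when the prediction on each of its views matches the label; the repository's exact definition may differ. The function names and the toy data (three views per example instead of 24) are illustrative.

```python
def worst_case_consistency_accuracy(view_predictions, labels):
    """Fraction of examples predicted correctly on every one of their views."""
    if len(view_predictions) != len(labels):
        raise ValueError("need one list of view predictions per label")
    hits = sum(
        all(pred == label for pred in preds)
        for preds, label in zip(view_predictions, labels)
    )
    return hits / len(labels)

def average_accuracy(view_predictions, labels):
    """Ordinary accuracy over all (example, view) pairs, for comparison."""
    total = sum(len(preds) for preds in view_predictions)
    hits = sum(
        pred == label
        for preds, label in zip(view_predictions, labels)
        for pred in preds
    )
    return hits / total

# Toy data: 4 examples, 3 views each.
view_predictions = [
    [0, 0, 0],  # correct on all views
    [1, 2, 1],  # one view misclassified
    [2, 2, 2],  # correct on all views
    [0, 1, 1],  # one view misclassified
]
labels = [0, 1, 2, 1]

worst = worst_case_consistency_accuracy(view_predictions, labels)
avg = average_accuracy(view_predictions, labels)
print(worst, avg)
```

A single misclassified view is enough to mark an example as wrong, so the worst-case figure is never higher than the per-view average.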
