This project is the PyTorch implementation of our CVPR 2022 paper:

Last update: Nov 29, 2022

Related tags

Overview

Requirements and Dependency

Install PyTorch with CUDA (for GPU). (Experiments are validated on python 3.8.11 and pytorch 1.7.0)
(For visualization if needed), install the dependency visdom by:

pip install visdom

===========================================================================

Note: I noted the pre-trained models are requested, here is the link of ResNet-XBNBlock-standard_train and ResNet-XBNBlock-advanced_train. I will upload our pre-trained ResNeXt model, if I can access the machine in my lab. (work at home due to COVID-19)

===========================================================================

Experiments

Here, we provide the code for reproducing the main experiments on ImageNet datasets.

1. Prepare the dataset:

Download the ImageNet-1K datasets, and put it in the dir: ./data/imageNet/ or you can specify your datapath by changing --dataset-root=/your-data-path

2. Run scripts of experiments:

We provide the scripts in ./experiments/, including the experiments on the ResNet, ResNeXt, Mobilenet-V2 and ShuffleNet-V2 .

This project is the PyTorch implementation of our CVPR 2022 paper:

Related tags

Overview

Requirements and Dependency

Experiments

1. Prepare the dataset:

2. Run scripts of experiments:

Owner

Lei Huang

Adversarially Learned Inference

SPT_LSA_ViT - Implementation for Visual Transformer for Small-size Datasets

Implementation of Google Brain's WaveGrad high-fidelity vocoder

gym-anm is a framework for designing reinforcement learning (RL) environments that model Active Network Management (ANM) tasks in electricity distribution networks.

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》

Implementations of polygamma, lgamma, and beta functions for PyTorch

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

PanopticBEV - Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

68 keypoint annotations for COFW test data

Combining Latent Space and Structured Kernels for Bayesian Optimization over Combinatorial Spaces

Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

Object classification with basic computer vision techniques

Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral

ObjDetApp deploys a pytorch model for object detection

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

Architecture Patterns with Python (TDD, DDD, EDM)

An SE(3)-invariant autoencoder for generating the periodic structure of materials

Code to reproduce the experiments from our NeurIPS 2021 paper " The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective"

This project is the PyTorch implementation of our CVPR 2022 paper:

Related tags

Overview

Requirements and Dependency

Experiments

1. Prepare the dataset:

2. Run scripts of experiments:

Owner

Lei Huang

Adversarially Learned Inference

SPT_LSA_ViT - Implementation for Visual Transformer for Small-size Datasets

Implementation of Google Brain's WaveGrad high-fidelity vocoder

gym-anm is a framework for designing reinforcement learning (RL) environments that model Active Network Management (ANM) tasks in electricity distribution networks.

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》

Implementations of polygamma, lgamma, and beta functions for PyTorch

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

PanopticBEV - Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

68 keypoint annotations for COFW test data

Combining Latent Space and Structured Kernels for Bayesian Optimization over Combinatorial Spaces

Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

Object classification with basic computer vision techniques

Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral

*ObjDetApp* deploys a pytorch model for object detection

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

Architecture Patterns with Python (TDD, DDD, EDM)

An SE(3)-invariant autoencoder for generating the periodic structure of materials

Code to reproduce the experiments from our NeurIPS 2021 paper " The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective"

ObjDetApp deploys a pytorch model for object detection