A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Last update: Dec 26, 2022

Overview

CLIP4CMR

The original data and pre-computed CLIP features are available here. The train.pkl and test.pkl files contain image pixel features and text id features; the clip_train.pkl and clip_test.pkl files contain 1024-dimensional image and text features.
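The internal layout of these pickle files is not described here, so the following is only a minimal sketch for inspecting one after download. The helper names `load_pickle` and `summarize` are hypothetical and not part of the repository; the code assumes nothing about keys or array types and simply reports what it finds. If the files store NumPy arrays or PyTorch tensors, the corresponding library must be installed for unpickling to succeed.

```python
import pickle
from pathlib import Path


def load_pickle(path):
    """Load a single pickled object from `path` (only use with trusted files)."""
    with Path(path).open("rb") as f:
        return pickle.load(f)


def summarize(obj, depth=0, max_depth=2):
    """Return a short description of an object whose structure is unknown."""
    # Arrays and tensors expose a `shape` attribute; report it if present.
    shape = getattr(obj, "shape", None)
    if shape is not None:
        return f"{type(obj).__name__} shape={tuple(shape)}"
    # Descend into dictionaries up to `max_depth` levels.
    if isinstance(obj, dict) and depth < max_depth:
        inner = ", ".join(
            f"{k!r}: {summarize(v, depth + 1, max_depth)}" for k, v in obj.items()
        )
        return "dict{" + inner + "}"
    if isinstance(obj, (list, tuple)):
        return f"{type(obj).__name__} len={len(obj)}"
    return type(obj).__name__


if __name__ == "__main__":
    # File names are taken from the description above; paths are assumed local.
    for name in ("clip_train.pkl", "clip_test.pkl"):
        if Path(name).exists():
            print(name, "->", summarize(load_pickle(name)))
```

Because `pickle.load` can execute arbitrary code, it is worth running this only on files obtained from the official download location.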

VSR-Transformer - This paper proposes a new Transformer for video super-resolution (called VSR-Transformer).

VSR-Transformer By Jiezhang Cao, Yawei Li, Kai Zhang, Luc Van Gool This paper proposes a new Transformer for video super-resolution (called VSR-Transf

225 Nov 13, 2022

PyTorch implementation of DUL (Data Uncertainty Learning in Face Recognition, CVPR2020)

20 Nov 15, 2022

Pytorch implementation of paper "Learning Co-segmentation by Segment Swapping for Retrieval and Discovery"

SegSwap Pytorch implementation of paper "Learning Co-segmentation by Segment Swapping for Retrieval and Discovery" [PDF] [Project page] If our project

41 Dec 10, 2022

This is the repository for paper NEEDLE: Towards Non-invertible Backdoor Attack to Deep Learning Models.

1 Oct 25, 2021

GEP (GDB Enhanced Prompt) - a GDB plug-in for GDB command prompt with fzf history search, fish-like autosuggestions, auto-completion with floating window, partial string matching in history, and more!

GEP (GDB Enhanced Prompt) GEP (GDB Enhanced Prompt) is a GDB plug-in which makes your GDB command prompt more convenient and flexible. Why I need th

23 Dec 21, 2022

CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images

CurriculumNet Introduction This repo contains related code and models from the ECCV 2018 CurriculumNet paper. CurriculumNet is a new training strategy

156 Jul 04, 2022

The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

Balloon Learning Environment Docs The Balloon Learning Environment (BLE) is a simulator for stratospheric balloons. It is designed as a benchmark envi

87 Dec 25, 2022

Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

U-GAT-IT — Official PyTorch Implementation : Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Imag

2.4k Jan 04, 2023

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)

StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation This repository contains the official PyTorch implementation of the following

270 Dec 30, 2022

This is the official implementation of TrivialAugment and a mini-library for the application of multiple image augmentation strategies including RandAugment and TrivialAugment.

Trivial Augment This is the official implementation of TrivialAugment (https://arxiv.org/abs/2103.10158), as was used for the paper. TrivialAugment is

94 Dec 30, 2022

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

TextWorld A text-based game generator and extensible sandbox learning environment for training and testing reinforcement learning (RL) agents. Also ch

983 Dec 23, 2022

Official implementation for NIPS'17 paper: PredRNN: Recurrent Neural Networks for Predictive Learning Using Spatiotemporal LSTMs.

PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning The predictive learning of spatiotemporal sequences aims to generate future

243 Dec 26, 2022

Implementation for paper LadderNet: Multi-path networks based on U-Net for medical image segmentation

Example of building CycleGAN with PyTorch

Audio2Face - Audio To Face With Python

A Python Package for Convex Regression and Frontier Estimation

RLHive: a framework designed to facilitate research in reinforcement learning.

MG-GCN: Scalable Multi-GPU GCN Training Framework

Image Segmentation Evaluation

Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)
