CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Last update: Jan 07, 2023

Overview

CLIP (Contrastive Language–Image Pre-training)
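As a rough illustration of what "contrastive" means here, the sketch below computes the symmetric cross-entropy loss described in the CLIP paper over a small batch of paired embeddings. It is written in plain Python for readability and is not this repository's training code; the function names and the fixed `temperature` value are assumptions made for the example (the paper learns the temperature during training).

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def cross_entropy(logits, target):
    """Cross-entropy of one row of logits against the index of the correct entry."""
    peak = max(logits)  # subtract the max for numerical stability
    log_sum_exp = peak + math.log(sum(math.exp(value - peak) for value in logits))
    return log_sum_exp - logits[target]


def clip_contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric contrastive loss: image i and text i form the positive pair."""
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    n = len(images)
    # logits[i][j] = cosine similarity of image i and text j, divided by temperature
    logits = [[sum(a * b for a, b in zip(img, txt)) / temperature for txt in texts]
              for img in images]
    # image -> text direction: each row should pick its own column
    loss_i2t = sum(cross_entropy(logits[i], i) for i in range(n)) / n
    # text -> image direction: each column should pick its own row
    loss_t2i = sum(
        cross_entropy([logits[i][j] for i in range(n)], j) for j in range(n)
    ) / n
    return (loss_i2t + loss_t2i) / 2


# Toy batch: correctly paired captions versus the same captions swapped.
matched = clip_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[2.0, 0.1], [0.1, 3.0]])
shuffled = clip_contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.1, 3.0], [2.0, 0.1]])
print(matched < shuffled)
```

The loss is low when each image is most similar to its own caption and grows when the pairing is broken, which is the signal that pulls matching image and text embeddings together.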

Experiments (Evaluation)

Model	Dataset	Accuracy (%)
ViT-B/32 (Paper)	CIFAR100	65.1
ViT-B/32 (Ours)	CIFAR100	61.71
ViT-B/32 (Paper)	CIFAR10	91.3
ViT-B/32 (Ours)	CIFAR10	88.8
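For orientation, the sketch below shows how a top-1 accuracy like the ones in the table could be computed, assuming the evaluation follows the paper's zero-shot protocol (the table does not state this explicitly): each image embedding is compared against one text embedding per class, and the most similar class is taken as the prediction. The toy embeddings, prompts, and function names are hypothetical; the actual `evaluation.py` may differ.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def zero_shot_predict(image_emb, class_text_embs):
    """Return the index of the class whose text embedding is most similar."""
    scores = [cosine_similarity(image_emb, text_emb) for text_emb in class_text_embs]
    return max(range(len(scores)), key=scores.__getitem__)


def top1_accuracy(image_embs, labels, class_text_embs):
    """Percentage of images whose predicted class matches the label."""
    correct = sum(
        zero_shot_predict(emb, class_text_embs) == label
        for emb, label in zip(image_embs, labels)
    )
    return 100.0 * correct / len(labels)


# Toy stand-ins for encoder outputs: one text embedding per class prompt,
# e.g. "a photo of a cat" and "a photo of a dog" (hypothetical prompts).
class_text_embs = [[1.0, 0.2], [0.1, 1.0]]
image_embs = [[0.9, 0.1], [0.2, 0.8], [0.7, 0.6], [0.3, 0.9]]
labels = [0, 1, 1, 1]

accuracy = top1_accuracy(image_embs, labels, class_text_embs)
print(f"{accuracy:.2f}")
```

In a real run the embeddings would come from the image and text encoders, and the loop would cover the full CIFAR10 or CIFAR100 test split.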

Training

Work in progress

Usage

Evaluation

python evaluation.py --dataset CIFAR100 --cuda True

Arguments
- dataset (str): CIFAR10 or CIFAR100 (default: CIFAR100)
- num_workers (int): default: 0
- batch_size (int): default: 128
- cuda (bool): default: False

Training

- Prepare Data
  - Visual Genome dataset
  - Download the images and region descriptions
- Train
```
python main.py --base_dir ./ --cuda True
```
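Both commands above pass `--cuda True` as a literal string. If the scripts use `argparse`, note that `type=bool` would treat any non-empty string (including `"False"`) as true, so a small converter is a common workaround. The sketch below is one hypothetical way to parse the evaluation arguments listed above with their documented defaults; `str2bool` and `build_parser` are illustrative names, not functions confirmed to exist in this repository.

```python
import argparse


def str2bool(value):
    """Convert strings such as 'True' or 'false' to a bool for argparse."""
    if isinstance(value, bool):
        return value
    lowered = value.strip().lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser():
    """Parser mirroring the documented evaluation arguments and defaults."""
    parser = argparse.ArgumentParser(description="Evaluation (illustrative sketch)")
    parser.add_argument("--dataset", type=str, default="CIFAR100",
                        choices=["CIFAR10", "CIFAR100"])
    parser.add_argument("--num_workers", type=int, default=0)
    parser.add_argument("--batch_size", type=int, default=128)
    parser.add_argument("--cuda", type=str2bool, default=False)
    return parser


# Parse the flags from the evaluation command shown in this README.
args = build_parser().parse_args(["--dataset", "CIFAR100", "--cuda", "True"])
print(args.dataset, args.batch_size, args.cuda)
```

With a converter like this, `--cuda False` yields `False` rather than a truthy string, and an unrecognized value produces a clear error message.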

Reference

Paper: Learning Transferable Visual Models From Natural Language Supervision
Authors: Alec Radford, Jong Wook Kim, Chris Hallacy, Girish Sastry, Amanda Askell, Pamela Mishkin, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Jack Clark, Gretchen Krueger, Ilya Sutskever
OpenAI

Owner

Myeongjun Kim

Computer Vision Research using Deep Learning
