This repository contains a CBIR system that uses swin transformer to extract image's feature.

Last update: Nov 17, 2022

Related tags

Overview

Swin-transformer based CBIR

This repository contains a CBIR(content-based image retrieval) system. Here we use Swin-transformer to extract query image's feature, and retrieve similar ones from image database. Notably, our program achieves intelligent user interaction, including selecting an image by opening explorer dialog and cropping interested region by drafting mouse.

Structure

SWIN_CBIR/
|-- checkpoints/
|
|-- database/
|   |-- data/
|   |   |-- 1.jpg
|   |   |-- 2.jpg
|   |  
|   |-- DB.npz
|   |-- index.txt
|
|-- models/
|   |-- __init__.py
|   |-- build.py
|   |-- swin_transformer.py
|
|-- scripts/
|   |-- generate_DB.sh
|
|-- test/
|
|-- config.py
|-- database.py
|-- generate_DB.py
|-- main.py
|-- requirements.txt
|-- README

Getting Started

Prepare images database

Just find out some images and put them into database/data/.
run ./script/generate_DB.sh in linux machine to extract features of all images and package them into DB.npz.
run main.py, open an image and select interested region, then program will find similar images in database automatically!

Results

Here we show two image retrieval results. Two images in the first row are original image and cropped image respectively while the others are retrieval results (have been sorted by similarity).

Note: all images are resize to square for visual requirement, so there would be distorted in some of the images.

Acknowledgments

Part of code in this repository are copied from Swin-transformer, thank the authors for their exquiste code.

This repository contains a CBIR system that uses swin transformer to extract image's feature.

Related tags

Overview

Swin-transformer based CBIR

Structure

Getting Started

Results

Acknowledgments

Owner

JsHou

Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression

PyTorch implementation of MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

Fortuitous Forgetting in Connectionist Networks

A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21th International Society for Music Information Retrieval Conference, ISMIR. 2020.

Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images

AI Summer's complete catalog of articles

Compare neural networks by their feature similarity

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Pytorch-3dunet - 3D U-Net model for volumetric semantic segmentation written in pytorch

Reimplement of SimSwap training code

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

DeRF: Decomposed Radiance Fields

Supervised Contrastive Learning for Downstream Optimized Sequence Representations

This repository contains the files for running the Patchify GUI.

code and models for "Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation"

A Lightweight Hyperparameter Optimization Tool 🚀

TensorFlow implementation of ENet, trained on the Cityscapes dataset.

A deep learning object detector framework written in Python for supporting Land Search and Rescue Missions.

Husein pet projects in here!