The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Last update: Dec 04, 2022

Related tags

Overview

Feedback Convolutional Neural Network for Visual Localization and Segmentation

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation. The code is written in PyTorch, very simple to understand.

There is also a Caffe implementation, please check it if you use Caffe and Matlab.

Requirement:

Python 3
Pytorch 0.4.0

How to run:

open the ipython notebooks with jupyter notebook

then open vgg_fr.ipynb or vgg_fsp.ipynb, these are the two main files for demonstrate feedback idea.

How it looks:

If you run vgg_fsp.ipynb without modification of code, you are supposed to see below visualization:

Input image:

Image gradient with respect to the target label:

Image gradient with respect to the target label after 4 iterations of feedback selective pruning (FSP):

Files explanation:

vgg_fr.ipynb: the main file that defines the vgg feedback network with the feedback recovering mechanism and run a feedback visualization on examplar images.
vgg_fsp.ipynb: the main file that defines the vgg feedback network with the feedback selective pruning mechanism and run a feedback visualization on examplar images.
images: storing exmaplar images
imagenet1000_clsid_to_human.txt: storing image net 1000 class names, for visualization and understanding purpose
test/simple_test.ipynb: unit test for a simple feedback network, using a simple fully connected structure
test/vgg_test.ipynb: unit test for the loading of a pretrained vgg network, then check the weights copying from pretrained network to a new defined network interface

Citation

Please consider citing in your publications if it helps your research:

@inproceedings{cao2015look,
  title={Look and think twice: Capturing top-down visual attention with feedback convolutional neural networks},
  author={Cao, Chunshui and Liu, Xianming and Yang, Yi and Yu, Yinan and Wang, Jiang and Wang, Zilei and Huang, Yongzhen and Wang, Liang and Huang, Chang and Xu, Wei and others},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  pages={2956--2964},
  year={2015}
}

The code is an implementation of Feedback Convolutional Neural Network for Visual Localization and Segmentation.

Related tags

Overview

Feedback Convolutional Neural Network for Visual Localization and Segmentation

Requirement:

How to run:

How it looks:

Files explanation:

Citation

Owner

Towards Long-Form Video Understanding

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

GRF: Learning a General Radiance Field for 3D Representation and Rendering

Immortal tracker

Multi-query Video Retreival

Using the provided dataset which includes various book features, in order to predict the price of books, using various proposed methods and models.

Vehicle Detection Using Deep Learning and YOLO Algorithm

Automatic deep learning for image classification.

Scenic: A Jax Library for Computer Vision and Beyond

CS50x-AI - Artificial Intelligence with Python from Harvard University

PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition

Blind Video Temporal Consistency via Deep Video Prior

Astrostatistics class for the MSc degree in Astrophysics at the University of Milan-Bicocca (Italy)

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

PyTorch implementation of Graph Convolutional Networks in Feature Space for Image Deblurring and Super-resolution, IJCNN 2021.

FaceAPI: AI-powered Face Detection & Rotation Tracking, Face Description & Recognition, Age & Gender & Emotion Prediction for Browser and NodeJS using TensorFlow/JS

Malware Analysis Neural Network project.

A PyTorch implementation of "ANEMONE: Graph Anomaly Detection with Multi-Scale Contrastive Learning", CIKM-21

Simple data balancing baselines for worst-group-accuracy benchmarks.

Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021.