An easy-to-use app to visualise attentions of various VQA models.

Last update: Nov 13, 2022

Overview

Ask Me Anything: A tool for visualising Visual Question Answering (AMA)

An easy-to-use app to visualise attentions of various VQA models. Please click here to see a live demo of the app!

• Models
• Requirements
• Installation
• How to run
• How to use
• Contributing
• Acknowledgements

Models

• MFB - Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question Answering
Zhou Yu, Jun Yu, Jianping Fan, Dacheng Tao
Arxiv

• (Coming soon) MCAN - Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu, Jun Yu, Yuhao Cui, Dacheng Tao, Qi Tian
Arvix

Requirements

Please check the requirements.txt file for the version numbers.

opencv_python==4.4.0.46
numpy==1.19.4
pandas==1.1.4
torch==1.4.0
matplotlib==3.3.2
gdown==3.12.2
seaborn==0.11.0
dotmap==1.3.23
streamlit==0.70.0
Pillow==8.0.1
PyYAML==5.3.1

Installation

Install Anaconda
Clone this repository and cd into it.
git clone https://github.com/apugoneappu/ask_me_anything.git && cd ask_me_anything
In a new environment (new_env)
pip install -r requirements.txt

How to run

From the directory of this repository, do the following -

conda activate new_env
streamlit run main.py
In a browser tab, open the Network URL displayed in your terminal.

Done! 🎉

How to use

Contributing

First of all, thank you for wanting to contribute to this work! I will try and make your job as easy as possible. Detailed instructions coming soon ...

Acknowledgements

This repository has been built by modifying the OpenVQA repository.

I would also like to thank Yash Khandelwal, Nikhil Shah and Chinmay Singh for their support and amazing suggestions!

Huge thanks to Streamlit for making all of this possible and for Streamlit Sharing that enables free hosting of this app! ❤️

An easy-to-use app to visualise attentions of various VQA models.

Related tags

Overview

Ask Me Anything: A tool for visualising Visual Question Answering (AMA)

Models

Requirements

Installation

How to run

How to use

Contributing

Acknowledgements

Owner

Apoorve

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”

Attention Probe: Vision Transformer Distillation in the Wild

Aalto-cs-msc-theses - Listing of M.Sc. Theses of the Department of Computer Science at Aalto University

Official PyTorch implementation of "Synthesis of Screentone Patterns of Manga Characters"

DilatedNet in Keras for image segmentation

A voice recognition assistant similar to amazon alexa, siri and google assistant.

Official repository for Few-shot Image Generation via Cross-domain Correspondence (CVPR '21)

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

Demonstrational Session git repo for H SAF User Workshop (28/1)

TensorFlow Implementation of "Show, Attend and Tell"

This repository contains the segmentation user interface from the OpenSurfaces project, extracted as a lightweight tool

Code release for NeurIPS 2020 paper "Co-Tuning for Transfer Learning"

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Adapter-BERT: Parameter-Efficient Transfer Learning for NLP.

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

[ICSE2020] MemLock: Memory Usage Guided Fuzzing

This folder contains the python code of UR5E's advanced forward kinematics model.

Implementation of the paper: "SinGAN: Learning a Generative Model from a Single Natural Image"