CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Last update: Dec 22, 2022

Overview

This repo contains the code for our paper "Counterfactual Samples Synthesizing for Robust Visual Question Answering". The code is modified from here; many thanks!

Prerequisites

Make sure you are on a machine with an NVIDIA GPU, Python 2.7, and about 100 GB of free disk space. The following packages are required:
h5py==2.10.0
pytorch==1.1.0
Click==7.0
numpy==1.16.5
tqdm==4.35.0
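
To confirm that an environment matches these pins, a minimal sketch such as the following may help. It assumes that each package exposes a `__version__` attribute and that "pytorch" and "Click" are imported as `torch` and `click`. The names `PINS`, `installed_versions`, and `find_mismatches` are illustrative and not part of this repo.

```python
# Pinned versions from the list above, keyed by import name.
# Assumption: "pytorch" is imported as `torch` and "Click" as `click`.
PINS = {
    "h5py": "2.10.0",
    "torch": "1.1.0",
    "click": "7.0",
    "numpy": "1.16.5",
    "tqdm": "4.35.0",
}


def installed_versions(names):
    """Return {import_name: version or None} for each requested package."""
    found = {}
    for name in names:
        try:
            module = __import__(name)
            found[name] = getattr(module, "__version__", None)
        except ImportError:
            # Package is not installed in this environment.
            found[name] = None
    return found


def find_mismatches(pins, installed):
    """Return {name: (wanted, found)} for packages that differ from the pins."""
    return {
        name: (wanted, installed.get(name))
        for name, wanted in pins.items()
        if installed.get(name) != wanted
    }


if __name__ == "__main__":
    mismatches = find_mismatches(PINS, installed_versions(PINS))
    for name, (wanted, found) in sorted(mismatches.items()):
        print("%s: expected %s, found %s" % (name, wanted, found))
```

The comparison is an exact string match, so a build suffix in a reported version (for example a local PyTorch build tag) would be flagged as a mismatch and should be checked by hand.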

Data Setup

You can use

bash tools/download.sh

to download the data. The rest of the data and the trained model can be obtained from BaiduYun (passwd: 3jot) or GoogleDrive. Unzip feature1.zip and feature2.zip and merge their contents into data/rcnn_feature/. Then run

bash tools/process.sh

to process the data.
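
The unzip-and-merge step for feature1.zip and feature2.zip can also be sketched in Python. This is only a sketch under assumptions: it assumes the archives hold the feature files at their top level (if they contain a nested folder, the files may need to be moved afterwards), and `merge_archives` is a hypothetical helper, not a script shipped with this repo.

```python
import os
import zipfile


def merge_archives(archive_paths, target_dir):
    """Extract every archive into target_dir and return the member names."""
    if not os.path.isdir(target_dir):
        os.makedirs(target_dir)
    extracted = []
    for path in archive_paths:
        with zipfile.ZipFile(path) as archive:
            # Members with the same name in a later archive overwrite earlier ones.
            archive.extractall(target_dir)
            extracted.extend(archive.namelist())
    return extracted
```

Calling `merge_archives(["feature1.zip", "feature2.zip"], "data/rcnn_feature")` from the repo root, with both archives downloaded there, would place the contents of both archives in the same folder.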

Training

Run

CUDA_VISIBLE_DEVICES=0 python main.py --dataset cpv2 --mode q_v_debias --debias learned_mixin --topq 1 --topv -1 --qvp 5 --output [] --seed 0

to train a model. Replace [] with your own value for --output.
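
When launching several runs, it can be convenient to build the training command programmatically. The sketch below reuses only the flags shown in the command above. The function names `build_train_command` and `run_training` are our own placeholders, and the exact meaning of each flag should be checked against `main.py`.

```python
import os
import subprocess


def build_train_command(output, seed=0):
    """Return the argv list for the training command shown above."""
    return [
        "python", "main.py",
        "--dataset", "cpv2",
        "--mode", "q_v_debias",
        "--debias", "learned_mixin",
        "--topq", "1",
        "--topv", "-1",
        "--qvp", "5",
        "--output", output,
        "--seed", str(seed),
    ]


def run_training(output, seed=0, gpu="0"):
    """Run training on the chosen GPU; raises CalledProcessError on failure."""
    # Copy the current environment and restrict the visible GPU for the child process.
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=gpu)
    subprocess.check_call(build_train_command(output, seed), env=env)
```

Passing the arguments as a list avoids shell quoting issues, and setting `CUDA_VISIBLE_DEVICES` through `env` mirrors the prefix used in the shell command.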

Testing

Run

CUDA_VISIBLE_DEVICES=0 python eval.py --dataset cpv2 --debias learned_mixin --model_state []

to evaluate a model. Replace [] with your own value for --model_state.

Citation

If you find this code useful, please cite the following paper:

@inproceedings{chen2020counterfactual,
title={Counterfactual Samples Synthesizing for Robust Visual Question Answering},
author={Chen, Long and Yan, Xin and Xiao, Jun and Zhang, Hanwang and Pu, Shiliang and Zhuang, Yueting},
booktitle={CVPR},
year={2020}
}
