Predicting Event Memorability from Contextual Visual Semantics

Last update: Oct 06, 2021

Overview


This repository contains the PyTorch implementation of the five configurations in our paper "Predicting Event Memorability from Contextual Visual Semantics".

Raw images should be placed in '../datasets/r3/images/'.
Train and validation (val) splits for the different configurations are under '../datasets/r3/splits/'; the files train_1.txt, val_1.txt, etc. contain the image names and memorability scores for the respective split.
Each ablation-study configuration has its own folder, e.g., './no_face', './no_activity', etc. './full_set' holds the full configuration, with no features removed.
The complete extrinsic features and the memory test outcomes are available in the 'R3_data.csv' file. The features are described in 'R3_data_notes.txt'. Both can be downloaded together with the original image cues at https://drive.google.com/drive/folders/1Bx_ePv7ui6DCIXkESCpoyuvd0H3B9o6d?usp=sharing
The AMNet implementation is adapted from https://github.com/ok1zjf/AMNet
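
As an illustration of how a split file might be read, the sketch below parses lines of the form `<image name> <memorability score>`. The whitespace-separated layout, the function name `load_split`, and the sample entries are assumptions made for this example; check the actual files under '../datasets/r3/splits/' before relying on it.

```python
import os
import tempfile


def load_split(path):
    """Parse a split file into a list of (image_name, score) pairs.

    Assumes one sample per line: image name, whitespace, score.
    This layout is a guess, not confirmed by the repository.
    """
    samples = []
    with open(path, "r") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            # Split on the last whitespace run so names with spaces survive.
            name, score = line.rsplit(None, 1)
            samples.append((name, float(score)))
    return samples


if __name__ == "__main__":
    # Self-contained demo on a temporary file with made-up entries.
    with tempfile.TemporaryDirectory() as tmp:
        demo = os.path.join(tmp, "train_demo.txt")
        with open(demo, "w") as f:
            f.write("img_0001.jpg 0.82\nimg_0002.jpg 0.47\n")
        for name, score in load_split(demo):
            print(name, score)
```

If the real files use a different delimiter (for example commas), only the `rsplit` call needs to change.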

########################################################################################

To train AMNet and CEMNet_wt_AMNet:

python3 main.py --train-batch-size 128 --test-batch-size 128 --cnn ResNet50FC --dataset lamem --train-split train_1 --val-split val_1
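
When several splits are trained in turn, the command above can be generated per split. The sketch below only builds and prints the command lines. The helper name `train_command` and the split indices 1 to 3 are illustrative assumptions (this README only shows train_1 and val_1); the flags are copied from the command above.

```python
import shlex


def train_command(split_id, batch_size=128):
    """Return the training command for one split as an argument list."""
    return [
        "python3", "main.py",
        "--train-batch-size", str(batch_size),
        "--test-batch-size", str(batch_size),
        "--cnn", "ResNet50FC",
        "--dataset", "lamem",
        "--train-split", "train_{}".format(split_id),
        "--val-split", "val_{}".format(split_id),
    ]


if __name__ == "__main__":
    for split_id in (1, 2, 3):  # hypothetical split indices
        # Print only; pass the list to subprocess.run(...) to launch training.
        print(" ".join(shlex.quote(arg) for arg in train_command(split_id)))
```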

To predict:

python3 main.py --cnn ResNet50FC --model-weights /path/to/model/weights_xx.pkl --eval-images /path/to/evl_images --csv-out memorabilities.txt

To train other models (ICNet, MLP, CEMNet_wt_ICNet):

[Go to the respective folder, e.g., '../ICNet']

python main.py

To predict (please select the corresponding splits and model in predict.py):

python predict.py

[Where necessary, change Dataset.py to point to the corresponding split directory]

########################################################################################

System configuration:

Platform: Ubuntu 16.04

GPU: GeForce GTX 1080

CUDA: 9.0

########################################################################################

Python packages:

Python 3.5.6

PyTorch 0.2.0

Torchvision 0.1.9

NumPy 1.15.2

OpenCV 3.1.0

PIL 6.1.0
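
To compare an existing environment with the versions listed above, a small check such as the following may help. It is a sketch: the `EXPECTED` table restates the list above, the import names (`torch`, `torchvision`, `numpy`, `cv2`, `PIL`) are the usual module names for these packages, and the helper name `check_versions` is hypothetical.

```python
import importlib

# Versions restated from the list above, keyed by import name.
EXPECTED = {
    "torch": "0.2.0",
    "torchvision": "0.1.9",
    "numpy": "1.15.2",
    "cv2": "3.1.0",
    "PIL": "6.1.0",
}


def check_versions(expected=EXPECTED):
    """Return {module: (expected, found)}; found is None if the import fails."""
    report = {}
    for module_name, wanted in expected.items():
        try:
            module = importlib.import_module(module_name)
            # Not every module exposes __version__, so fall back to "unknown".
            found = getattr(module, "__version__", "unknown")
        except Exception:
            # Any import-time failure is treated as "not available".
            found = None
        report[module_name] = (wanted, found)
    return report


if __name__ == "__main__":
    for name, (wanted, found) in sorted(check_versions().items()):
        matches = found is not None and str(found).startswith(wanted)
        status = "ok" if matches else "differs"
        print("{:12s} expected {:8s} found {} [{}]".format(name, wanted, found, status))
```

The prefix comparison is deliberate: some builds append a suffix to the version string, so an exact match would report false differences.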

########################################################################################

To cite the paper: Xu Q., Fang F., del Molino A.G., Subbaraju V., Lim J.H., "Predicting Event Memorability from Contextual Visual Semantics", NeurIPS 2021.

If you have any questions, please feel free to contact Dr Xu Qianli: [email protected]
