Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Last update: Dec 09, 2022

Related tags

Overview

Character in Story Identification Network (CiSIN)

This project hosts the code for our paper.

Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung and Gunhee Kim. Character Grounding and Re-Identification inStory of Videos and Text Descriptions. In ECCV (spotlight), 2020.

This project is an Winning Solution in LSMDC 19 "Fill-in the Characters" task. For more information about the LSMDC visit the Large Scale Movie Description Challenge (LSMDC) 2019

Reference

If you use this code as part of any published research, please refer following paper,

@inproceedings{yu:2020:ECCV,
    title="{Character Grounding and Re-Identification inStory of Videos and Text Descriptions}",
    author={Yu, Youngjae and Kim, Jongseok and Yun, Heeseung and Chung Jiwan and Kim, Gunhee},
    booktitle={ECCV},
    year=2020
}

System Requirements

The following dependencies should be installed:

Python 3.6
Pytorch 1.4.0
torchvision 0.5.0
CUDA 10.0 supported GPU with at least 12GB memory
see requirements.txt for more details

Data Setup

Coming soon,

CiSIN

To train our model,

python train.py

Acknowledgement

We thank SNUVL lab members for helpful comments. This research was supported by Seoul National University, Brain Research Program by National Research Foundation of Korea (NRF) (2017M3C7A1047860), and AIR Lab (AI Research Lab) in Hyundai Motor Company through HMC-SNU AI Consortium Fund.

License

LICENSE.md.

Character Grounding and Re-Identification in Story of Videos and Text Descriptions

Related tags

Overview

Character in Story Identification Network (CiSIN)

Reference

System Requirements

Data Setup

CiSIN

Acknowledgement

License

Owner

Regulatory Instruments for Fair Personalized Pricing.

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

RSNA Intracranial Hemorrhage Detection with python

Codes for 'Dual Parameterization of Sparse Variational Gaussian Processes'

Hashformers is a framework for hashtag segmentation with transformers.

GDSC-ML Team Interview Task

Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

An example of semantic segmentation using tensorflow in eager execution.

A transformer-based method for Healthcare Image Captioning in Vietnamese

This is a demo app to be used in the video streaming applications

🦕 NanoSaur is a little tracked robot ROS2 enabled, made for an NVIDIA Jetson Nano

A general-purpose encoder-decoder framework for Tensorflow

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Dynamic vae - Dynamic VAE algorithm is used for anomaly detection of battery data

ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information

OoD Minimum Anomaly Score GAN - Code for the Paper 'OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Boundary'

Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Image Restoration Using Swin Transformer for VapourSynth

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System