Multi-query Video Retrieval

This repository contains the code for the paper:

@misc{wang2022multiquery,
      title={Multi-query Video Retrieval}, 
      author={Zeyu Wang and Yu Wu and Karthik Narasimhan and Olga Russakovsky},
      year={2022},
      eprint={2201.03639},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Data Preparation

Download raw videos for MSR-VTT, MSVD and VATEX, and put them into data/{dataset}/raw_videos folder.
Run the script data/extract_frames.sh to extract frames from raw videos.

The resulting data folder structures like this:

├── data
    ├── msrvtt
        ├── msrvtt_train.json
        ├── msrvtt_test.json
        ├── msrvtt_test_varying_query_sample_1-20.json
        ├── raw_videos
            ├── video0.mp4
            ├── ...
        ├── extracted_frames
            ├── video0.mp4
                ├── 0.jpg
                ├── ...
            ├── ...
    ├── msvd
        ├── ...
    ├── vatex
        ├── ...

For Frozen model, download the pretrained checkpoint provided by the original authors here, and put into record/pretrained folder.

Training

Run command: python train.py -c configs/{config_path}

Evaluation

Run command: python evaluate.py -c configs/{config_path}

Acknowledgements

The structure of this repository is based on https://github.com/victoresque/pytorch-template. Some of the code are adpated from https://github.com/m-bain/frozen-in-time and https://github.com/ArrowLuo/CLIP4Clip.

Multi-query Video Retreival

Related tags

Overview

Multi-query Video Retrieval

Data Preparation

Training

Evaluation

Acknowledgements

Owner

Princeton Visual AI Lab

Scalable machine learning based time series forecasting

SiamMOT is a region-based Siamese Multi-Object Tracking network that detects and associates object instances simultaneously.

PyTorch implementation of DreamerV2 model-based RL algorithm

Plenoxels: Radiance Fields without Neural Networks

Pytorch implementation of ICASSP 2022 paper Attention Probe: Vision Transformer Distillation in the Wild

OpenMMLab Model Deployment Toolset

MutualGuide is a compact object detector specially designed for embedded devices

Simple streamlit app to demonstrate HERE Tour Planning

Hard cater examples from Hopper ICLR paper

Official implementation of AAAI-21 paper "Label Confusion Learning to Enhance Text Classification Models"

Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

Semantic code search implementation using Tensorflow framework and the source code data from the CodeSearchNet project

BC3407-Group-5-Project - BC3407 Group Project With Python

This thesis is mainly concerned with state-space methods for a class of deep Gaussian process (DGP) regression problems

Dynamic Capacity Networks using Tensorflow

시각 장애인을 위한 스마트 지팡이에 활용될 딥러닝 모델 (DL Model Repo)

Manage the availability of workspaces within Frappe/ ERPNext (sidebar) based on user-roles

Neighborhood Contrastive Learning for Novel Class Discovery

Official Code Implementation of the paper : XAI for Transformers: Better Explanations through Conservative Propagation