Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization

Last update: Dec 27, 2022

Line as a Visual Sentence with LineTR

This repository contains the inference code, pretrained model, and demo scripts for the following paper. It supports both point features (SuperPoint) and line features (LSD + LineTR).

@article{syoon-2021-linetr,
  author    = {Sungho Yoon and Ayoung Kim},
  title     = {{Line as a Visual Sentence}: Context-aware Line Descriptor for Visual Localization},
  journal   = {IEEE Robotics and Automation Letters},
  year      = {2021}
}

Abstract

Along with feature points for image matching, line features provide additional constraints to solve visual geometric problems in robotics and computer vision (CV). Although recent convolutional neural network (CNN)-based line descriptors are promising for viewpoint changes or dynamic environments, we claim that the CNN architecture has innate disadvantages to abstract variable line length into the fixed-dimensional descriptor. In this paper, we effectively introduce the Line-Transformer dealing with variable lines. Inspired by natural language processing (NLP) tasks where sentences can be understood and abstracted well in neural nets, we view a line segment as a sentence that contains points (words). By attending to well-describable points on a line dynamically, our descriptor performs excellently on variable line length. We also propose line signature networks sharing the line's geometric attributes to neighborhoods. Performing as group descriptors, the networks enhance line descriptors by understanding lines' relative geometries. Finally, we present the proposed line descriptor and matching in a Point and Line Localization (PL-Loc). We show that the visual localization with feature points can be improved using our line features. We validate the proposed method for homography estimation and visual localization.
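To make the "line as a sentence" idea concrete, the sketch below splits a line segment into evenly spaced points (the "words") and pads the result to a fixed length with a mask, which is a common way to feed variable-length sequences to a Transformer. The helper name `tokenize_line`, the sampling interval, and the token cap are illustrative assumptions, not values or code taken from this repository.

```python
import numpy as np

def tokenize_line(start, end, interval=8.0, max_tokens=21):
    """Sample points along a segment and pad them to a fixed length.

    Hypothetical helper: `interval` (pixels) and `max_tokens` are
    illustrative values, not the settings used by LineTR.
    """
    start = np.asarray(start, dtype=float)
    end = np.asarray(end, dtype=float)
    length = np.linalg.norm(end - start)
    # Longer lines yield more tokens, capped at max_tokens (at least 2).
    n = int(min(max_tokens, max(2, np.floor(length / interval) + 1)))
    t = np.linspace(0.0, 1.0, n)[:, None]
    points = (1.0 - t) * start + t * end   # (n, 2) points on the segment
    tokens = np.zeros((max_tokens, 2))
    mask = np.zeros(max_tokens, dtype=bool)
    tokens[:n] = points
    mask[:n] = True                        # True marks real (non-padded) tokens
    return tokens, mask

short_tokens, short_mask = tokenize_line((0, 0), (16, 0))
long_tokens, long_mask = tokenize_line((0, 0), (400, 0))
print(short_mask.sum(), long_mask.sum())
```

Both segments produce arrays of the same shape, while the mask records how many points each line actually contributes. An attention layer can then ignore the padded positions.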

Getting Started

This code was tested with Python 3.6 and PyTorch 1.8 on Ubuntu 18.04.

# create and activate a new conda environment
conda create -y --name linetr
conda activate linetr

# install the dependencies
conda install -y python=3.6
pip install -r requirements.txt

Command

There are two demo scripts:

demo_LineTR.py : run a live demo on a camera or video file
match_line_pairs.py : find line correspondences for the image pairs listed in input_pairs.txt
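The layout of input_pairs.txt is not described here. As a minimal sketch, assuming one pair per line with the two image paths separated by whitespace (and any extra fields ignored), the file could be read as follows. The function name `parse_image_pairs` is hypothetical and is not part of the repository.

```python
def parse_image_pairs(text):
    """Return (image_a, image_b) tuples from the contents of a pairs file.

    Assumed layout: one pair per line, the first two whitespace-separated
    fields are image paths. Blank lines are skipped.
    """
    pairs = []
    for raw in text.splitlines():
        fields = raw.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) < 2:
            raise ValueError(f"expected two image paths, got: {raw!r}")
        pairs.append((fields[0], fields[1]))
    return pairs

example = "img/a1.png img/a2.png\n\nimg/b1.png img/b2.png\n"
print(parse_image_pairs(example))
```

For a file on disk, the contents can be passed in with `parse_image_pairs(open("input_pairs.txt").read())`.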

Keyboard control:

n: select the current frame as the anchor
e/r: increase/decrease the keypoint confidence threshold
d/f: increase/decrease the nearest neighbor matching threshold for keypoints
c/v: increase/decrease the nearest neighbor matching threshold for keylines
k: toggle the visualization of keypoints
q: quit
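The effect of a nearest neighbor matching threshold can be illustrated with a small sketch. It assumes L2-normalized descriptors, a mutual nearest-neighbor check, and a Euclidean distance threshold. These are assumptions for illustration; the demo's actual matching code may differ, and `match_nearest_neighbors` is a hypothetical name.

```python
import numpy as np

def match_nearest_neighbors(desc_a, desc_b, threshold=0.8):
    """Mutual nearest-neighbor matching with a distance threshold.

    desc_a: (N, D), desc_b: (M, D). Returns a list of (i, j) index pairs.
    A pair is kept only if i and j are each other's nearest neighbor and
    their Euclidean distance is below `threshold` (illustrative default).
    """
    desc_a = np.asarray(desc_a, dtype=float)
    desc_b = np.asarray(desc_b, dtype=float)
    if len(desc_a) == 0 or len(desc_b) == 0:
        return []
    # Pairwise Euclidean distances, shape (N, M).
    dists = np.linalg.norm(desc_a[:, None, :] - desc_b[None, :, :], axis=2)
    nn_ab = dists.argmin(axis=1)  # nearest descriptor in B for each row of A
    nn_ba = dists.argmin(axis=0)  # nearest descriptor in A for each row of B
    matches = []
    for i, j in enumerate(nn_ab):
        if nn_ba[j] == i and dists[i, j] < threshold:
            matches.append((i, int(j)))
    return matches

a = np.array([[1.0, 0.0], [0.0, 1.0]])
b = np.array([[0.0, 1.0], [1.0, 0.0], [-1.0, 0.0]])
print(match_nearest_neighbors(a, b))
print(match_nearest_neighbors(a, b, threshold=0.0))
```

Under these assumptions, lowering the threshold keeps fewer but more reliable matches, and raising it admits more candidates at the cost of more false matches.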

The scripts partially reuse code from SuperGluePretrainedNetwork.

BibTeX Citation

@ARTICLE{syoon-2021-linetr,
  author    = {Sungho Yoon and Ayoung Kim},
  title     = {{Line as a Visual Sentence}: Context-aware Line Descriptor for Visual Localization},
  journal   = {IEEE Robotics and Automation Letters},
  year      = {2021},
  url       = {https://arxiv.org/abs/2109.04753}
}

Acknowledgment

This work was fully supported by the [Localization in changing city] project, funded by NAVER LABS Corporation.

Owner

SungHo Yoon
