Beta R-CNN: Looking into Pedestrian Detection from Another Perspective (NeurIPS 2020)

Last update: Sep 08, 2021

Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

This is the PyTorch implementation of our paper "Beta R-CNN: Looking into Pedestrian Detection from Another Perspective", published at NeurIPS 2020.

Our method aims to detect highly occluded and heavily overlapped instances in crowded scenes, with a particular focus on pedestrian detection.

The code will be released here. Because the experiments were conducted with an internal framework, we need some time to rewrite and clean up the code. We will release the complete code soon.

Abstract

Recently significant progress has been made in pedestrian detection, but it remains challenging to achieve high performance in occluded and crowded scenes. It could be mostly attributed to the widely used representation of pedestrians, i.e., 2D axis-aligned bounding box, which just describes the approximate location and size of the object. Bounding box models the object as a uniform distribution within the boundary, making pedestrians indistinguishable in occluded and crowded scenes due to much noise. To eliminate the problem, we propose a novel representation based on 2D beta distribution, named Beta Representation. It pictures a pedestrian by explicitly constructing the relationship between full-body and visible boxes, and emphasizes the center of visual mass by assigning different probability values to pixels. As a result, Beta Representation is much better for distinguishing highly-overlapped instances in crowded scenes with a new NMS strategy named BetaNMS. What’s more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.
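To make the idea of a 2D beta distribution over a pedestrian box more concrete, here is a minimal illustrative sketch. It is not the released implementation and does not claim to reproduce the parameterization used in the paper. It assumes the 2D density is a product of two independent 1D beta distributions over the full-body box, with the mean placed at the center of the visible box and the standard deviation set to a quarter of the visible extent. All function names (`beta_params`, `axis_params`, `beta_density_2d`) and the quarter-extent rule are hypothetical choices made for this example.

```python
import math


def beta_params(mean, var):
    """Method-of-moments Beta(alpha, beta) for a mean and variance on [0, 1]."""
    max_var = mean * (1.0 - mean)
    if not 0.0 < mean < 1.0 or not 0.0 < var < max_var:
        raise ValueError("need 0 < mean < 1 and 0 < var < mean * (1 - mean)")
    common = max_var / var - 1.0
    return mean * common, (1.0 - mean) * common


def beta_pdf(x, alpha, beta):
    """Density of Beta(alpha, beta) at x; zero outside the open interval (0, 1)."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    log_norm = math.lgamma(alpha + beta) - math.lgamma(alpha) - math.lgamma(beta)
    return math.exp(
        log_norm + (alpha - 1.0) * math.log(x) + (beta - 1.0) * math.log(1.0 - x)
    )


def axis_params(full_lo, full_hi, vis_lo, vis_hi):
    """Beta parameters along one axis, in coordinates normalized to the full box.

    Assumption for this sketch only: the mean sits at the visible-box center
    and the standard deviation is a quarter of the visible extent.
    """
    length = full_hi - full_lo
    mean = ((vis_lo + vis_hi) / 2.0 - full_lo) / length
    std = (vis_hi - vis_lo) / length / 4.0
    return beta_params(mean, std * std)


def beta_density_2d(px, py, full_box, visible_box):
    """Value at pixel (px, py) of the product of the two per-axis beta densities.

    Boxes are (x1, y1, x2, y2); the visible box is assumed to lie inside
    the full-body box.
    """
    fx1, fy1, fx2, fy2 = full_box
    vx1, vy1, vx2, vy2 = visible_box
    ax, bx = axis_params(fx1, fx2, vx1, vx2)
    ay, by = axis_params(fy1, fy2, vy1, vy2)
    u = (px - fx1) / (fx2 - fx1)  # normalized horizontal position
    v = (py - fy1) / (fy2 - fy1)  # normalized vertical position
    return beta_pdf(u, ax, bx) * beta_pdf(v, ay, by)


# A pedestrian whose lower half is occluded: only the upper half is visible.
full = (0.0, 0.0, 100.0, 200.0)
visible = (0.0, 0.0, 100.0, 100.0)
print(beta_density_2d(50.0, 50.0, full, visible))   # inside the visible region
print(beta_density_2d(50.0, 150.0, full, visible))  # inside the occluded region
```

Under these assumptions, pixels near the visible center receive more probability mass than pixels in the occluded part of the full-body box, unlike a plain bounding box, which treats every pixel inside it equally.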

Method

The network structure and some visualization results are shown here:

Citation

@article{BetaRCNN,
  title={Beta R-CNN: Looking into Pedestrian Detection from Another Perspective},
  author={Xu, Zixuan and Li, Banghuai and Yuan, Ye and Dang, Anhong},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Contact

If you have any questions, please do not hesitate to contact Zixuan Xu ([email protected]).
