(CVPR 2021) PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds

Last update: Dec 25, 2022

Related tags

Deep Learning PAConv

Overview

PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds

by Mutian Xu*, Runyu Ding*, Hengshuang Zhao, and Xiaojuan Qi.

Introduction

This repository is built for the official implementation of:

PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds (CVPR2021) [arXiv]

If you find our work useful in your research, please consider citing:

@inproceedings{xu2021paconv,
  title={PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds},
  author={Xu, Mutian and Ding, Runyu and Zhao, Hengshuang and Qi, Xiaojuan},
  booktitle={CVPR},
  year={2021}
}

Highlight

All initialization models and trained models are available.
Provide fast multiprocessing training (nn.parallel.DistributedDataParallel) with official nn.SyncBatchNorm.
Incorporated with tensorboardX for better visualization of the whole training process.
Support recent versions of PyTorch.
Well designed code structures for easy reading and using.

Usage

We provide scripts for different point cloud processing tasks:

Object Classification task on Modelnet40.
Shape Part Segmentation task on ShapeNetPart.
Indoor Scene Segmentation task on S3DIS.

You can find the instructions for running these tasks in the above corresponding folders.

Performance

The following tables report the current performances on different tasks and datasets. ( * denotes the backbone architectures)

Object Classification on ModelNet40

Method	OA
PAConv (PointNet)*	93.2%
PAConv (DGCNN)*	93.9%

Shape Part Segmentation on ShapeNet Part

Method	Class mIoU	Instance mIoU
PAConv (DGCNN)*	84.6%	86.1%

Indoor Scene Segmentation on S3DIS Area-5

Method	S3DIS mIoU
PAConv (PointNet++)*	66.58%

Contact

You are welcome to send pull requests or share some ideas with us. Contact information: Mutian Xu ([email protected]) or Runyu Ding ([email protected]).

Acknowledgement

Our code base is partially borrowed from PointWeb, DGCNN and PointNet++.

(CVPR 2021) PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds

Related tags

Overview

PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds

Introduction

Highlight

Usage

Performance

Object Classification on ModelNet40

Shape Part Segmentation on ShapeNet Part

Indoor Scene Segmentation on S3DIS Area-5

Contact

Acknowledgement

Owner

CVMI Lab

GANmouflage: 3D Object Nondetection with Texture Fields

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

The code of Zero-shot learning for low-light image enhancement based on dual iteration

Sign-to-Speech for Sign Language Understanding: A case study of Nigerian Sign Language

Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"

pcnaDeep integrates cutting-edge detection techniques with tracking and cell cycle resolving models.

DeepLabv3+：Encoder-Decoder with Atrous Separable Convolution语义分割模型在tensorflow2当中的实现

Hierarchical probabilistic 3D U-Net, with attention mechanisms (—𝘈𝘵𝘵𝘦𝘯𝘵𝘪𝘰𝘯 𝘜-𝘕𝘦𝘵, 𝘚𝘌𝘙𝘦𝘴𝘕𝘦𝘵) and a nested decoder structure with deep supervision (—𝘜𝘕𝘦𝘵++).

Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(2021) paper

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

TensorFlow 2 implementation of the Yahoo Open-NSFW model

A whale detector design for the Kaggle whale-detector challenge!

A Tensorflow implementation of CapsNet based on Geoffrey Hinton's paper Dynamic Routing Between Capsules

Code for the paper: Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization (https://arxiv.org/abs/2002.11798)

Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)

✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.

Xview3 solution - XView3 challenge, 2nd place solution

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)

Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation