3D-Reconstruction 基于深度学习方法的单目多视图三维重建

Last update: Dec 26, 2022

Related tags

Deep Learning 3D-Reconstruction

Overview

基于深度学习方法的单目多视图三维重建

Part I 三维重建

代码：Part1

技术文档：[Markdown] [PDF]

原始图像：Original Images

点云结果：Point Cloud Results-1

效果图：

Part II 基于计算机视觉方法的点云到点云窗户识别

代码：Part2

技术文档：[Markdown] [PDF]

点云结果：Point Cloud Results-2

算法流程图：

Part III 基于ResNest的图像到点云的语义分割

代码：Part3

技术文档：[Markdown] [PDF]

语义分割结果：Semantic Segmentation Results

点云结果：Point Cloud Results-3

效果图：

参考文献

AA-RMVSNet [arXiv] [CVF] [PDF]

Wei Z, Zhu Q, Min C, et al. Aa-rmvsnet: Adaptive aggregation recurrent multi-view stereo network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 6187-6196.

Cascade-MVSNet [arXiv] [CVF] [PDF]

Gu X, Fan Z, Zhu S, et al. Cascade cost volume for high-resolution multi-view stereo and stereo matching[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 2495-2504.

TransMVSNet [arXiv] [PDF]

Ding Y, Yuan W, Zhu Q, et al. TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers[J]. arXiv preprint arXiv:2111.14600, 2021.

LoFTR [arXiv] [CVF] [PDF]

Sun J, Shen Z, Wang Y, et al. LoFTR: Detector-free local feature matching with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 8922-8931.

PatchmatchNet [arXiv] [CVF] [PDF]

Wang F, Galliani S, Vogel C, et al. PatchmatchNet: Learned Multi-View Patchmatch Stereo[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 14194-14203.

ResNeSt [arXiv] [PDF]

Zhang H, Wu C, Zhang Z, et al. Resnest: Split-attention networks[J]. arXiv preprint arXiv:2004.08955, 2020.

致谢

稀疏重建部分使用Colmap完成相机参数的获取。

稠密重建部分的代码主要来源于AA-RMVSNet。

点云切割与可视化使用CloudCompare及Meshlab完成。

调用Open3D进行表面重建。

Cascade+Transformer的代码主要基于kwea123实现的pytorch-lightning版本的Cascade-MVSNetl以及LoFTR进行实现。

窗户识别算法中部分思路参考了Color Space的矩形识别算法，图像处理技术主要基于冈萨雷斯的数字图像处理（第三版）。

语义分割部分调用了PyTorch-Encoding。

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

StyleGAR TODO: add arxiv link Implementation of Inverting Generative Adversarial Renderer for Face Reconstruction TODO: for test Currently, some model

155 Oct 27, 2022

Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

The Boombox: Visual Reconstruction from Acoustic Vibrations Boyuan Chen, Mia Chiquier, Hod Lipson, Carl Vondrick Columbia University Project Website |

12 Nov 30, 2022

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints Official implementation for Reducing Footskate in Human Motion Recon

38 Nov 1, 2022

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction TSDF++ is a novel multi-object TSDF formulation that can encode mult

130 Dec 29, 2022

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

MeshTransformer ✨ This is our research code of End-to-End Human Pose and Mesh Reconstruction with Transformers. MEsh TRansfOrmer is a simple yet effec

473 Dec 31, 2022

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

SinIR (Official Implementation) Requirements To install requirements: pip install -r requirements.txt We used Python 3.7.4 and f-strings which are in

47 Oct 11, 2022

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

494 Jan 6, 2023

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

Neural Deformation Graphs Project Page | Paper | Video Neural Deformation Graphs for Globally-consistent Non-rigid Reconstruction Aljaž Božič, Pablo P

134 Dec 16, 2022

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

LASR Installation Build with conda conda env create -f lasr.yml conda activate lasr # install softras cd third_party/softras; python setup.py install;

157 Dec 26, 2022

Releases(7)

7(Feb 16, 2022)

White mesh generated by Neus
Source code(tar.gz)
Source code(zip)
dongbeiya_neus.ply(11.21 MB)
gym_north_neus.ply(21.28 MB)
gym_south_neus.ply(16.59 MB)
6(Feb 16, 2022)

White mesh generated by Colmap and Meshlab
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(19.11 MB)
dongbeiya.png(8.45 MB)
gym_north.ply(31.93 MB)
gym_north.png(8.73 MB)
gym_south.ply(26.97 MB)
gym_south.png(9.32 MB)
5(Dec 29, 2021)

Original images for reconstruction
Source code(tar.gz)
Source code(zip)
PIC2.zip(755.68 MB)
PIC2.z01(900.00 MB)
PIC2.z02(900.00 MB)
dby.zip(735.16 MB)
dby.z02(900.00 MB)
dby.z01(900.00 MB)
4(Dec 19, 2021)

Semantic Segmentation Results of Problem 3
Source code(tar.gz)
Source code(zip)
filtered_segmentation_result_dongbeiya.zip(661.17 MB)
filtered_segmentation_result_gym.zip(786.65 MB)
segmentation_result_dongbeiya.zip(64.31 MB)
segmentation_result_dongbeiya_block.zip(53.27 MB)
segmentation_result_gym.zip(4.72 MB)
3(Dec 19, 2021)

Point Cloud Results of Problem 3
Source code(tar.gz)
Source code(zip)
2(Dec 19, 2021)

Point Cloud Results of Problem 2
Source code(tar.gz)
Source code(zip)
gym_south_window.ply(627.30 MB)
gym_north_window.ply(808.62 MB)
dongbeiya_window.ply(1800.53 MB)
gym_window.ply(1603.31 MB)
1(Dec 19, 2021)

Point Cloud Results of Problem 1
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(731.13 MB)
gym_south.ply(696.19 MB)
gym_north.ply(707.89 MB)
gym.ply(1404.08 MB)

Owner

HMT_Curo

GitHub Repository

git《Joint Entity and Relation Extraction with Set Prediction Networks》(2020) GitHub:

Joint Entity and Relation Extraction with Set Prediction Networks Source code for Joint Entity and Relation Extraction with Set Prediction Networks. W

130 Dec 13, 2022

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

This project is a web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks. Thanks for NVlabs' excelle

150 Dec 15, 2022

Yet another video caption

5 May 26, 2022

Junction Tree Variational Autoencoder for Molecular Graph Generation (ICML 2018)

Junction Tree Variational Autoencoder for Molecular Graph Generation Official implementation of our Junction Tree Variational Autoencoder https://arxi

418 Jan 07, 2023

Code for the paper "Adversarial Generator-Encoder Networks"

This repository contains code for the paper "Adversarial Generator-Encoder Networks" (AAAI'18) by Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky. Pr

279 Jun 26, 2022

Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment.

(ACMMM 2021 Oral) SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment This repository shows two tasks: Face landmark detection and Fac

51 Dec 13, 2022

An OpenAI Gym environment for Super Mario Bros

gym-super-mario-bros An OpenAI Gym environment for Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The Nintendo Entertainment System (NES) us

1 Jan 05, 2022

Aesara is a Python library that allows one to define, optimize, and efficiently evaluate mathematical expressions involving multi-dimensional arrays.

898 Jan 07, 2023

3D-Reconstruction 基于深度学习方法的单目多视图三维重建

Related tags

Overview

基于深度学习方法的单目多视图三维重建

Part I 三维重建

Part II 基于计算机视觉方法的点云到点云窗户识别

Part III 基于ResNest的图像到点云的语义分割

参考文献

致谢

You might also like...

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

Releases(7)

7(Feb 16, 2022)

6(Feb 16, 2022)

5(Dec 29, 2021)

4(Dec 19, 2021)

3(Dec 19, 2021)

2(Dec 19, 2021)

1(Dec 19, 2021)

Owner

HMT_Curo

git《Joint Entity and Relation Extraction with Set Prediction Networks》(2020) GitHub:

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks

Yet another video caption

Junction Tree Variational Autoencoder for Molecular Graph Generation (ICML 2018)

Code for the paper "Adversarial Generator-Encoder Networks"

Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment.

An OpenAI Gym environment for Super Mario Bros

Aesara is a Python library that allows one to define, optimize, and efficiently evaluate mathematical expressions involving multi-dimensional arrays.

Implement slightly different caffe-segnet in tensorflow

Exploiting a Zoo of Checkpoints for Unseen Tasks

TensorFlow implementation of PHM (Parameterization of Hypercomplex Multiplication)

Neural Architecture Search Powered by Swarm Intelligence 🐜

code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification

A simple and useful implementation of LPIPS.

Highly comparative time-series analysis

Deep Multi-Magnification Network for multi-class tissue segmentation of whole slide images

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

Testing and Estimation of structural breaks in Stata

Source code for CVPR 2021 paper "Riggable 3D Face Reconstruction via In-Network Optimization"

AAI supports interdisciplinary research to help better understand human, animal, and artificial cognition.