CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images

Last update: Dec 29, 2021

Related tags

Overview

Code and result about CCAFNet(IEEE TMM)
'CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images' IEEE TMM

Requirements

Python 3.7, Pytorch 1.5.0+, Cuda 10.2, TensorboardX 2.1, opencv-python

Dataset and Evaluate tools

RGB-D SOD Datasets can be found in: http://dpfan.net/d3netbenchmark/ or https://github.com/jiwei0921/RGBD-SOD-datasets

we use the matlab verison provide by Dengping Fan, and we provide our test datesets 百度网盘提取码：zust

Result

Test maps: 百度网盘提取码：zust
Pretrained model download:百度网盘提取码：zust
PS: we resize the testing data to the size of 224 * 224 for quicky evaluate, 百度网盘提取码：zust

Citation

@ARTICLE{9424966,
author={Zhou, Wujie and Zhu, Yun and Lei, Jingsheng and Wan, Jian and Yu, Lu},
journal={IEEE Transactions on Multimedia},
title={CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images},
year={2021},
doi={10.1109/TMM.2021.3077767}}

Acknowledgement

The implement of this project is based on the code of ‘Cascaded Partial Decoder for Fast and Accurate Salient Object Detection, CVPR2019’and 'BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network' proposed by Wu et al and Deng et al.

Contact

Please drop me an email for further problems or discussion: [email protected] or [email protected]

CCAFNet: Crossflow and Cross-scale Adaptive Fusion Network for Detecting Salient Objects in RGB-D Images

Related tags

Overview

Requirements

Dataset and Evaluate tools

Result

Citation

Acknowledgement

Contact

Owner

zyrant丶

Invasive Plant Species Identification

68 keypoint annotations for COFW test data

Automatically erase objects in the video, such as logo, text, etc.

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

Layered Neural Atlases for Consistent Video Editing

Implementation of "Semi-supervised Domain Adaptive Structure Learning"

Code of the paper "Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition"

BuildingNet: Learning to Label 3D Buildings

mPose3D, a mmWave-based 3D human pose estimation model.

TumorInsight is a Brain Tumor Detection and Classification model built using RESNET50 architecture.

OverFeat is a Convolutional Network-based image classifier and feature extractor.

A project for developing transformer-based models for clinical relation extraction

Lightweight Cuda Renderer with Python Wrapper.

Expand human face editing via Global Direction of StyleCLIP, especially to maintain similarity during editing.

The code of NeurIPS 2021 paper "Scalable Rule-Based Representation Learning for Interpretable Classification".

Caffe implementation for Hu et al. Segmentation for Natural Language Expressions

In this tutorial, you will perform inference across 10 well-known pre-trained object detectors and fine-tune on a custom dataset. Design and train your own object detector.

Code for Mesh Convolution Using a Learned Kernel Basis

PASSL包含 SimCLR，MoCo，BYOL，CLIP等基于对比学习的图像自监督算法以及 Vision-Transformer，Swin-Transformer，BEiT，CVT，T2T，MLP_Mixer等视觉Transformer算法