Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.

Last update: Dec 06, 2022

Overview

ConvNeXt-TF

This repository provides TensorFlow / Keras implementations of different ConvNeXt [1] variants. It also provides the TensorFlow / Keras models that have been populated with the original ConvNeXt pre-trained weights available from [2]. These models are not blackbox SavedModels i.e., they can be fully expanded into tf.keras.Model objects and one can call all the utility functions on them (example: .summary()).

As of today, all the TensorFlow / Keras variants of the models listed here are available in this repository except for the isotropic ones. This list includes the ImageNet-1k as well as ImageNet-21k models.

Refer to the "Using the models" section to get started. Additionally, here's a related blog post that jots down my experience.

Conversion

TensorFlow / Keras implementations are available in models/convnext_tf.py. Conversion utilities are in convert.py.

Models

The converted models are available on TF-Hub.

There should be a total of 15 different models each having two variants: classifier and feature extractor. You can load any model and get started like so:

import tensorflow as tf

model_gcs_path = "gs://tfhub-modules/sayakpaul/convnext_tiny_1k_224/1/uncompressed"
model = tf.keras.models.load_model(model_gcs_path)
print(model.summary(expand_nested=True))

The model names are interpreted as follows:

convnext_large_21k_1k_384: This means that the model was first pre-trained on the ImageNet-21k dataset and was then fine-tuned on the ImageNet-1k dataset. Resolution used during pre-training and fine-tuning: 384x384. large denotes the topology of the underlying model.
convnext_large_1k_224: Means that the model was pre-trained on the ImageNet-1k dataset with a resolution of 224x224.

Results

Results are on ImageNet-1k validation set (top-1 accuracy).

name	original [email protected]	keras [email protected]
convnext_tiny_1k_224	82.1	81.312
convnext_small_1k_224	83.1	82.392
convnext_base_1k_224	83.8	83.28
convnext_base_1k_384	85.1	84.876
convnext_large_1k_224	84.3	83.844
convnext_large_1k_384	85.5	85.376

convnext_base_21k_1k_224	85.8	85.364
convnext_base_21k_1k_384	86.8	86.79
convnext_large_21k_1k_224	86.6	86.36
convnext_large_21k_1k_384	87.5	87.504
convnext_xlarge_21k_1k_224	87.0	86.732
convnext_xlarge_21k_1k_384	87.8	87.68

Differences in the results are primarily because of the differences in the library implementations especially how image resizing is implemented in PyTorch and TensorFlow. Results can be verified with the code in i1k_eval. Logs are available at this URL.

Using the models

Pre-trained models:

Off-the-shelf classification: Colab Notebook
Fine-tuning: Colab Notebook

Randomly initialized models:

from models.convnext_tf import get_convnext_model

convnext_tiny = get_convnext_model()
print(convnext_tiny.summary(expand_nested=True))

To view different model configurations, refer here.

Upcoming (contributions welcome)

Align layer initializers (useful if someone wanted to train the models from scratch)
Allow the models to accept arbitrary shapes (useful for downstream tasks)
Convert the isotropic models as well
Fine-tuning notebook (thanks to awsaf49)
Off-the-shelf-classification notebook
Publish models on TF-Hub

References

[1] ConvNeXt paper: https://arxiv.org/abs/2201.03545

[2] Official ConvNeXt code: https://github.com/facebookresearch/ConvNeXt

Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.

Related tags

Overview

ConvNeXt-TF

Conversion

Models

Results

Using the models

Upcoming (contributions welcome)

References

Acknowledgements

Owner

Sayak Paul

Official implementation of "Can You Spot the Chameleon? Adversarially Camouflaging Images from Co-Salient Object Detection" in CVPR 2022.

Example scripts for the detection of lanes using the ultra fast lane detection model in Tensorflow Lite.

Official Codes for Graph Modularity:Towards Understanding the Cross-Layer Transition of Feature Representations in Deep Neural Networks.

Material related to the Principles of Cloud Computing course.

Chainer Implementation of Semantic Segmentation using Adversarial Networks

PyTorch implementation for the ICLR 2020 paper "Understanding the Limitations of Variational Mutual Information Estimators"

Official PyTorch implementation of CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

Files for a tutorial to train SegNet for road scenes using the CamVid dataset

code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

This is a model made out of Neural Network specifically a Convolutional Neural Network model

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Differentiable molecular simulation of proteins with a coarse-grained potential

Source code for our paper "Molecular Mechanics-Driven Graph Neural Network with Multiplex Graph for Molecular Structures"

Code base for "On-the-Fly Test-time Adaptation for Medical Image Segmentation"

Code and hyperparameters for the paper "Generative Adversarial Networks"

Official implementation of our paper "LLA: Loss-aware Label Assignment for Dense Pedestrian Detection" in Pytorch.

General Virtual Sketching Framework for Vector Line Art (SIGGRAPH 2021)

Python implementation of O-OFDMNet, a deep learning-based optical OFDM system,