Implements Gradient Centralization and makes it available as a Python package for TensorFlow

Last update: Nov 01, 2022

Overview

Gradient Centralization TensorFlow

This Python package implements Gradient Centralization, a simple and effective optimization technique for deep neural networks (DNNs), in TensorFlow. The technique was proposed by Yong et al. in the paper Gradient Centralization: A New Optimization Technique for Deep Neural Networks. According to the paper, it can both speed up the training process and improve the final generalization performance of DNNs.
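As a rough illustration of the operation itself (not the package's actual code), gradient centralization removes the mean from each weight gradient so that the gradient of every weight vector has zero mean. The NumPy sketch below assumes the Keras kernel layout, in which the last axis indexes output units, and leaves rank-1 gradients such as biases unchanged. The function name `centralize_gradient` is hypothetical.

```python
import numpy as np


def centralize_gradient(grad):
    """Return grad with its mean over all axes except the last removed.

    Assumption: the last axis indexes output units (Keras kernel layout),
    and rank-1 gradients (e.g. biases) are returned unchanged.
    """
    grad = np.asarray(grad, dtype=float)
    if grad.ndim <= 1:
        return grad
    axes = tuple(range(grad.ndim - 1))
    return grad - grad.mean(axis=axes, keepdims=True)


# A dense-layer kernel gradient with 3 inputs and 2 output units.
dense_grad = np.array([[1.0, 2.0],
                       [3.0, 4.0],
                       [5.0, 6.0]])
centralized = centralize_gradient(dense_grad)
print(centralized)  # each column now sums to zero
```

The same function also covers convolution kernels under the stated layout assumption, because the mean is taken over every axis except the last.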

Installation

Run the following to install:

pip install gradient-centralization-tf

Usage

`gctf.centralized_gradients_for_optimizer`

Creates a centralized gradients function for a specified optimizer.

Arguments:

optimizer: a tf.keras.optimizers.Optimizer object. The optimizer you are using.

Example:

>>> import tensorflow as tf
>>> import gctf
>>> opt = tf.keras.optimizers.Adam(learning_rate=0.1)
>>> opt.get_gradients = gctf.centralized_gradients_for_optimizer(opt)
>>> model.compile(optimizer=opt, ...)

`gctf.get_centralized_gradients`

Computes the centralized gradients.

This function is ideally not meant to be used directly unless you are building a custom optimizer, in which case you could point get_gradients to this function. This is a modified version of tf.keras.optimizers.Optimizer.get_gradients.

Arguments:

optimizer: a tf.keras.optimizers.Optimizer object. The optimizer you are using.
loss: Scalar tensor to minimize.
params: List of variables.

Returns:

A gradients tensor.

`gctf.optimizers`

Pre-built optimizers updated to implement GC.

This module is specially built for testing out GC. In most cases you would use gctf.centralized_gradients_for_optimizer directly, although this module is itself built on gctf.centralized_gradients_for_optimizer. You can directly use all the optimizers available in tf.keras.optimizers, updated for GC.

Example:

>>> model.compile(optimizer = gctf.optimizers.adam(learning_rate = 0.01), ...)
>>> model.compile(optimizer = gctf.optimizers.rmsprop(learning_rate = 0.01, rho = 0.91), ...)
>>> model.compile(optimizer = gctf.optimizers.sgd(), ...)

Returns:

A tf.keras.optimizers.Optimizer object.

Developing `gctf`

To install gradient-centralization-tf, along with the tools you need to develop and test it, run the following in your virtualenv:

git clone git@github.com:Rishit-dagli/Gradient-Centralization-TensorFlow
# or clone your own fork

pip install -e .[dev]

License

Copyright 2020 Rishit Dagli

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Comments

On Windows with TensorFlow 2.5 it gives an error

On Windows 10 in a Miniconda environment, TensorFlow 2.5 raises an error in the centralized_gradients.py file.

The solution is to replace `import keras.backend as K` with `import tensorflow.keras.backend as K`.
bug

opened by mgezer 5
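
The import fix suggested in the report above could also be written so that it works with either layout. The following is only a sketch using the standard library: the helper name `import_first_available` is hypothetical, and whether both module paths expose the same backend functions depends on the installed TensorFlow and Keras versions.

```python
import importlib


def import_first_available(module_names):
    """Import and return the first module in module_names that is installed."""
    for name in module_names:
        try:
            return importlib.import_module(name)
        except ImportError:
            continue
    raise ImportError(f"None of these modules could be imported: {module_names}")


# Intended use inside centralized_gradients.py (left commented out so the
# sketch runs even where TensorFlow is not installed):
# K = import_first_available(["tensorflow.keras.backend", "keras.backend"])
```

Listing `tensorflow.keras.backend` first mirrors the fix proposed in the report; the standalone `keras.backend` path is only tried as a fallback.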

The results in the mnist example are wrong/misleading

Describe the bug

The results in your Colab IPython notebook are misleading: https://colab.research.google.com/github/Rishit-dagli/Gradient-Centralization-TensorFlow/blob/main/examples/gctf_mnist.ipynb

In this example, the model is first trained with a normal Adam optimizer:

model.compile(optimizer = tf.keras.optimizers.Adam(),
              loss = 'sparse_categorical_crossentropy',
              metrics = ['accuracy'])

history_no_gctf = model.fit(training_images, training_labels, epochs=5, callbacks = [time_callback_no_gctf])

Afterwards, the same model is recompiled with gctf.optimizers.adam(). However, recompiling a Keras model does not reset its weights. This means the model is trained in the first fit call, and the second fit call with the new optimizer continues from those already-trained weights, so of course the results are better.

This can be fixed by recreating the model for the second run, which only takes a few added lines:

import tensorflow as tf
import gctf

# TimeHistory is the timing callback class used earlier in the notebook.
time_callback_gctf = TimeHistory()

# Rebuild the model so that training starts from freshly initialized weights.
model = tf.keras.models.Sequential([
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(512, activation=tf.nn.relu),
    tf.keras.layers.Dense(256, activation=tf.nn.relu),
    tf.keras.layers.Dense(64, activation=tf.nn.relu),
    tf.keras.layers.Dense(512, activation=tf.nn.relu),
    tf.keras.layers.Dense(256, activation=tf.nn.relu),
    tf.keras.layers.Dense(64, activation=tf.nn.relu),
    tf.keras.layers.Dense(10, activation=tf.nn.softmax),
])

model.compile(optimizer = gctf.optimizers.adam(),
              loss = 'sparse_categorical_crossentropy',
              metrics=['accuracy'])

history_gctf = model.fit(training_images, training_labels, epochs=5, callbacks=[time_callback_gctf])

However, then the results are not better than without gctf:

Type                   Execution time    Accuracy      Loss
-------------------  ----------------  ----------  --------
Model without gctf:           24.7659    0.88825   0.305801
Model with gctf               24.7881    0.889567  0.30812

Could you please clarify what happens here? I tried the gctf.optimizers.adam() optimizer in my own research and it didn't change the results at all, and now I see that it doesn't work in the example constructed here either. This makes me question the results of the paper.

To Reproduce

Execute the Colab file given in the repository: https://colab.research.google.com/github/Rishit-dagli/Gradient-Centralization-TensorFlow/blob/main/examples/gctf_mnist.ipynb

Expected behavior

The right comparison would be for both models to start from a random initialization, rather than letting the second model start from the already pre-trained weights.

Looking forward to a swift explanation.

Best, Max

question

opened by themasterlink 2
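
The fair comparison requested in the report above (both runs starting from the same fresh initialization) can be sketched without TensorFlow. The NumPy example below is a toy setup under stated assumptions, not the notebook's code: it fits a small linear model by plain gradient descent, once with and once without centralized gradients. All names, sizes, and hyperparameters are hypothetical, and the outcome says nothing about how GC performs on real networks.

```python
import numpy as np


def centralize(grad):
    """Remove the per-output-unit mean from a rank-2 gradient."""
    return grad - grad.mean(axis=0, keepdims=True)


def train(use_gc, seed=0, steps=100, lr=0.05):
    """Fit a small linear model by gradient descent; return (w_init, w, losses)."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=(64, 4))
    y = x @ rng.normal(size=(4, 3))
    w_init = rng.normal(size=(4, 3))  # fresh initialization for every run
    w = w_init.copy()
    losses = []
    for _ in range(steps):
        residual = x @ w - y
        losses.append(float(np.mean(residual ** 2)))
        grad = 2.0 * x.T @ residual / residual.size  # gradient of the mean squared error
        if use_gc:
            grad = centralize(grad)
        w = w - lr * grad
    return w_init, w, losses


init_plain, w_plain, loss_plain = train(use_gc=False)
init_gc, w_gc, loss_gc = train(use_gc=True)
print("same starting weights:", np.array_equal(init_plain, init_gc))
print("final loss without GC:", loss_plain[-1])
print("final loss with GC:   ", loss_gc[-1])
```

Because each call to `train` rebuilds its weights from the same seed, neither run inherits progress from the other, which is the property the original notebook comparison lacked.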

Wider dependency requirements

Installing the package currently requires tensorflow ~= 2.4.0 and keras ~= 2.4.0. This is sometimes problematic for people who have custom installations of TensorFlow, so a wider requirement could be set up.
enhancement

opened by Rishit-dagli 1
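
For context on the issue above, `~=` is the "compatible release" specifier: `~= 2.4.0` accepts 2.4.x releases at or above 2.4.0 and rejects 2.5.0. The sketch below is a simplified, standard-library-only illustration of that rule for plain numeric versions. The helper names are hypothetical, and real packaging tools also handle pre-releases and other cases that this ignores.

```python
def parse_release(version):
    """Turn a plain dotted version such as '2.4.0' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def satisfies_compatible_release(candidate, pinned):
    """Simplified check of `candidate ~= pinned` for numeric versions only."""
    pin = parse_release(pinned)
    if len(pin) < 2:
        raise ValueError("~= needs at least two release segments")
    cand = parse_release(candidate)
    # Pad the shorter tuple with zeros so '2.4' compares equal to '2.4.0'.
    width = max(len(pin), len(cand))
    cand += (0,) * (width - len(cand))
    pin_padded = pin + (0,) * (width - len(pin))
    prefix = pin[:-1]  # every segment except the last must match exactly
    return cand >= pin_padded and cand[:len(prefix)] == prefix


for version in ("2.4.0", "2.4.3", "2.5.0", "2.3.9"):
    print(version, satisfies_compatible_release(version, "2.4.0"))
```

Under this rule, widening the requirement means replacing the `~=` pin with a broader range.
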
Release 0.0.3
This release includes some fixes and improvements

✅ Bug Fixes / Improvements

Allow wider versions for TensorFlow and Keras while installing the package (#14)

Fixed incorrect usage example in docstrings and description for centralized_gradients_for_optimizer (#13)

Add clear aims for each of the examples of using gctf (#15)

Updates PyPI classifiers to clearly show the aims of this project. This should not change the way you use this package (#18)

Add clear instructions for using this with custom optimizers, i.e. directly using get_centralized_gradients; however, a complete example has not been pushed, for the reasons mentioned in the issue (#16)
opened by Rishit-dagli 0
Add an "About The Examples" section

Add an "About The Examples" section which contains a summary of the usage example notebooks and links to run it on Binder and Colab.

Close #15

opened by Rishit-dagli 0
Update relevant pypi classifiers
Add PyPI classifiers for:

Development status

Intended Audience

Topic

Additionally, added the `Programming Language :: Python :: 3 :: Only` classifier

Closes #18
opened by Rishit-dagli 0
Update pypi classifiers
I am specifically thinking of adding three more categories of pypi classifiers:

Development status

Intended Audience

Topic

Apart from this, I also think it would be great to add the `Programming Language :: Python :: 3 :: Only` classifier so that the audience knows this package is intended for Python 3 only.
opened by Rishit-dagli 0
Add an "About the examples" section

It would be great to write an "About the example" section which could demonstrate in short what the example notebooks aim to achieve and show.
documentation

opened by Rishit-dagli 0
Error in usage example for gctf.centralized_gradients_for_optimizer

I noticed that the docstrings for gctf.centralized_gradients_for_optimizer have an error in the example usage section. The example creates an Adam optimizer instance and saves it to opt; however, centralized_gradients_for_optimizer is then applied to optimizer, which does not exist, so running the example would result in an error.
documentation

opened by Rishit-dagli 0
[ImgBot] Optimize images

Beep boop. Your images are optimized!

Your image file size has been reduced by 19% 🎉

Details

| File | Before | After | Percent reduction |
|:--|:--|:--|:--|
| /images/gctf.png | 120.77kb | 98.16kb | 18.72% |


opened by imgbot[bot] 0
[ImgBot] Optimize images

Beep boop. Your images are optimized!

Your image file size has been reduced by 19% 🎉

Details

| File | Before | After | Percent reduction |
|:--|:--|:--|:--|
| /images/gctf.png | 105.85kb | 86.11kb | 18.65% |


opened by imgbot[bot] 0

Releases(v0.0.3)

v0.0.3(Mar 11, 2021)
This release includes some fixes and improvements

✅ Bug Fixes / Improvements

Allow wider versions for TensorFlow and Keras while installing the package (#14)

Fixed incorrect usage example in docstrings and description for centralized_gradients_for_optimizer (#13)

Add clear aims for each of the examples of using gctf (#15)

Updates PyPI classifiers to clearly show the aims of this project. This should not change the way you use this package (#18)

Add clear instructions for using this with custom optimizers, i.e. directly using get_centralized_gradients; however, a complete example has not been pushed, for the reasons mentioned in the issue (#16)

v0.0.2(Feb 21, 2021)
This release includes some fixes and improvements

✅ Bug Fixes / Improvements

Fix the issue of supporting multiple modules

Fix multiple typos.

v0.0.1(Feb 20, 2021)
This is the initial version of the Gradient-Centralization-TensorFlow package.

Features:

Implement Gradient Centralization for optimizers using the tf.keras.optimizers.Optimizer base class

Supports custom optimizers

Pre-built optimizers implementing GC for testing purposes.

Thanks to @ialimustufa for his contributions to this package.

Owner

Rishit Dagli

High School, Ted-X, Ted-Ed speaker|Mentor, TFUG Mumbai|International Speaker|Microsoft Student Ambassador|#ExploreML Facilitator

