PyTorch implementations of several learning rate schedulers for deep learning researchers.

Last update: Dec 08, 2022

Overview

pytorch-lr-scheduler

Usage

`WarmupReduceLROnPlateauScheduler`

Example code

import torch

from lr_scheduler.warmup_reduce_lr_on_plateau_scheduler import WarmupReduceLROnPlateauScheduler

if __name__ == '__main__':
    max_epochs, steps_in_epoch = 10, 10000
    warmup_steps = 30000

    model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
    optimizer = torch.optim.Adam(model, 1e-10)

    scheduler = WarmupReduceLROnPlateauScheduler(
        optimizer,
        init_lr=1e-10,
        peak_lr=1e-4,
        warmup_steps=warmup_steps,
        patience=1,
        factor=0.3,
    )

    global_step = 0  # counts optimizer updates across all epochs
    for epoch in range(max_epochs):
        for timestep in range(steps_in_epoch):
            ...  # forward pass, loss.backward(), optimizer.step()
            # Step per batch only while warming up.
            if global_step < warmup_steps:
                scheduler.step()
            global_step += 1

        # validate() is a placeholder for your own validation routine.
        val_loss = validate()
        scheduler.step(val_loss)
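
The loop above mixes two call patterns: a per-batch call without arguments and a per-epoch call with the validation loss. The following is a minimal dry run of that pattern, assuming the per-batch call is meant to stop once `warmup_steps` updates have happened across all epochs. The helper `count_scheduler_calls` is hypothetical and only counts calls; it does not touch the library.

```python
def count_scheduler_calls(max_epochs, steps_in_epoch, warmup_steps):
    """Count how often each call pattern would fire in the training loop."""
    per_batch_calls = 0   # stands in for scheduler.step()
    per_epoch_calls = 0   # stands in for scheduler.step(val_loss)
    global_step = 0       # updates seen so far, across all epochs
    for _ in range(max_epochs):
        for _ in range(steps_in_epoch):
            if global_step < warmup_steps:
                per_batch_calls += 1
            global_step += 1
        per_epoch_calls += 1
    return per_batch_calls, per_epoch_calls


print(count_scheduler_calls(max_epochs=10, steps_in_epoch=10000, warmup_steps=30000))
# (30000, 10)
```

With these numbers the warmup ends after the third epoch, so only the per-epoch call remains for the last seven epochs.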

`TransformerLRScheduler`

Example code

import torch

from lr_scheduler.transformer_lr_scheduler import TransformerLRScheduler

if __name__ == '__main__':
    max_epochs, steps_in_epoch = 10, 10000

    model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
    optimizer = torch.optim.Adam(model, 1e-10)

    scheduler = TransformerLRScheduler(
        optimizer=optimizer, 
        init_lr=1e-10, 
        peak_lr=0.1,
        final_lr=1e-4, 
        final_lr_scale=0.05,
        warmup_steps=3000, 
        decay_steps=17000,
    )

    for epoch in range(max_epochs):
        for timestep in range(steps_in_epoch):
            ...
            ...
            scheduler.step()

`TriStageLRScheduler`

Example code

import torch

from lr_scheduler.tri_stage_lr_scheduler import TriStageLRScheduler

if __name__ == '__main__':
    max_epochs, steps_in_epoch = 10, 10000

    model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
    optimizer = torch.optim.Adam(model, 1e-10)

    scheduler = TriStageLRScheduler(
        optimizer, 
        init_lr=1e-10, 
        peak_lr=1e-4, 
        final_lr=1e-7, 
        init_lr_scale=0.01, 
        final_lr_scale=0.05,
        warmup_steps=30000, 
        hold_steps=70000, 
        decay_steps=100000,
        total_steps=200000,
    )

    for epoch in range(max_epochs):
        for timestep in range(steps_in_epoch):
            ...
            ...
            scheduler.step()
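
To build intuition for the three stages, here is a minimal pure-Python sketch of a tri-stage schedule: linear warmup, a hold at the peak, then exponential decay down to the final rate. This is an assumption about the general shape, not the library's code; the function `tri_stage_lr` is hypothetical, and it ignores `init_lr_scale`, `final_lr_scale`, and `total_steps`.

```python
import math


def tri_stage_lr(step, init_lr, peak_lr, final_lr,
                 warmup_steps, hold_steps, decay_steps):
    """Return the learning rate at a given step (illustrative only)."""
    # Stage 1: linear warmup from init_lr to peak_lr.
    if step < warmup_steps:
        return init_lr + (peak_lr - init_lr) * step / warmup_steps
    step -= warmup_steps
    # Stage 2: hold at peak_lr.
    if step < hold_steps:
        return peak_lr
    step -= hold_steps
    # Stage 3: exponential decay from peak_lr towards final_lr.
    if step < decay_steps:
        decay_rate = math.log(final_lr / peak_lr) / decay_steps
        return peak_lr * math.exp(decay_rate * step)
    # After the decay stage, stay at final_lr.
    return final_lr


cfg = dict(init_lr=1e-10, peak_lr=1e-4, final_lr=1e-7,
           warmup_steps=30000, hold_steps=70000, decay_steps=100000)
for step in (0, 15000, 30000, 100000, 150000, 200000):
    print(step, tri_stage_lr(step, **cfg))
```

Note that with the values from the example above, the three stages span 200000 steps, while the training loop runs for `max_epochs * steps_in_epoch = 100000` steps, so the loop as written would end just as the decay stage begins.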

`ReduceLROnPlateauScheduler`

Example code

import torch

from lr_scheduler.reduce_lr_on_plateau_lr_scheduler import ReduceLROnPlateauScheduler

if __name__ == '__main__':
    max_epochs, steps_in_epoch = 10, 10000

    model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
    optimizer = torch.optim.Adam(model, 1e-4)

    scheduler = ReduceLROnPlateauScheduler(optimizer, patience=1, factor=0.3)

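    for epoch in range(max_epochs):
        for timestep in range(steps_in_epoch):
            ...  # forward pass, loss.backward(), optimizer.step()

        # validate() is a placeholder for your own validation routine.
        val_loss = validate()
        scheduler.step(val_loss)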
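
The roles of `patience` and `factor` can be illustrated with a small stand-alone sketch. It assumes the common reduce-on-plateau rule: when the validation loss has failed to improve for more than `patience` consecutive epochs, the learning rate is multiplied by `factor`. The class `PlateauTracker` is hypothetical and may differ in detail from the library's scheduler.

```python
class PlateauTracker:
    """Track validation loss and shrink the learning rate on a plateau."""

    def __init__(self, lr, patience=1, factor=0.3):
        self.lr = lr
        self.patience = patience
        self.factor = factor
        self.best_loss = float("inf")
        self.bad_epochs = 0  # consecutive epochs without improvement

    def step(self, val_loss):
        if val_loss < self.best_loss:
            self.best_loss = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        # Reduce only after more than `patience` bad epochs in a row.
        if self.bad_epochs > self.patience:
            self.lr *= self.factor
            self.bad_epochs = 0
        return self.lr


tracker = PlateauTracker(lr=1e-4, patience=1, factor=0.3)
history = [tracker.step(loss) for loss in (1.0, 0.9, 0.95, 0.97, 0.8)]
print(history)
```

In this run the loss stalls at the third and fourth epochs, so the rate is reduced once, at the fourth epoch, and stays there when the loss improves again.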

`WarmupLRScheduler`

Example code

import torch

from lr_scheduler.warmup_lr_scheduler import WarmupLRScheduler

if __name__ == '__main__':
    max_epochs, steps_in_epoch = 10, 10000

    model = [torch.nn.Parameter(torch.randn(2, 2, requires_grad=True))]
    optimizer = torch.optim.Adam(model, 1e-10)

    scheduler = WarmupLRScheduler(
        optimizer, 
        init_lr=1e-10, 
        peak_lr=1e-4, 
        warmup_steps=4000,
    )

    for epoch in range(max_epochs):
        for timestep in range(steps_in_epoch):
            ...
            ...
            scheduler.step()
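
For a quick sanity check of the parameters, the warmup can be tabulated without PyTorch. The sketch below assumes the warmup is linear from `init_lr` to `peak_lr` and that the rate stays at `peak_lr` afterwards; `linear_warmup_lr` is a hypothetical helper, not part of the library.

```python
def linear_warmup_lr(step, init_lr, peak_lr, warmup_steps):
    """Learning rate after `step` updates under a linear warmup (illustrative)."""
    if warmup_steps <= 0 or step >= warmup_steps:
        return peak_lr
    return init_lr + (peak_lr - init_lr) * step / warmup_steps


# Tabulate the rate at a few steps for the values used in the example above.
for step in (0, 1000, 2000, 4000, 8000):
    lr = linear_warmup_lr(step, init_lr=1e-10, peak_lr=1e-4, warmup_steps=4000)
    print(f"step {step:>5}: lr = {lr:.3e}")
```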

Troubleshooting and Contributing

If you have any questions, bug reports, or feature requests, please open an issue on GitHub.

I appreciate any kind of feedback or contribution. Feel free to go ahead with small changes such as bug fixes and documentation improvements. For major contributions and new features, please discuss them with the collaborators in the corresponding issues first.

Code Style

I follow PEP 8 for code style. The docstring style is especially important, because the documentation is generated from it.

License

This project is licensed under the MIT License. See the LICENSE.md file for details.

Owner

Soohwan Kim
