Implementation of Online Label Smoothing in PyTorch

Overview

PyTorch implementation of Online Label Smoothing (OLS) presented in Delving Deep into Label Smoothing.

Introduction

As the abstract states, OLS is a strategy that generates soft labels based on the statistics of the model's predictions for the target category. The core idea is that instead of using fixed soft labels for every epoch, the soft labels are updated after each epoch based on the statistics of correctly predicted samples.

More details and experiment results can be found in the paper.
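To make the idea concrete, the sketch below shows one way such statistics could be accumulated: for every correctly classified sample, its softmax output is added to a running accumulator for its target class, and at the end of the epoch the accumulator is normalized into the soft-label matrix used in the next epoch. This is an illustrative sketch of the idea described above, not the repository's exact implementation; the names accumulate and build_soft_labels are hypothetical.

import torch
import torch.nn.functional as F

n_classes = 10
# Running sum of predictions of correctly classified samples, one row per target class
stats = torch.zeros(n_classes, n_classes)

def accumulate(logits, targets):
    """Add the softmax output of every correctly predicted sample to its target-class row."""
    probs = F.softmax(logits, dim=1)
    correct = probs.argmax(dim=1) == targets
    for p, t in zip(probs[correct], targets[correct]):
        stats[t] += p

def build_soft_labels():
    """Normalize the accumulated statistics into one soft-label distribution per class."""
    totals = stats.sum(dim=1, keepdim=True).clamp(min=1e-12)
    return stats / totals  # row i = soft label for ground-truth class i in the next epoch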

Usage

Usage of OnlineLabelSmoothing is straightforward. Use it as you would use PyTorch's CrossEntropyLoss. The only difference is that at the end of each epoch you should call OnlineLabelSmoothing.next_epoch(). It updates the OnlineLabelSmoothing.supervise matrix that will be used for the soft labels in the next epoch.

Standalone

from ols import OnlineLabelSmoothing
import torch

k = 4  # Number of classes
b = 32  # Batch size
criterion = OnlineLabelSmoothing(alpha=0.5, n_classes=k, smoothing=0.1)
logits = torch.randn(b, k)  # Predictions
y = torch.randint(k, (b,))  # Ground truth

loss = criterion(logits, y)
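
The loss returned above is an ordinary scalar tensor, so in a real training loop it can be backpropagated and followed by the epoch-end update. A minimal sketch continuing the snippet above (note: the random logits only receive gradients here because requires_grad=True is set; in practice they would come from a model):

logits = torch.randn(b, k, requires_grad=True)  # stand-in for model outputs
loss = criterion(logits, y)
loss.backward()          # gradients flow through the OLS loss like any other loss
criterion.next_epoch()   # call once per epoch to refresh the soft-label statistics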

PyTorch

from ols import OnlineLabelSmoothing

criterion = OnlineLabelSmoothing(alpha=..., n_classes=...)
for epoch in range(...):  # loop over the dataset multiple times
    for i, data in enumerate(...):
        inputs, labels = data
        # zero the parameter gradients
        optimizer.zero_grad()
        # forward + backward + optimize
        outputs = net(inputs)
        loss = criterion(outputs, labels)
        loss.backward()
        optimizer.step()
    print(f'Epoch {epoch} finished!')
    # Update the soft labels for next epoch
    criterion.next_epoch()

PyTorch Lightning

With PyTorch Lightning you can simply call next_epoch() at the end of the epoch:

import pytorch_lightning as pl
from ols import OnlineLabelSmoothing


class LitClassification(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.criterion = OnlineLabelSmoothing(alpha=..., n_classes=...)

    def forward(self, x):
        pass

    def configure_optimizers(self):
        pass

    def training_step(self, train_batch, batch_idx):
        pass

    def on_train_epoch_end(self, **kwargs):
        self.criterion.next_epoch()

Installation

pip install -r requirements.txt

Citation

@misc{zhang2020delving,
      title={Delving Deep into Label Smoothing}, 
      author={Chang-Bin Zhang and Peng-Tao Jiang and Qibin Hou and Yunchao Wei and Qi Han and Zhen Li and Ming-Ming Cheng},
      year={2020},
      eprint={2011.12562},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}