iftopt

An Implicit Function Theorem (IFT) optimizer for bi-level optimizations.

Requirements

Python 3.7+
PyTorch 1.x

Installation

$ pip install git+https://github.com/money-shredder/iftopt.git

Usage

Assuming a bi-level optimization of the form:

y* = argmin_{y} val_loss(x*, y), where x* = argmin_{x} train_loss(x, y).
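For reference, the hyper-gradient that the implicit function theorem yields for this problem can be sketched as below. This is the standard textbook form, not a statement about how iftopt computes it internally. It assumes train_loss is twice differentiable and that its Hessian w.r.t. x is invertible at x*. The symbols L_T and L_V are shorthand introduced here for train_loss and val_loss.

```latex
\frac{d L_V}{d y}
  = \frac{\partial L_V}{\partial y}
  + \frac{\partial L_V}{\partial x}\,\frac{d x^*}{d y},
\qquad
\frac{d x^*}{d y}
  = -\left(\frac{\partial^2 L_T}{\partial x^2}\right)^{-1}
     \frac{\partial^2 L_T}{\partial x\,\partial y},
```

with all derivatives evaluated at (x*(y), y). The second identity follows from differentiating the stationarity condition ∂L_T/∂x = 0 with respect to y.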

To solve for the optimal x* and y* in the optimization problem, we can implement the following with iftopt:

import torch
from iftopt import HyperOptimizer
train_lr = val_lr = 0.1
# parameter to minimize the training loss
x = torch.nn.Parameter(...)
# hyper-parameter to minimize the validation loss
y = torch.nn.Parameter(...)
# training loss optimizer
opt = torch.optim.SGD([x], lr=train_lr)
# validation loss optimizer
hopt = HyperOptimizer(
    [y], torch.optim.SGD([y], lr=val_lr), vih_lr=0.1, vih_iterations=5)
# outer optimization loop for y
for _ in range(...):
    # inner optimization loop for x
    for _ in range(...):
        z = train_loss(x, y)
        # inner optimization step for x
        opt.zero_grad()
        z.backward()
        opt.step()
    # outer optimization step for y
    hopt.set_train_parameters([x])
    z = train_loss(x, y)
    hopt.train_step(z)
    v = val_loss(x, y)
    hopt.val_step(v)
    hopt.grad()
    hopt.step()

For a concrete simple example, please check out and run demo.py, where

train_loss = lambda x, y: (x + y) ** 2
val_loss = lambda x, y: x ** 2

with x = y = 1.0 initially. Running it generates a video, demo.mp4, showing the optimization trajectory. Note that although the validation loss has no direct gradient w.r.t. the hyper-parameter y, iftopt can still minimize the validation loss by computing the hyper-gradient via the implicit function theorem.
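To make the point about the missing direct gradient concrete, here is a minimal pure-Python sketch of the same demo problem. It does not use iftopt or PyTorch: the derivatives are worked out by hand for these two scalar losses, and the function names (`train_grad_x`, `hyper_gradient`, `solve`) and the step counts are hypothetical choices made for illustration only.

```python
# Hand-derived derivatives for the demo losses (scalar x and y assumed):
#   train_loss(x, y) = (x + y) ** 2
#   val_loss(x, y)   = x ** 2

def train_grad_x(x, y):
    """d train_loss / dx."""
    return 2.0 * (x + y)

TRAIN_HESS_XX = 2.0  # d^2 train_loss / dx^2
TRAIN_HESS_XY = 2.0  # d^2 train_loss / dx dy

def val_grad_x(x, y):
    """d val_loss / dx."""
    return 2.0 * x

def val_grad_y(x, y):
    """d val_loss / dy: zero, since val_loss does not depend on y directly."""
    return 0.0

def hyper_gradient(x, y):
    """Total derivative of val_loss w.r.t. y via the implicit function theorem.

    Only meaningful when x is (close to) the inner optimum x*(y).
    """
    dx_dy = -TRAIN_HESS_XY / TRAIN_HESS_XX  # dx*/dy = -1 for this problem
    return val_grad_y(x, y) + val_grad_x(x, y) * dx_dy

def solve(x=1.0, y=1.0, train_lr=0.1, val_lr=0.1,
          outer_steps=100, inner_steps=20):
    """Alternate inner gradient descent on x with hyper-gradient steps on y."""
    for _ in range(outer_steps):
        # inner loop: drive x towards x*(y) = -y
        for _ in range(inner_steps):
            x -= train_lr * train_grad_x(x, y)
        # outer step: update y with the IFT hyper-gradient
        y -= val_lr * hyper_gradient(x, y)
    return x, y
```

At the inner optimum x* = -y the validation loss equals y ** 2, so the hyper-gradient should be 2 * y. The sketch agrees: `hyper_gradient(-1.0, 1.0)` evaluates to 2.0, and `solve()` drives both x and y towards 0.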

Owner

The Money Shredder Lab
