This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Last update: Jul 14, 2022

Related tags

Deep Learning L2ight

Overview

L2ight

By Jiaqi Gu, Hanqing Zhu, Chenghao Feng, Zixuan Jiang, Ray T. Chen and David Z. Pan.

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Introduction

L2ight is a closed-loop ONN on-chip learning framework to enable scalable ONN mapping and efficient in-situ learning. L2ight adopts a three-stage learning flow that first calibrates the complicated photonic circuit states under challenging physical constraints, then performs photonic core mapping via combined analytical solving and zeroth-order optimization. A subspace learning procedure with multi-level sparsity is integrated into L2ight to enable in-situ gradient evaluation and fast adaptation, unleashing the power of optics for real on-chip intelligence. L2ight outperforms prior ONN training protocols with 3-order-of-magnitude higher scalability and over 30X better efficiency, when benchmarked on various models and learning tasks. This synergistic framework is the first scalable on-chip learning solution that pushes this emerging field from intractable to scalable and further to efficient for next-generation self-learnable photonic neural chips.

Dependencies

Python >= 3.6
pyutils >= 0.0.1. See pyutils for installation.
pytorch-onn >= 0.0.1. See pytorch-onn for installation.
Python libraries listed in requirements.txt
NVIDIA GPUs and CUDA >= 10.2

Structures

core/
- models/
  - layers/
    - custom_conv2d and custom_linear layers
    - utils.py: sampler and profiler
  - sparse_bp_*.py: model definition
  - sparse_bp_base.py: base model definition; identity calibration and mapping codes.
- optimizer/: mixedtrain and flops optimizers
- builder.py: build training utilities
script/: contains experiment scripts
train_pretrain.py, train_map.py, train_learn.py, train_zo_learn.py: training logic
compare_gradient.py: compare approximated gradients with true gradients for ablation

Usage

Pretrain model.
> python3 train_pretrain.py config/cifar10/vgg8/pretrain.yml
Identity calibration and parallel mapping. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/pm/pm.yml and run
> python3 train_map.py CONFIG --checkpoint.restore_checkpoint=path/to/your/pretrained/checkpoint
Subspace learning with multi-level sampling. Please set your hyperparameters in CONFIG=config/cifar10/vgg8/ds/learn.yml and run
> python3 train_learn.py CONFIG --checkpoint.restore_chekcpoint=path/to/your/mapped/checkpoint --checkpoint.resume=1
All scripts for experiments are in ./script. For example, to run subspace learning with feedback sampling, column sampling, and data sampling, you can write proper task setting in SCRIPT=script/vgg8/train_ds_script.py and run
> python3 SCRIPT
Comparison experiments with RAD [ICLR 2021] and SWAT-U [NeurIPS 2020]. Run with the SCRIPT=script/vgg8/train_rad_script.py and script/vgg8/train_swat_script.py,
> python3 SCRIPT
Comparison with FLOPS [DAC 2020] and MixedTrn [AAAI 2021]. Run with the METHOD=mixedtrain or flops,
> python3 train_zo_learn.py config/mnist/cnn3/METHOD/learn.yml

Citing L2ight

@inproceedings{gu2021L2ight,
  title={L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization},
  author={Jiaqi Gu and Hanqing Zhu and Chenghao Feng and Zixuan Jiang and Ray T. Chen and David Z. Pan},
  journal={Conference on Neural Information Processing Systems (NeurIPS)},
  year={2021}
}

This repo is the official implementation of "L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization".

Related tags

Overview

L2ight

Introduction

Dependencies

Structures

Usage

Citing L2ight

Owner

Jiaqi Gu

Official implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

Music Generation using Neural Networks Streamlit App

Retrieval.pytorch - The code we used in [2020 DIGIX]

[CVPR 2022 Oral] MixFormer: End-to-End Tracking with Iterative Mixed Attention

This repository contains the code used to quantitatively evaluate counterfactual examples in the associated paper.

An Open Source Machine Learning Framework for Everyone

GluonMM is a library of transformer models for computer vision and multi-modality research

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

FEMDA: Robust classification with Flexible Discriminant Analysis in heterogeneous data

Universal Probability Distributions with Optimal Transport and Convex Optimization

A Quick and Dirty Progressive Neural Network written in TensorFlow.

Torch implementation of various types of GAN (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN, LSGAN)

Dieser Scanner findet Websites, die nicht direkt in Suchmaschinen auftauchen, aber trotzdem erreichbar sind.

Sleep staging from ECG, assisted with EEG

Code for "Learning Graph Cellular Automata"

This is the official Pytorch implementation of the paper "Diverse Motion Stylization for Multiple Style Domains via Spatial-Temporal Graph-Based Generative Model"

3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A - Continual Learning Classification Challenge

A Home Assistant custom component for Lobe. Lobe is an AI tool that can classify images.

Detectorch - detectron for PyTorch

A vision library for performing sliced inference on large images/small objects