Torchrecipes provides a set of reproduci-able, re-usable, ready-to-run RECIPES for training different types of models, across multiple domains, on PyTorch Lightning.

Last update: Dec 28, 2022

Related tags

Text Data & NLP recipes

Overview

torchrecipes

This library is currently under heavy development - if you have suggestions on the API or use-cases you'd like to be covered, please open an github issue or reach out. We'd love to hear about how you're using torchrecipes.

torchrecipes is a prototype is built on top of PyTORCH and provides a set of reproduci-able, re-usable, ready-to-run RECIPES for training different types of models, across multiple domains, on PyTorch Lightning.

It aims to provide reproduci-able "applications" built on top of PyTorch with good performance and easy reproduciability. Because this project builds on the pytorch ecosystem and requires significant investment, we'd love to hear from and work with early adopters to shape the design. Please reach out on the issue tracker if you're interested in using this for your project.

Why `torchrecipes`?

The primary goal of the torchrecipes is to 10x ML development by providing standard blueprints to easily train production-ready ML models across environemnts (from local development to cluster deployment).

Requirements

PyTorch Recipes (torchrecipes):

python3 (3.8+)
torch

Running

The easiest way to run torchrecipes is to use torchx. You can install it directly (if not already included as part of our requirements.txt) with:

pip install torchx

Then go to torchrecipes/launcher/ and create a file torchx_app.py:

specs.AppDef: return specs.AppDef( name="run", roles=[ specs.Role( name="run", image=image, entrypoint="python", args=[*image_classification_args, *job_args], env={ "CONFIG_MODULE": "torchrecipes.vision.image_classification.conf", "MODE": "prod", "HYDRA_FULL_ERROR": "1", } ) ], ) ">

# 'torchrecipes/launcher/torchx_app.py'

import torchx.specs as specs

image_classification_args = [
    "-m", "run",
    "--config-name",
    "train_app",
    "--config-path",
    "torchrecipes/vision/image_classification/conf",
]

def torchx_app(image: str = "run.py:latest", *job_args: str) -> specs.AppDef:
    return specs.AppDef(
        name="run",
        roles=[
            specs.Role(
                name="run",
                image=image,
                entrypoint="python",
                args=[*image_classification_args, *job_args],
                env={
                    "CONFIG_MODULE": "torchrecipes.vision.image_classification.conf",
                    "MODE": "prod",
                    "HYDRA_FULL_ERROR": "1",
                }
            )
        ],
    )

This app defines the entrypoint, args and image for launching.

Now that we have created a torchx app, we are (almost) ready for launching a job!

Firstly, create a symlink for launcher/run.py at the top level of the repo:

ln -s torchrecipes/launcher/run.py ./run.py

Then we are ready-to-go! Simply launch the image_classification recipe with the following command:

torchx run --scheduler local_cwd torchrecipes/launcher/torchx_app.py:torchx_app trainer.fast_dev_run=True trainer.checkpoint_callback=False +tb_save_dir=/tmp/

Release

# install torchrecipes
pip install torchrecipes

Contributing

We welcome PRs! See the CONTRIBUTING file.

License

torchrecipes is BSD licensed, as found in the LICENSE file.

Torchrecipes provides a set of reproduci-able, re-usable, ready-to-run RECIPES for training different types of models, across multiple domains, on PyTorch Lightning.

Related tags

Overview

torchrecipes

Why `torchrecipes`?

Requirements

Running

Release

Contributing

License

Owner

Meta Research

jiant is an NLP toolkit

PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

null

source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.

Code for Text Prior Guided Scene Text Image Super-Resolution

ACL'2021: Learning Dense Representations of Phrases at Scale

An algorithm that can solve the word puzzle Wordle with an optimal number of guesses on HARD mode.

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

Flexible interface for high-performance research using SOTA Transformers leveraging Pytorch Lightning, Transformers, and Hydra.

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

This is a project of data parallel that running on NLP tasks.

A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, and deploy transformer specific models.

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

Addon for adding subtitle files to blender VSE as Text sequences. Using pysub2 python module.

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

The guide to tackle with the Text Summarization

Plugin repository for Macast

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

DeLighT: Very Deep and Light-Weight Transformers

Torchrecipes provides a set of reproduci-able, re-usable, ready-to-run RECIPES for training different types of models, across multiple domains, on PyTorch Lightning.

Related tags

Overview

torchrecipes

Why torchrecipes?

Requirements

Running

Release

Contributing

License

Owner

Meta Research

jiant is an NLP toolkit

PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

null

source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.

Code for Text Prior Guided Scene Text Image Super-Resolution

ACL'2021: Learning Dense Representations of Phrases at Scale

An algorithm that can solve the word puzzle Wordle with an optimal number of guesses on HARD mode.

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

Flexible interface for high-performance research using SOTA Transformers leveraging Pytorch Lightning, Transformers, and Hydra.

A modular Karton Framework service that unpacks common packers like UPX and others using the Qiling Framework.

This is a project of data parallel that running on NLP tasks.

A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, and deploy transformer specific models.

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

Addon for adding subtitle files to blender VSE as Text sequences. Using pysub2 python module.

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata

The guide to tackle with the Text Summarization

Plugin repository for Macast

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

DeLighT: Very Deep and Light-Weight Transformers

Why `torchrecipes`?