Intent parsing and slot filling in PyTorch with seq2seq + attention

Last update: Apr 04, 2022

Overview

PyTorch Seq2Seq Intent Parsing

Reframing intent parsing as a human-to-machine translation task. Work-in-progress successor to torch-seq2seq-intent-parsing.

The command language

This is a simple command language developed for the "home assistant" Maia living in my apartment. She's designed as a collection of microservices, including services for lights (Hue), switches (WeMo), and information such as weather and market prices.

A command consists of a "service", a "method", and some number of arguments.

lights setState office_light on
switches getState teapot
weather getWeather "San Francisco"
price getPrice TSLA

These can be represented with variable placeholders:

lights setState $device $state
switches getState $device
weather getWeather $location
price getPrice $symbol
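
As a rough illustration of this structure, the sketch below fills a placeholder template and splits a command into its service, method, and arguments. It is not code from this project: the names TEMPLATES and split_command are hypothetical, and it assumes that Python's string.Template (which happens to use the same $name syntax) and shlex quoting rules are an acceptable stand-in for the command language.

```python
import shlex
from string import Template

# Command templates copied from the list above; $name marks a variable slot.
# The dictionary and its keys are illustrative, not part of the project.
TEMPLATES = {
    "setState": Template("lights setState $device $state"),
    "getWeather": Template("weather getWeather $location"),
}


def split_command(command):
    """Split a command into (service, method, args), keeping quoted args whole."""
    service, method, *args = shlex.split(command)
    return service, method, args


filled = TEMPLATES["setState"].substitute(device="office_light", state="on")
print(filled)
# lights setState office_light on
print(split_command(filled))
# ('lights', 'setState', ['office_light', 'on'])
print(split_command('weather getWeather "San Francisco"'))
# ('weather', 'getWeather', ['San Francisco'])
```

Using shlex.split rather than str.split keeps a quoted argument such as "San Francisco" together as a single value.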

We can imagine a bunch of human sentences that would map to a single command:

"Turn the office light on."
"Please turn on the light in the office."
"Maia could you set the office light on, thank you."

These sentences could similarly be represented with placeholders.
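
One way to picture the placeholder form of these sentences is as templates that expand into (sentence, command) training pairs. The sketch below is an assumption about how such pairs could be generated, not the project's actual data pipeline: the sentence templates are paraphrased from the examples above, and SENTENCES, DEVICES, STATES, make_pairs, and the "off" state are hypothetical.

```python
from itertools import product
from string import Template

# Sentence templates with placeholders, loosely based on the examples above.
SENTENCES = [
    Template("Turn the $device $state."),
    Template("Please turn $state the $device."),
    Template("Maia could you set the $device $state, thank you."),
]
COMMAND = Template("lights setState $device_id $state")

# Spoken device name -> identifier used in the command (illustrative values).
DEVICES = {"office light": "office_light"}
STATES = ["on", "off"]  # "off" is assumed; only "on" appears in the examples


def make_pairs():
    """Expand every sentence template over every device and state."""
    pairs = []
    for sentence, (spoken, device_id), state in product(
        SENTENCES, DEVICES.items(), STATES
    ):
        source = sentence.substitute(device=spoken, state=state)
        target = COMMAND.substitute(device_id=device_id, state=state)
        pairs.append((source, target))
    return pairs


for source, target in make_pairs():
    print(f"{source!r} -> {target!r}")
```

With three sentence templates, one device, and two states this yields six pairs, all mapping to variations of a single setState command.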

TODO: Specific vs. freeform variables

A shortcoming of the approach so far is that the model has to learn translations of specific values, for example mapping each device name to its equivalent device_name. If we added a "basement light", the model would have no basement_light in its output vocabulary unless it were retrained.

The bigger the potential input space, the more obvious the problem. Consider the getWeather command, where the model would need to be trained on every possible location we might ask about. Worse yet, consider a playMusic command that could take any song or artist name.

This can be solved with a technique that I have implemented in Torch here. The training pairs have "variable placeholders" in the output translation, which the model generates during an initial pass. The network then fills in the values of these placeholders with an additional pass over the input.
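
The data flow of the two passes can be sketched without any network. In the example below, the outputs of both passes are hand-written stand-ins: the template is what the first pass would be expected to generate, and the per-token labels are an assumed representation of what the second pass might produce. The function fill_placeholders and the label format are hypothetical and are not taken from the Torch implementation.

```python
from string import Template


def fill_placeholders(template, tokens, labels):
    """Fill $slots in `template` from input tokens tagged with slot names.

    labels[i] is the slot name for tokens[i], or None if the token belongs
    to no slot. Tokens sharing a slot are joined with spaces, and
    multi-word values are wrapped in double quotes.
    """
    collected = {}
    for token, label in zip(tokens, labels):
        if label is not None:
            collected.setdefault(label, []).append(token)

    values = {}
    for name, parts in collected.items():
        value = " ".join(parts)
        values[name] = f'"{value}"' if " " in value else value

    # Raises KeyError if the template has a slot with no tagged tokens.
    return Template(template).substitute(values)


# Stand-in for the first pass: a command with placeholders.
template = "weather getWeather $location"
# Stand-in for the second pass: each input token tagged with a slot or None.
tokens = ["what", "is", "the", "weather", "in", "San", "Francisco"]
labels = [None, None, None, None, None, "location", "location"]

print(fill_placeholders(template, tokens, labels))
# weather getWeather "San Francisco"
```

Because the values are copied from the input rather than generated from the output vocabulary, a new location or device name would not require retraining in this scheme.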

Owner

Sean Robertson

