Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

Last update: Dec 13, 2022

Related tags

Text Data & NLP Diaformer

Overview

Diaformer

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation (AAAI 2022)

Diaformer is an efficient model for automatic diagnosis via symptoms sequence generation. It takes the sequence of symptoms as input, and predicts the inquiry symptoms in the way of sequence generation.

Figure 1: Illustration of symptom attention framework.

Requirements

Our experiments are conducted on Python 3.8 and Pytorch == 1.8.0. The main requirements are:

transformers==2.1.1
torch
numpy
tqdm
sklearn
keras
boto3

In the root directory, run following command to install the required libraries.

pip install -r requirement.txt

Usage

Download data

Download the datasets, then decompress them and put them in the corrsponding documents in \data. For example, put the data of Synthetic Dataset under data/synthetic_dataset.

The dataset can be downloaded as following links:
Build data

Switch to the corresponding directory of the dataset and just run preprocess.py to preprocess data and generate a vocabulary of symptoms.

Train and test

Train and test models by the follow commands.

Diaformer

# Train and test on Diaformer
# Run on MuZhi dataset
python Diaformer.py --dataset_path data/muzhi_dataset --batch_size 16 --lr 5e-5 --min_probability 0.009 --max_turn 20 --start_test 10 

# Run on Dxy dataset
python Diaformer.py --dataset_path data/dxy_dataset --batch_size 16 --lr 5e-5 --min_probability 0.012 --max_turn 20 --start_test 10 

# Run on Synthetic dataset
python Diaformer.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10

Diaformer_GPT2

# Train and test on GPT2 variant of Diaformer
python GPT2_variant.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10

Diaformer_UniLM

# Train and test on UniLM variant of Diaformer
python UniLM_variant.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10

Ablation study

# run ablation study
# w/o Sequence Shuffle
python Diaformer.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10 --no_sequence_shuffle

# w/o Synchronous Learning
python Diaformer.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10 --no_synchronous_learning

# w/o Repeated Sequence
python Diaformer.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10 --no_repeated_sequence

Generative inference

# save the model
python Diaformer.py --dataset_path data/synthetic_dataset --batch_size 16 --lr 5e-5 --min_probability 0.01 --max_turn 20 --start_test 10 --model_output_path models
# use the trained model to output the results
python predict.py --dataset_path data/synthetic_dataset --min_probability 0.01 --max_turn 20 --pretrained_model models/ --result_output_path results.json

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

Related tags

Overview

Diaformer

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation (AAAI 2022)

Diaformer is an efficient model for automatic diagnosis via symptoms sequence generation. It takes the sequence of symptoms as input, and predicts the inquiry symptoms in the way of sequence generation.

Requirements

Usage

Owner

Junying Chen

Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Code for Editing Factual Knowledge in Language Models

Azure Text-to-speech service for Home Assistant

NSFW A chatbot based on GPT2-chitchat

Toy example of an applied ML pipeline for me to experiment with MLOps tools.

VMD Audio/Text control with natural language

Turkish Stop Words Türkçe Dolgu Sözcükleri

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

NLP topic mdel LDA - Gathered from New York Times website

CDLA: A Chinese document layout analysis (CDLA) dataset

내부 작업용 django + vue(vuetify) boilerplate. 짠 하면 돌아감.

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

This repo contains simple to use, pretrained/training-less models for speaker diarization.

This is a project of data parallel that running on NLP tasks.

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Big Bird: Transformers for Longer Sequences

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

Related tags

Overview

Diaformer

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation (AAAI 2022)

Diaformer is an efficient model for automatic diagnosis via symptoms sequence generation. It takes the sequence of symptoms as input, and predicts the inquiry symptoms in the way of sequence generation.

Requirements

Usage

Owner

Junying Chen

Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Code for Editing Factual Knowledge in Language Models

Azure Text-to-speech service for Home Assistant

**NSFW** A chatbot based on GPT2-chitchat

Toy example of an applied ML pipeline for me to experiment with MLOps tools.

VMD Audio/Text control with natural language

Turkish Stop Words Türkçe Dolgu Sözcükleri

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

NLP topic mdel LDA - Gathered from New York Times website

CDLA: A Chinese document layout analysis (CDLA) dataset

내부 작업용 django + vue(vuetify) boilerplate. 짠 하면 돌아감.

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

This repo contains simple to use, pretrained/training-less models for speaker diarization.

This is a project of data parallel that running on NLP tasks.

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy

Big Bird: Transformers for Longer Sequences

NSFW A chatbot based on GPT2-chitchat