Official implementation of the paper "Knowledge Bridging for Empathetic Dialogue Generation" (AAAI 2021).

Last update: Dec 20, 2022

Overview

Knowledge Bridging for Empathetic Dialogue Generation

Model Architecture

Setup

Review the required packages in requirements.txt, or simply install them all with:

pip install -r requirements.txt

Download the GloVe vectors (glove.6B.300d.txt) and put the file into /data/.
For the other data sources (ConceptNet and the NRC_VAD lexicon), visit Google Drive and place the processed dataset kemp_dataset_preproc.json into /data/.
For reproducibility, the model checkpoints are available on Google Drive. Download a checkpoint and move it under /result/[MODELNAME]/result/, e.g., /result/KEMP/result/KEMP_best.tar.
To skip training, check the folder /result/[MODELNAME]/predicition/.
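Before training or testing, it can help to confirm that the downloaded files ended up where the steps above expect them. The following is a minimal sketch, assuming the paths are relative to the repository root and that [MODELNAME] is KEMP as in the example; the helper name missing_files is hypothetical and not part of this repository.

```python
from pathlib import Path

# Paths taken from the setup steps; [MODELNAME] is filled in with KEMP,
# matching the example checkpoint path. Adjust for other variants.
EXPECTED_FILES = [
    "data/glove.6B.300d.txt",
    "data/kemp_dataset_preproc.json",
    "result/KEMP/result/KEMP_best.tar",
]


def missing_files(root, expected=EXPECTED_FILES):
    """Return the expected relative paths that are not regular files under root."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_file()]
```

Calling missing_files(".") from the repository root returns an empty list once everything is in place.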

Data preprocessing

The dataset (EmpatheticDialogues) is preprocessed and stored under data/ in pickle format by running:

python preprocess.py

Training

KEMP (Ours)

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model KEMP \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--attn_loss \
--pointer_gen \
--save_path result/KEMP/ \
--emb_file data/glove.6B.300d.txt

KEMP w/o ECE

This variant does not use the emotional context graph of the Emotional Context Encoder (ECE).

In ECE, we enrich the dialogue history with external knowledge to build an emotional context graph. The emotional signals of the context are then distilled from the embeddings and emotion intensity values of that graph.
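To make the ECE idea more concrete, the sketch below enriches a tokenized utterance with related concepts and scores every node by emotion intensity. It is not the repository's code: the concept table, the valence/arousal values, and the intensity formula (L2 norm of valence minus 0.5 and half the arousal) are illustrative assumptions, and the "signal" here ignores embeddings entirely.

```python
import math

# Hypothetical toy resources; real ones would come from ConceptNet and NRC_VAD.
TOY_CONCEPTS = {"exam": ["test", "stress"], "failed": ["disappointment"]}
TOY_VAD = {  # word -> (valence, arousal), both assumed to lie in [0, 1]
    "exam": (0.40, 0.60),
    "failed": (0.10, 0.70),
    "test": (0.45, 0.55),
    "stress": (0.10, 0.90),
    "disappointment": (0.10, 0.60),
}


def emotion_intensity(word, vad=TOY_VAD):
    """Assumed score: L2 norm of (valence - 0.5, arousal / 2); 0.0 if unknown."""
    if word not in vad:
        return 0.0
    valence, arousal = vad[word]
    return math.hypot(valence - 0.5, arousal / 2)


def build_context_graph(tokens, concepts=TOY_CONCEPTS, max_per_token=1):
    """Return (nodes, edges): the tokens plus their most intense concepts."""
    nodes = list(tokens)
    edges = []
    for tok in tokens:
        ranked = sorted(concepts.get(tok, []), key=emotion_intensity, reverse=True)
        for concept in ranked[:max_per_token]:
            if concept not in nodes:
                nodes.append(concept)
            edges.append((tok, concept))
    return nodes, edges


def emotional_signal(nodes):
    """Intensity-normalized weights over nodes (a crude stand-in for distillation)."""
    scores = [emotion_intensity(n) for n in nodes]
    total = sum(scores)
    return [s / total if total else 0.0 for s in scores]
```

For the tokens ["i", "failed", "exam"], the graph gains the nodes "disappointment" and "stress", and the largest weight falls on "stress" under these toy values.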

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model wo_ECE \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--pointer_gen \
--save_path result/wo_ECE/ \
--emb_file data/glove.6B.300d.txt

KEMP w/o EDD

This variant does not use the emotional dependency strategies of the Emotion-Dependency Decoder (EDD).

In EDD, given the emotional signal and the emotional context graph, we incorporate an emotional cross-attention mechanism to selectively learn the emotional dependencies.
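The cross-attention step can be pictured with a small, dependency-free sketch. It assumes a single decoder state attending over graph node vectors with scaled dot-product attention, and it assumes that each node's emotion intensity is simply added to its attention logit; how the repository actually injects emotion into attention may differ, and all names below are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def emotional_cross_attention(query, node_vectors, intensities):
    """Attend from one decoder state over the nodes of a context graph.

    Assumption: logit = scaled dot product + emotion intensity, so that
    emotionally salient nodes receive more attention mass.
    """
    scale = math.sqrt(len(query))
    logits = [
        dot(query, vec) / scale + eta
        for vec, eta in zip(node_vectors, intensities)
    ]
    weights = softmax(logits)
    dim = len(node_vectors[0])
    # Context vector: attention-weighted sum of the node vectors.
    context = [
        sum(w * vec[i] for w, vec in zip(weights, node_vectors))
        for i in range(dim)
    ]
    return weights, context
```

With two identical node vectors, the node with the higher intensity receives the larger weight, which is the selective behavior the paragraph above describes.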

python main.py \
--cuda \
--label_smoothing \
--noam \
--emb_dim 300 \
--hidden_dim 300 \
--hop 1 \
--heads 2 \
--pretrain_emb \
--model wo_EDD \
--device_id 0 \
--concept_num 1 \
--total_concept_num 10 \
--pointer_gen \
--save_path result/wo_EDD/ \
--emb_file data/glove.6B.300d.txt

Testing

Append --test to any of the training commands above.

You can run the /result/cal_metrics.py script directly to evaluate the model predictions.
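Since the three training commands differ only in --model, --save_path, and --attn_loss, and testing only adds --test, the argument lists can be generated rather than copied. This is a sketch under the assumption that main.py accepts its flags in any order; build_command is a hypothetical helper, and every flag in it is copied from the commands above.

```python
import sys

# Flags shared by all three training commands.
COMMON_FLAGS = [
    "--cuda", "--label_smoothing", "--noam",
    "--emb_dim", "300", "--hidden_dim", "300",
    "--hop", "1", "--heads", "2",
    "--pretrain_emb",
    "--device_id", "0",
    "--concept_num", "1", "--total_concept_num", "10",
    "--pointer_gen",
    "--emb_file", "data/glove.6B.300d.txt",
]

VARIANTS = ("KEMP", "wo_ECE", "wo_EDD")


def build_command(model, test=False):
    """Return the argument list for training (or testing) one model variant."""
    if model not in VARIANTS:
        raise ValueError(f"unknown model: {model!r}")
    cmd = [sys.executable, "main.py", *COMMON_FLAGS,
           "--model", model, "--save_path", f"result/{model}/"]
    if model == "KEMP":
        # Only the full KEMP command above passes --attn_loss.
        cmd.append("--attn_loss")
    if test:
        cmd.append("--test")
    return cmd
```

The returned list can be passed to subprocess.run(..., check=True) from the repository root, for example subprocess.run(build_command("wo_EDD", test=True), check=True).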

Citation

If you find our work useful, please cite our paper as follows:

@inproceedings{li-etal-2022-kemp,
  title={Knowledge Bridging for Empathetic Dialogue Generation},
  author={Qintong Li and Piji Li and Zhaochun Ren and Pengjie Ren and Zhumin Chen},
  booktitle={AAAI},
  year={2022},
}

Owner

Qintong Li
