Generating Korean Slogans with phonetic and structural repetition

Last update: May 23, 2022

Related tags

Overview

LexPOS_ko

Generating Korean Slogans with phonetic and structural repetition

Generating Slogans with Linguistic Features

LexPOS is a sequence-to-sequence transformer model that generates slogans with phonetic and structural repetition. For phonetic repetition, it searches for phonetically similar words with user keywords. Both the sound-alike words and user keywords become the lexical constraints while generating slogans. It also adjusts the logits distribution to implement further phonetic constraints. For structural repetition, LexPOS uses POS constraints. Users can specify any repeated phrase structure by POS tags.

Generating slogans with lexical, POS constraints

1. Code

Need to download pretrained Korean word2vec model from here and put it below phonetic_similarity/KoG2P

# clone this repo
git clone https://github.com/yeounyi/LexPOS_ko
cd LexPOS
# generate slogans 
python3 generate_slogans.py --keywords 카드,혜택 --num_beams 3 --temperature 1.2

-keywords: Keywords that you want to be included in slogans. You can enter multiple keywords, delimited by comma
-pos_inputs: You can either specify the particular list of POS tags delimited by comma, or the model will generate slogans with the most frequent syntax used in corpus. POS tags generally follow the format of Konlpy Mecab POS tags.
-num_beams: Number of beams for beam search. Default to 1, meaning no beam search.
-temperature: The value used to module the next token probabilities. Default to 1.0.
-model_path: Path to the pretrained model

2. Examples

Keyword: 카드, 혜택
POS: [NNG, JK, VV, EC, SF, NNG, JK, VV, EF]
Output: 카드를 택하면, 혜택이 바뀐다

Keyword: 안전, 항공
POS: [MM, NNG, SF, MM, NNG, SF]
Output: 새로운 공항, 안전한 항공

Keywords: 추석, 선물
POS: [NNG, JK, MM, NNG, SF, NNG, JK, MM, NNG]
Output: 추석을 앞둔 추억, 당신을 위한 선물

Model Architecture

Pretrained Model

https://drive.google.com/drive/folders/1opkhDApURnjibVYmmhj5bqLTWy4miNe4?usp=sharing

References

https://github.com/scarletcho/KoG2P

Citation

@misc{yi2021lexpos,
  author = {Yi, Yeoun},
  title = {Generating Korean Slogans with Linguistic Constraints using Sequence-to-Sequence Transformer},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/yeounyi/LexPOS_ko}}
}

Generating Korean Slogans with phonetic and structural repetition

Related tags

Overview

LexPOS_ko

Generating Slogans with Linguistic Features

Generating slogans with lexical, POS constraints

1. Code

2. Examples

Model Architecture

Pretrained Model

References

Citation

Owner

Yeoun Yi

File-based TF-IDF: Calculates keywords in a document, using a word corpus.

Lumped-element impedance calculator and frequency-domain plotter.

中文問句產生器；使用台達電閱讀理解資料集(DRCD)

A curated list of FOSS tools to improve the Hacker News experience

Crie tokens de autenticação íntegros e seguros com UToken.

Spacy-ginza-ner-webapi - Named Entity Recognition API with spaCy and GiNZA

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks

SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search

KLUE-baseline contains the baseline code for the Korean Language Understanding Evaluation (KLUE) benchmark.

Rootski - Full codebase for rootski.io (without the data)

Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"

Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization

A paper list of pre-trained language models (PLMs).

PRAnCER is a web platform that enables the rapid annotation of medical terms within clinical notes.

TalkNet: Audio-visual active speaker detection Model

Takes a string and puts it through different languages in Google Translate a requested amount of times, returning nonsense.

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

New Modeling The Background CodeBase