Text editor on python to convert english text to malayalam(Romanization/Transiteration).

Last update: May 11, 2022

Related tags

Overview

Manglish Text Editor

This is a simple transiteration (romanization ) program which is used to convert manglish to malayalam (converts njaan to ഞാൻ ). It is aimed to help people who have difficulty in typing malayalam and who is good in typing English.

Tkinter is used for text editor creation and simple database lookup along with frequency data is used for the transiteration program.

Requirements

The system should have python3 installed. Ths system i tested on python 3.8.
The system works on linux and Mac. Minor changes may be required to run this on windows.
The code requires tkinter to be installed. pip install tk command can be used for this.

How to use

download or clone the repository using command git clone
It is recommended to run the code in a separate virtual environment.
Get into the main folder manglish_text_editor by cd manglish_text_editor in terminal.
When you run the program for the first time the frequency table needs to get created. For that run python3 transiterator.py. Note that it is a time consuming operation.
run python3 main.py. This will open the text editor in another window.
The text editor is self explanatory.

How it works

The program makes up a database of possible english typings of a malayalam word and then for each user input it tries to find a near match in the database and along with that tries to create the original word.

The text editor is created using python package named tkinter.

Features

Text editor in which the typed english(manglish) word will be converted to malayalam on pressing space or enter key.
The text editor has options file save, open, save as, new etc.

Future Scope

Improve tokenizing
use a better method to remove noise
Improve learning algorithm
In text editor add malayalam key board, conversion of an entire file at once, Delete file
Give option to the user to select from the possible list of words on backspace press.
Add bold, text space, font, points to the text editor.
Add feature to convert malayalam to manglish.
Add option select all, search, replace etc.

Contributions

Pull requests are welcome. If someone wants to contribute to this project can fork and add the Functionalities.

Text editor on python to convert english text to malayalam(Romanization/Transiteration).

Related tags

Overview

Manglish Text Editor

Requirements

How to use

How it works

Features

Future Scope

Contributions

Owner

Merin Rose Tom

Fast topic modeling platform

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Unlimited Call - Text Bombing Tool

This repository implements a brute-force spellchecker utilizing the Damerau-Levenshtein edit distance.

Open source annotation tool for machine learning practitioners.

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Opal-lang - A WIP programming language based on Python

Code for Findings at EMNLP 2021 paper: "Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning"

NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Convolutional 2D Knowledge Graph Embeddings resources

Machine Learning Course Project, IMDB movie review sentiment analysis by lstm, cnn, and transformer

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Code for paper: An Effective, Robust and Fairness-awareHate Speech Detection Framework

Pytorch implementation of Tacotron

Precision Medicine Knowledge Graph (PrimeKG)

The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)

Code for "Generative adversarial networks for reconstructing natural images from brain activity".

Code for producing Japanese GPT-2 provided by rinna Co., Ltd.