Similarity checking of sign languages

Overview

Similarity checking of sign languages

This repository checks for similarity between

  1. British sign language and Peruvian sign language
  2. British sign language and American sign language

with the "Temporal segmentation of sign language videos" model that is pre-trained in British sign language (BSL).

Contents

picture

Setups

All set up instructions for "Temporal segmentation of sign language videos" can be found here. The other pre-processing models are python files.

Data & Models/algorithms

The main datasets, processed datasets and all their pre-processing models are in the folder named dataset. Inside the dataset folder, for Peruvian sign language there are two datasets: datasets/ira_alegira & datasets/proteinas_porcentajes and for American sign language, there is one dataset: datasets/how2sign.

Data

  • ira_alegria:

    1. processed ira_alegria: It has srt, videos, vtt folders holding the input segmented signs, input videos, output segmented signs accordingly.
    2. raw ira_alegria: It has srt, videos folders holding the signs with temporal boundaries, input videos accordingly.
    3. process_ira.py: This is the model we implemented to process the raw dataset.
  • proteinas_porcentajes:

    1. processed proteinas_porcentajes: It has srt, videos, vtt folders holding the input segmented signs, input videos, output segmented signs accordingly.
    2. raw proteinas_porcentajes: It has srt, videos folders holding the signs with temporal boundaries, input videos accordingly.
    3. process_proteinas.py: This is the model we implemented to process the raw dataset.
  • how2sign:

    1. processed how2sign: It has srt, videos, vtt folders holding the input segmented signs, input videos, output segmented signs accordingly.
    2. raw how2sign: It has srt, videos folders holding the signs with temporal boundaries, input videos accordingly.
    3. process_how2sign.py: This is the model we implemented to process the raw dataset.

Models

The models for pre-processing the datasets are attached in the datasets folder.

Results

  • British sign language and Peruvian sign language: We processed two datasets namely ira_alegria and proteinas_porcentajes. We identified in how many files/lines, there is at-least one match. The input files are the srt files we processed from ira_alegria and proteinas_porcentajes datasets and the output files are the vtt files that we collected by running corresponding video files of these srt files in the "Temporal segmentation of sign language videos" model. These input srt files and output vtt files has a number of words for signs in that line/sentence/video file. Than we identified for how many input files, there is at-least one word match in the output file. For ira_alegria and proteinas_porcentajes, we found approximately 30% and 16%.

ira_alegria result proteinas result

  • British sign language and American sign language: We processed one dataset namely how2sign. We identified in how many files/lines, there is at-least one matched word. The input files are the srt files we processed from the csv translation file in the dataset and the output files are the vtt files that we collected by running corresponding video files of these srt files in the "Temporal segmentation of sign language videos" model. These input srt files and output vtt files has a number of words for signs in that line/sentence/video file. Than we identified for how many input files, there is at-least one word match in the output file. For how2sign, we found approximately 46%.

how2sign result

Owner
Tonni Das Jui
Tonni Das Jui
Batch Python Program Verify

Batch Python Program Verify About As a TA(teaching assistant) of Programming Class, it is very annoying to test students' homework assignments one by

Han-Wei Li 7 Dec 20, 2022
一个IDA脚本,可以检测出哈希算法(无论是否魔改常数)并生成frida hook 代码。

findhash 在哈希算法上,比Findcrypt更好的检测工具,同时生成Frida hook代码。 使用方法 把findhash.xml和findhash.py扔到ida plugins目录下 ida -edit-plugin-findhash 试图解决的问题 哈希函数的初始化魔数被修改 想快速

266 Dec 29, 2022
berisi kodingan kodingan python umum yang kubuat.

python-codevault berisi kodingan kodingan python umum yang kubuat. untuk memudahkan transisi dan menjadi refrensi tutorial. daily challange for myself

Agung Zon Blade 1 Dec 19, 2021
Runs macOS on linux with qemu.

mac-on-linux-with-qemu Runs macOS on linux with qemu. Pre-requisites qemu-system-x86_64 dmg2img pulseaudio python[click] Usage After cloning the repos

Arindam Das 177 Dec 26, 2022
Um Script De Mensagem anonimas Para linux e Termux Feito em python

Um Script De Mensagem anonimas Para linux e Termux Feito em python feito em um celular

6 Sep 09, 2021
Tracking stock volatility.

SP500-highlow-tracking Track stock volatility. Being a useful indicator of the stock price volatility, High-Low gap represents the price range of the

Thong Huynh 13 Sep 07, 2022
Headless - Wrapper around Ghidra's analyzeHeadless script

Wrapper around Ghidra's analyzeHeadless script, could be helpful to some? Don't tell me anything is wrong with it, it works on my machine.

8 Oct 29, 2022
A tool to assist in code raiding in rust

Kodelock a tool to assist in code raiding in rust This tool is designed to be used on a second monitor. This tools will allow you to see a easily read

3 Oct 27, 2022
Practice in Oxford_AI&ML class

Practice in Oxford_AI&ML class

St3ve Lee 2 Feb 04, 2022
YBlade - Import QBlade blades into Fusion 360

YBlade - Import QBlade blades into Fusion 360 Simple script for Fusion 360 that takes QBlade blade description and constructs the blade: Usage First,

Jan Mrázek 37 Sep 25, 2022
Hasklig - a code font with monospaced ligatures

Hasklig – Ligatures for code Programming languages are limited to relatively few characters. As a result, combined character operators surfaced quite

Ian Tuomi 5.3k Jan 03, 2023
DD监控室第一版

DD监控室 运行指南

执明神君 1.2k Dec 31, 2022
A python tool used for hacking WhatsApp by diverting otp

W-HACK A python tool used for hacking WhatsApp by diverting otp You can hack WhatsApp easily with this tool Note:OTP expires after 5 seconds HOW TO IN

Spider Anongreyhat 3 Oct 17, 2021
A performant state estimator for power system

A state estimator for power system. Turbocharged with sparse matrix support, JIT, SIMD and improved ordering.

9 Dec 12, 2022
A country information finder module

A country information finder module

Fayas Noushad 3 Nov 28, 2021
A python tool that creates issues in your repos based on TODO comments in your code

Krypto A neat little sidekick python script to create issues on your repo based on comments left in the code on your behalf Convert todo comments in y

Alex Antoniou 4 Oct 26, 2021
Create VSCode Extensions with python

About Create vscode extensions with python. Installation Stable version: pip install vscode-ext Why use this? Why should you use this for building VSc

Swas.py 134 Jan 07, 2023
A price calculator for multiple things

Price Calculator A price calculator for multiple things Example I have 0.0567kg diamond. The price of diamond in kg is: $4500. Then it says: The price

Abel 1 Nov 26, 2021
This is a simple leaderboard for 30 days of Google Cloud program for students of ASIET

30daysleaderboard #Hacktoberfest - Please don't make changes in readme file. Only improvement in the project will be accepted. Update - Now if you run

5 Oct 29, 2021
Why write code when you can import it directly from GitHub Copilot?

Copilot Importer Why write code when you can import it directly from GitHub Copilot? What is Copilot Importer? The copilot python module will dynamica

Mythic 41 Jan 04, 2023