I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive

Last update: Jan 13, 2022

Overview

Sentiment-of-movie-reviews

I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive. Obstacles like sentence negation, sarcasm, terseness, language ambiguity, and many others make this task very challenging.

This project uses datasets available on Kaggle for training and testing.
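As a rough illustration of how such data might be read, the sketch below parses a tab-separated file of phrases and integer labels. The column names `Phrase` and `Sentiment`, and the helper `read_phrases`, are assumptions made for this example; the actual Kaggle files used by the project may be laid out differently.

```python
import csv
import io


def read_phrases(stream):
    """Yield (phrase, label) pairs from a tab-separated stream with a header row.

    Assumes hypothetical columns named "Phrase" and "Sentiment".
    """
    # QUOTE_NONE keeps quotation marks inside review text as literal characters.
    reader = csv.DictReader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        yield row["Phrase"], int(row["Sentiment"])


# A tiny in-memory sample standing in for a real training file.
sample = io.StringIO(
    "PhraseId\tSentenceId\tPhrase\tSentiment\n"
    "1\t1\tA series of escapades\t2\n"
    "2\t1\tnot funny at all\t0\n"
)
pairs = list(read_phrases(sample))
```

With a real file, the same function could be called on `open(path, encoding="utf-8", newline="")` instead of the in-memory sample.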

Transformers brings these models together and makes each one easy to use with only a few lines of code. It even provides tools such as pipelines and live demos that classify text without any training or lengthy coding. But as you can guess, these simple, ready-to-use models have weaknesses. For example, you can't choose the number of labels, because each model was trained on text with a specific set of labels. Also, not all of the default models are as strong and accurate as we would like (for example, the default model for sentiment analysis is an uncased DistilBERT, which is not the best model available). With all this in mind, we want to train Transformers models on our own data, using the models we prefer.
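A minimal sketch of what choosing our own labels and model could look like is shown here. It assumes the five classes are numbered 0 to 4 from negative to positive, and it uses `bert-base-uncased` only as a placeholder checkpoint; neither choice is confirmed by this project. The helper `build_model` is hypothetical and downloads weights when called.

```python
# Five sentiment classes, assumed to be ordered from 0 (negative) to 4 (positive).
LABELS = [
    "negative",
    "somewhat negative",
    "neutral",
    "somewhat positive",
    "positive",
]
ID2LABEL = dict(enumerate(LABELS))
LABEL2ID = {name: idx for idx, name in ID2LABEL.items()}


def build_model(checkpoint="bert-base-uncased"):
    """Load a tokenizer and a classifier with a fresh five-way output head.

    The checkpoint name is a placeholder; any sequence-classification
    compatible checkpoint could be substituted.
    """
    # Imported lazily so the label mapping above works without the library.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForSequenceClassification.from_pretrained(
        checkpoint,
        num_labels=len(LABELS),
        id2label=ID2LABEL,
        label2id=LABEL2ID,
    )
    return tokenizer, model
```

The classification head created this way is randomly initialized, so the model still has to be fine-tuned on labeled phrases before its predictions mean anything.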


This repository collects together basic linguistic processing data for using dataset dumps from the Common Voice project

A library for finding knowledge neurons in pretrained transformer models.

Nested Named Entity Recognition

Two-stage text summarization with BERT and BART

chaii - hindi & tamil question answering

Code for the Python code smells video on the ArjanCodes channel.

Yet Another Sequence Encoder - Encode sequences to vectors of vectors in Python!

Search Git commits in natural language

PhoNLP: A BERT-based multi-task learning toolkit for part-of-speech tagging, named entity recognition and dependency parsing

Text editor in Python tkinter to convert English text to other languages with the help of polyglot.

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x using fastT5.

Stanford CoreNLP provides a set of natural language analysis tools written in Java

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

List of GSoC organisations with number of times they have been selected.

ChessCoach is a neural network-based chess engine capable of natural-language commentary.

Super easy library for BERT based NLP models

An open collection of annotated voices in Japanese language

Artificial Conversational Entity for queries in Eulogio "Amang" Rodriguez Institute of Science and Technology (EARIST)

Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.

Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences"