Natural Language Processing Specialization

Last update: Oct 06, 2022

Overview

This folder contains my projects and notes from the Natural Language Processing Specialization.

WHAT I LEARNED

Use logistic regression, naïve Bayes, and word vectors to implement sentiment analysis, complete analogies & translate words.
Use dynamic programming, hidden Markov models, and word embeddings to implement autocorrect, autocomplete & identify part-of-speech tags for words.
Use recurrent neural networks, LSTMs, GRUs & Siamese networks in Trax for sentiment analysis, text generation & named entity recognition.
Use encoder-decoder, causal, and self-attention to translate complete sentences, summarize text, and build chatbots and question-answering systems.

There are 4 Courses in this Specialization

Course 1 - Natural Language Processing with Classification and Vector Spaces

In the first course of the Natural Language Processing Specialization:
I performed sentiment analysis of tweets using logistic regression and then naïve Bayes,
I used vector space models to discover relationships between words and used PCA to reduce the dimensionality of the vector space and visualize those relationships, and
I wrote a simple English to French translation algorithm using pre-computed word embeddings and locality-sensitive hashing to relate words via approximate k-nearest neighbor search.
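
As a rough illustration of how vector space models relate words, the sketch below completes an analogy with cosine similarity. It is not the course solution: the names `cosine_similarity`, `complete_analogy`, and `toy_embeddings` are hypothetical, and the three-dimensional vectors are made up for the example rather than taken from any pre-trained embedding set.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def complete_analogy(a, b, c, embeddings):
    """Return the word d that best satisfies a : b :: c : d.

    The target vector is b - a + c; the answer is the remaining
    vocabulary word whose embedding is closest to it by cosine similarity.
    """
    target = [
        eb - ea + ec
        for ea, eb, ec in zip(embeddings[a], embeddings[b], embeddings[c])
    ]
    candidates = [w for w in embeddings if w not in {a, b, c}]
    return max(candidates, key=lambda w: cosine_similarity(target, embeddings[w]))


# Hypothetical toy embeddings, invented for this example only.
toy_embeddings = {
    "man": [1.0, 0.0, 0.1],
    "woman": [1.0, 1.0, 0.1],
    "king": [1.0, 0.0, 1.0],
    "queen": [1.0, 1.0, 1.0],
    "apple": [0.0, 0.2, 0.0],
}

print(complete_analogy("man", "woman", "king", toy_embeddings))  # queen
```

With real embeddings the exhaustive search over the vocabulary becomes expensive, which is where an approximate method such as locality-sensitive hashing comes in.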

Projects

Course 2 - Natural Language Processing with Probabilistic Models

In the second course of the Natural Language Processing Specialization:
I wrote a simple auto-correct algorithm using minimum edit distance and dynamic programming,
I applied the Viterbi Algorithm for part-of-speech (POS) tagging, which is vital for computational linguistics,
I wrote a better auto-complete algorithm using an N-gram language model, and
I wrote my own Word2Vec model that uses a neural network to compute word embeddings using a continuous bag-of-words model.
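
The minimum edit distance used for auto-correct can be sketched with a small dynamic programming table. This is a minimal sketch, not the assignment code: the function name `min_edit_distance` and the default costs (insert 1, delete 1, replace 2) are assumptions chosen for the example.

```python
def min_edit_distance(source, target, ins_cost=1, del_cost=1, rep_cost=2):
    """Minimum cost of turning `source` into `target` with inserts,
    deletes, and replacements. The default costs are assumptions."""
    m, n = len(source), len(target)
    # D[i][j] holds the cost of converting source[:i] into target[:j].
    D = [[0] * (n + 1) for _ in range(m + 1)]

    # First column: delete every character of the source prefix.
    for i in range(1, m + 1):
        D[i][0] = D[i - 1][0] + del_cost
    # First row: insert every character of the target prefix.
    for j in range(1, n + 1):
        D[0][j] = D[0][j - 1] + ins_cost

    for i in range(1, m + 1):
        for j in range(1, n + 1):
            # Replacing a character with itself is free.
            r = 0 if source[i - 1] == target[j - 1] else rep_cost
            D[i][j] = min(
                D[i - 1][j] + del_cost,  # delete from source
                D[i][j - 1] + ins_cost,  # insert into source
                D[i - 1][j - 1] + r,     # replace or keep
            )
    return D[m][n]


print(min_edit_distance("play", "stay"))  # 4
```

Setting `rep_cost=1` gives the classic Levenshtein distance, for which "kitten" to "sitting" costs 3.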

Projects

Course 3 - Natural Language Processing with Sequence Models

In the third course of the Natural Language Processing Specialization:
I trained a neural network with GloVe word embeddings to perform sentiment analysis of tweets,
I generated synthetic Shakespeare text using a Gated Recurrent Unit (GRU) language model,
I trained a recurrent neural network to perform named entity recognition (NER) using LSTMs with linear layers, and
I used so-called ‘Siamese’ LSTM models to compare questions in a corpus and identify those that are worded differently but have the same meaning.
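
To make the GRU mentioned above more concrete, here is a single GRU update step written in plain Python. It is a sketch under stated assumptions, not the Trax layer used in the course: the function `gru_step`, the parameter dictionary keys (`W_z`, `U_z`, `b_z`, and so on), and the helper names are all hypothetical. It uses the convention h_new = (1 - z) * h + z * h_candidate; some references swap the roles of z and 1 - z.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def affine(W, U, b, x, h):
    """Compute W @ x + U @ h + b element by element."""
    return [wx + uh + bi for wx, uh, bi in zip(matvec(W, x), matvec(U, h), b)]


def gru_step(x, h, p):
    """One GRU update for input x and previous hidden state h.

    `p` is a dict of weights with hypothetical keys:
    W_z, U_z, b_z (update gate), W_r, U_r, b_r (reset gate),
    W_h, U_h, b_h (candidate state).
    """
    z = [sigmoid(a) for a in affine(p["W_z"], p["U_z"], p["b_z"], x, h)]
    r = [sigmoid(a) for a in affine(p["W_r"], p["U_r"], p["b_r"], x, h)]
    # The reset gate scales the previous state before the candidate is computed.
    gated_h = [ri * hi for ri, hi in zip(r, h)]
    h_cand = [
        math.tanh(a) for a in affine(p["W_h"], p["U_h"], p["b_h"], x, gated_h)
    ]
    # The update gate interpolates between the old state and the candidate.
    return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, h_cand)]


def zero_params(input_dim, hidden_dim):
    """All-zero weights, useful only for checking the arithmetic."""
    p = {}
    for gate in ("z", "r", "h"):
        p["W_" + gate] = [[0.0] * input_dim for _ in range(hidden_dim)]
        p["U_" + gate] = [[0.0] * hidden_dim for _ in range(hidden_dim)]
        p["b_" + gate] = [0.0] * hidden_dim
    return p


# With zero weights both gates equal 0.5 and the candidate is 0,
# so the new state is half of the old one.
print(gru_step([1.0, 2.0], [1.0, -2.0], zero_params(2, 2)))  # [0.5, -1.0]
```

A language model applies this step once per token and feeds the hidden state into an output layer; training the weights is outside the scope of this sketch.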

Projects

Course 4 - Natural Language Processing with Attention Models

In the fourth course of the Natural Language Processing Specialization:
I translated complete English sentences into German using an encoder-decoder attention model,
I built a Transformer model to summarize text,
I used T5 and BERT models to perform question-answering, and
I built a chatbot using a Reformer model.
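
The attention models above share one core operation, scaled dot-product attention, which can be sketched in plain Python as follows. This is an illustrative sketch and not the course implementation: the function name `scaled_dot_product_attention` and the `causal` flag are assumptions, and real models work on batched tensors with learned projections rather than nested lists.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(Q, K, V, causal=False):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of row vectors.

    With causal=True, position i may only attend to positions j <= i,
    as in decoder-style (causal) attention.
    """
    d_k = len(K[0])
    output = []
    for i, q in enumerate(Q):
        scores = [
            sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k) for k in K
        ]
        if causal:
            # Masked positions get -inf, so their softmax weight is 0.
            scores = [
                s if j <= i else float("-inf") for j, s in enumerate(scores)
            ]
        weights = softmax(scores)
        # Each output row is a weighted average of the value vectors.
        output.append(
            [sum(w * v[c] for w, v in zip(weights, V)) for c in range(len(V[0]))]
        )
    return output


Q = [[0.0, 0.0], [0.0, 0.0]]
K = [[0.0, 0.0], [0.0, 0.0]]
V = [[1.0, 0.0], [0.0, 1.0]]
print(scaled_dot_product_attention(Q, K, V))               # [[0.5, 0.5], [0.5, 0.5]]
print(scaled_dot_product_attention(Q, K, V, causal=True))  # [[1.0, 0.0], [0.5, 0.5]]
```

In self-attention Q, K, and V all come from the same sequence; in encoder-decoder attention the queries come from the decoder and the keys and values from the encoder.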

Projects

Disclaimer

DeepLearning.AI makes course notes available for educational purposes.
Project solutions are provided for educational purposes only. I highly recommend attempting the project and programming assignments on your own first.

All the best 🤘

Owner

Kaan BOKE
