Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Last update: Jan 29, 2022

Overview

Mortgage-Application-Analysis

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables: age, income level, occupancy type, accepted, and debt-income ratio, Eliminating all the demographic bias except for age We picked 5 attributes from the Mortgage data set provided and created a separate *.csv file to avoid extra data loss from the null values of the attributes which we neglect in our model. We preprocessed the data to drop any null values of the applicants which might skew our datasets using the pandas library For the processing part, we had some classification data with controlled intervals. We used Ordinal encoding to convert those into numeric discrete data for training and testing our model. We also had one, unique string data attribute, which was encoded using One-hot encoding to extract numeric values for processing. With this clean data, we divided the data into two groups, 80% for validation and 20%, and trained our model to establish a correlation between mortgage application acceptance.

Using Matlab plot, we carried out data/representation/ visualization and found out, other than debt-to-income ratio, there isn’t any significant correlation between acceptance and other non-demographic factors After this visualization to establish our hypothesis, we trained our model using the data set we created., and evaluate the model we created we applied 4 types of algorithms to test it out: We used the Logistic Regression model to create a line the best fit for log-odds values to calculate the acceptance rate for the mortgage application. The F1 score, precision score, and recall score for this testing were very high, which suggested that the non-demographic factor which we accounted for didn’t have many roles in the application being accepted or rejected. Similarly, we carried out a random forest model, Decision Tree, and Support Vector machine algorithm and each of those evaluations had really high precision, recall, and F1 score supporting the evidence from data visualization.

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Related tags

Overview

Mortgage-Application-Analysis

Owner

TTS is a library for advanced Text-to-Speech generation.

Fuzzy String Matching in Python

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Spacy-ginza-ner-webapi - Named Entity Recognition API with spaCy and GiNZA

Turkish Stop Words Türkçe Dolgu Sözcükleri

Twitter-NLP-Analysis - Twitter Natural Language Processing Analysis

Phomber is infomation grathering tool that reverse search phone numbers and get their details, written in python3.

Club chatbot

this repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

The tool to make NLP datasets ready to use

Fine-tuning scripts for evaluating transformer-based models on KLEJ benchmark.

Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision

The entmax mapping and its loss, a family of sparse softmax alternatives.

Code for EMNLP20 paper: "ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training"

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

DELTA is a deep learning based natural language and speech processing platform.

PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"

Tools, wrappers, etc... for data science with a concentration on text processing