Predicting Home Credit Default

Author - Alice Agrawal

Overview

The model built in this project is used to predict the default of house credit.

Problem

I am hired to create a model to improve Wells Fargo’s Home Credit portfolio performance going forward. Wells Fargo wants to predicts whether the client will default using the data provided by the client in its loan application.

Data

I used a Kaggel competition data which has about 48 thousand entries and 120 features. The dataset was sourced from: https://www.kaggle.com/datasets/julianocosta/home-credit

Methods

A variety of different data science techniques were used to improve estimation.

I started off with a Dummy Classifier to help establish a baseline to compare all future models against. After this I tried a variety of different algorithms hypertuning them where I felt necessary. The specific methods used are:

Logistic Regression
Random Forest
ADA boosting
Gradient Boosting

I also employed SMOTE on a few hypertuned parameters to help with the class imbalance.

Results

After the iterative process, our final model is Random Forest with parameter tuning and the result is:

Conclusion

Three recommendations:

Wells Fargo should aim to increase the number of revolving loans as the default on these are much lower.
Wells Fargo should target clients older than the age of 40 years.
Clients with employment type of working, businessmen and students should be focused on to improve their default rates.

Next Steps

To further improve the model, in the future I could look into:

Stacking the the different models to improve the model further.
Collecting more recent data
Including sentiment analysis using the worded answers in the application.

Repository Structure

├── Data/home-credit-default-risk
│     ├── HomeCredit_columns_description.csv
│     └── application_train.csv.zip
│ 
├── Images
│     ├── Age.png
│     ├── Employment.png
│     ├── EmploymentYrs.png
│     ├── Gender.png
│     ├── Income.png
│     ├── LoanType.png
│     ├── home_credit_data.png
│     ├── 
│     ├── 
│     └── 
│    
├── README.md
│ 
├── Final_notebook.ipynb
│
├── Slides
│ 
└── .gitignore

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting Home Credit Default

Author - Alice Agrawal

Overview

Problem

Data

Methods

Results

Conclusion

Next Steps

Repository Structure

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Data/home-credit-default-risk		Data/home-credit-default-risk
Images		Images
.gitignore		.gitignore
Final_notebook.ipynb		Final_notebook.ipynb
Presentation.pdf		Presentation.pdf
README.md		README.md

aliceagrawal/Home-Credit-Default-Prediction

Folders and files

Latest commit

History

Repository files navigation

Predicting Home Credit Default

Author - Alice Agrawal

Overview

Problem

Data

Methods

Results

Conclusion

Next Steps

Repository Structure

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages