Login | Register

Bankruptcy prediction: A comparison of data mining models on unbalanced data and effects of sampling

Title:

Bankruptcy prediction: A comparison of data mining models on unbalanced data and effects of sampling

Javvadi, Gunin Ruthwik (2023) Bankruptcy prediction: A comparison of data mining models on unbalanced data and effects of sampling. Masters thesis, Concordia University.

[thumbnail of Javvadi_MSc_S2024.pdf]
Preview
Text (application/pdf)
Javvadi_MSc_S2024.pdf - Accepted Version
Available under License Spectrum Terms of Access.
788kB

Abstract

With the very unbalanced data found in financial risk prediction, this study hopes to aid in anticipating the financial risk that corporations may encounter. We can improve performance by employing oversampling and under-sampling algorithms. We were able to better understand how the performance of each classifier changed in each dataset by using a variety of classifiers across three distinctively sampled datasets. In addition, we analyzed our dataset using three different evaluation metrics: accuracy, sensitivity, and specificity, rather than being limited to just one. The results indicate that the accuracy on three separate datasets with various sampling methods differs greatly. The sensitivity and specificity of the under-sampled dataset differ from those of the original dataset and the oversampled dataset, which are fairly comparable to one another. It was discovered that gradient boosting trees produce better outcomes than other algorithms. When using oversampled data and measuring accuracy, logistic regression was found to be the most effective. However, when using under-sampled data, LightGBM Classifier had the best performance. Additionally, when considering sensitivity and specificity, CatBoost Classifier was the best choice.

Keywords: Bankruptcy, unbalanced data, classification

Divisions:Concordia University > John Molson School of Business > Supply Chain and Business Technology Management
Item Type:Thesis (Masters)
Authors:Javvadi, Gunin Ruthwik
Institution:Concordia University
Degree Name:M. Sc.
Program:Business Administration (Supply Chain and Business Technology Management specialization)
Date:27 November 2023
Thesis Supervisor(s):lahmiri, Salim
ID Code:993540
Deposited By: Gunin Ruthwik Javvadi
Deposited On:04 Jun 2024 14:24
Last Modified:04 Jun 2024 14:24
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top