uu.seUppsala University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Study of Single and Ensemble Machine Learning Models on Credit Data to Detect Underlying Non-performing Loans
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Statistics.
2016 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

In this paper, we try to compare the performance of two feature dimension reduction methods, the LASSO and PCA. Both simulation study and empirical study show that the LASSO is superior to PCA when selecting significant variables. We apply Logistics Regression (LR), Artificial Neural Network (ANN), Support Vector Machine (SVM), Decision Tree (DT) and their corresponding ensemble machines constructed by bagging and adaptive boosting (adaboost) in our study. Three experiments are conducted to explore the impact of class-unbalanced data set on all models. Empirical study indicates that when the percentage of performing loans exceeds 83.3%, the training models shall be carefully applied. When we have class-balanced data set, ensemble machines indeed have a better performance over single machines. The weaker the single machine, the more obvious the improvement we can observe.

Place, publisher, year, edition, pages
2016. , p. 77
Keywords [en]
Machine learning, Feature Dimension Reduction, NPL
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:uu:diva-297080OAI: oai:DiVA.org:uu-297080DiVA, id: diva2:940833
External cooperation
Emric AB
Subject / course
Statistics
Educational program
Master Programme in Statistics
Supervisors
Examiners
Available from: 2016-06-22 Created: 2016-06-21 Last updated: 2016-06-22Bibliographically approved

Open Access in DiVA

fulltext(625 kB)278 downloads
File information
File name FULLTEXT01.pdfFile size 625 kBChecksum SHA-512
90ec2dac4aa7140207b441fb2b47dc2c60807fbfb46e6943069d7b05fe7135d9592cde5c95f7b007a6be0305f49c98bfa0b6398f7c46d8855ff07a74750d96d2
Type fulltextMimetype application/pdf

By organisation
Department of Statistics
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 278 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 566 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf