Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/88771
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor呂育道zh_TW
dc.contributor.advisorYuh-Dauh Lyuuen
dc.contributor.author黃冠瑋zh_TW
dc.contributor.authorGuan-Wei Huangen
dc.date.accessioned2023-08-15T17:43:12Z-
dc.date.available2023-11-09-
dc.date.copyright2023-08-15-
dc.date.issued2023-
dc.date.submitted2023-08-02-
dc.identifier.citation[1] V. Animah Ofosu-Hene. Literature reviews on loan default’s impact on ecobank finances. Journal of Engineering Applied Science and Humanities, 7(1):24–36, 2022.
[2] S.-C. Carlos, G.-N. Begoña, and L.-P. Luz. Determinants of default in P2P lending. PLoS ONE, 10:1–22, 2015.
[3] N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16:321–357, 2002.
[4] K. Chinnapat, V. Nattakan, and S. Anantapom. A combination of decision tree learning and clustering for data classification. 2011 Eighth International Joint Conference on Computer Science and Software Engineering (JCSSE), pages 363–367, 2011.
[5] R. Emekter, Y. Tu, B. Jirasakuldech, and M. Lu. Evaluating credit risk and loan performance in online peer-to-peer (P2P) lending. Applied Economics, 47(1):54–70, 2015.
[6] T. Fawcett. An introduction to ROC analysis. Pattern Recognition Letters, 27(8):861–874, 2006.
[7] H. Haibo and G. E. A. Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9):1263–1284, 2009.
[8] D. J. Hand and W. E. Henley. Statistical classification methods in consumer credit scoring: A review. Journal of the Royal Statistical Society. Series A (Statistics in Society), 160:523–541, 1997.
[9] J. A. Hartigan and M. A. Wong. Algorithm as 136: A K-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), 28:100–108, 1979.
[10] D. Jesse and G. Mark. The relationship between precision-recall and ROC curves. In Proceedings of the 23rd International Conference on Machine Learning, page 233–240, 2006.
[11] X. Junhui, L. Zekai, and X. Ying. Loan default prediction of Chinese P2P market: a machine learning methodology. Scientific Reports, 11(1):18759, 2021.
[12] S. Lessmann, B. Baesens, H.-V. Seow, and L. C. Thomas. Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 247:124–136, 2015.
[13] V. Moscato, A. Picariello, and G. Sperlí. A benchmark of machine learning approaches for credit score prediction. Expert Systems with Applications, 165:113986, 2021.
[14] A. Namvar, M. Siami, F. Rabhi, and M. Naderpour. Credit risk prediction in an imbalanced social lending environment. International Journal of Computational Intelligence Systems, 11:925–935, 2018.
[15] G. Nie, W. Rowe, L. Zhang, Y. Tian, and Y. Shi. Credit card churn forecasting by logistic regression and decision tree. Expert Systems with Applications, 38:15273–15285, 2011.
[16] P. J. Rousseeuw. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20:53–65, 1987.
[17] M. Roweida, R. Jumanah, and A. Malak. Machine learning with oversampling and undersampling techniques: Overview study and experimental results. In 2020 11th International Conference on Information and Communication Systems (ICICS), pages 243–248, 2020.
[18] H.-W. Teng, M.-H. Kang, and I.-H. Lee. Improving credit scoring: A rescaled cluster-then-predict approach. Available at SSRN 4355268, 2023.
[19] C.-F. Tsai. Combining cluster analysis with classifier ensembles to predict financial distress. Information Fusion, 16:46–58, 2014.
[20] J. Xiao, Y. Tian, L. Xie, X. Jiang, and J. Huang. A hybrid classification framework based on clustering. IEEE Transactions on Industrial Informatics, 16(4):2177–2188, 2020.
-
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/88771-
dc.description.abstract在金融市場與貸款活動的發展下,對金融機構和借款人來說,預測貸款違約的機率是個極為重要的研究議題。本論文使用群聚演算法將資料根據相似性分成不同的群組,再依照分群結果建立分類器。根據實驗結果顯示,基於此種方式建立的預測系統可以有效的提高預測違約風險的準確性。另外本論文會透過選擇不同的特徵來分析其對違約預測的影響,發現同時使用借款人資料和貸款相關資料可以取得較好的表現。zh_TW
dc.description.abstractWith the development of financial markets and lending activities, predicting the probability of loan default has become a crucial research topic for financial institutions and lenders. This thesis utilizes clustering techniques for pre-classification and subsequently develops a classifier. The resulting system enhances the accuracy of default prediction. Additionally, this thesis examines the impact of different features for the prediction of default and finds that simultaneously utilizing borrower information and loan-related data achieves better performance.en
dc.description.provenanceSubmitted by admin ntu (admin@lib.ntu.edu.tw) on 2023-08-15T17:43:12Z
No. of bitstreams: 0
en
dc.description.provenanceMade available in DSpace on 2023-08-15T17:43:12Z (GMT). No. of bitstreams: 0en
dc.description.tableofcontents口試委員審定書 i
摘要 ii
Abstract iii
目錄 iv
圖目錄 vi
表目錄 vii
第一章 緒論 1
1.1 研究動機 1
1.2 論文架構 2
第二章 背景知識 3
2.1 文獻回顧 3
2.2 Clustering 4
2.2.1 K-平均演算法 4
2.2.2 輪廓係數 5
2.3 不平衡資料 5
2.3.1 上採樣 5
2.3.2 下採樣 6
2.3.3 SMOTE 6
2.4 混淆矩陣 7
2.4.1 精確率 8
2.4.2 召回率 8
2.4.3 F1-score 8
2.4.4 ROC, AUC 9
第三章 實驗方法 10
3.1 實驗設計 10
3.2 資料來源及處理 11
3.3 資料分群 12
3.4 模型訓練 13
第四章 實驗結果 14
4.1 實驗一 15
4.2 實驗二 17
4.3 實驗三 20
第五章 結論與建議 23
5.1 結論 23
5.2 未來展望 24
參考文獻 25
-
dc.language.isozh_TW-
dc.subject貸款違約預測zh_TW
dc.subject不平衡資料zh_TW
dc.subject集成學習zh_TW
dc.subject分群zh_TW
dc.subject分類zh_TW
dc.subjectensemble learningen
dc.subjectimbalanced dataen
dc.subjectclusteringen
dc.subjectcredit default predictionen
dc.subjectclassificationen
dc.title混合模型應用於貸款違約預測zh_TW
dc.titleHybrid model for loan default predictionen
dc.typeThesis-
dc.date.schoolyear111-2-
dc.description.degree碩士-
dc.contributor.oralexamcommittee陸裕豪;王釧茹;金國興zh_TW
dc.contributor.oralexamcommitteeU-Hou Lok;Chuan-Ju Wang;Gow-Hsing Kingen
dc.subject.keyword貸款違約預測,集成學習,不平衡資料,分群,分類,zh_TW
dc.subject.keywordcredit default prediction,ensemble learning,imbalanced data,clustering,classification,en
dc.relation.page27-
dc.identifier.doi10.6342/NTU202301992-
dc.rights.note未授權-
dc.date.accepted2023-08-07-
dc.contributor.author-college電機資訊學院-
dc.contributor.author-dept資訊工程學系-
顯示於系所單位:資訊工程學系

文件中的檔案:
檔案 大小格式 
ntu-111-2.pdf
  未授權公開取用
1.07 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved