Please use this identifier to cite or link to this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74368
Title: | 以資料探勘技術偵測財務報表舞弊 Fraudulent Financial Statement Detection Using Data Mining Techniques |
Authors: | Wen-Yu Chiang 江玟諭 |
Advisor: | 林嬋娟 |
Keyword: | 資料探勘,隨機森林,文字探勘,財務報表舞弊,舞弊三角, Data Mining,Random Forest,Text Mining,Fraudulent Financial Statement,Fraud Triangle, |
Publication Year : | 2019 |
Degree: | 碩士 |
Abstract: | 本文以資料探勘技術應用於我國財務報表舞弊之偵測,並探討文字資訊對於預測財務報表舞弊是否具資訊增益量(Information Gain)。在考量舞弊之特性後,本文採隨機森林(Random Forest)此資料探勘技術,並參考舞弊三角架構以及從年報中致股東報告書及營運概況之文字資訊選取與舞弊相關之變數以建構偵測模型。研究結果顯示,隨機森林較傳統迴歸模型更能準確區分舞弊及非舞弊公司,而年報中之文字對於辨別舞弊與非舞弊公司較無資訊增益量。然而,值得注意的是,年報中之不確定性字詞對於區分舞弊及非舞弊公司之重要性於83個變數中居第13名,顯示年報中之不確定性字詞為偵測財務報表舞弊之重要指標之一。 This study attempts to apply data mining techniques on detection of fraudulent financial statements, and investigate whether textual information has information gain for fraudulent financial statements detection. Considering the characteristics of fraud, this study uses Random Forest as data mining techniques to build fraud detection model. Structured variables are selected based on fraud triangle, and textual information is extracted from letter to shareholders and operation review in annual report. The result shows that Random Forest achieved higher classification accuracy than traditional regression model, and the text in annual report has no explicit information gain for distinguishing fraudulent and non-fraudulent companies. However, it is worth noting that the importance of uncertain words in annual report ranks 13 among 83 variables. This implies that tentative words in annual report may be regarded as an important indicator to fraudulent financial statement occurrence. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/74368 |
DOI: | 10.6342/NTU201903029 |
Fulltext Rights: | 有償授權 |
Appears in Collections: | 會計學系 |
Files in This Item:
File | Size | Format | |
---|---|---|---|
ntu-108-1.pdf Restricted Access | 1.1 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.