請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/71160
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 周呈霙(Cheng-Ying Chou) | |
dc.contributor.author | Ching-Chen Lu | en |
dc.contributor.author | 呂晉成 | zh_TW |
dc.date.accessioned | 2021-06-17T04:56:08Z | - |
dc.date.available | 2020-08-21 | |
dc.date.copyright | 2020-08-21 | |
dc.date.issued | 2020 | |
dc.date.submitted | 2020-08-20 | |
dc.identifier.citation | 1.Centers for Disease Control and Prevention. “Diabetes Tests.” (2019).Taken from https://www.cdc.gov/diabetes/basics/getting-tested.html 2.CH.Tseng. '決策樹 Decision trees.' (2017) Taken from https://chtseng.wordpress.com/2017/02/10/%E6%B1%BA%E7%AD%96%E6%A8%B9-decision-trees/ 3.Chih-Sheng Huang. '支撐向量機(support vector machine, SVM)詳細推導' (2018). Taken from https://medium.com/@chih.sheng.huang821/%E6%A9%9F%E5%99%A8%E5%AD%B8%E7%BF%92-%E6%94%AF%E6%92%90%E5%90%91%E9%87%8F%E6%A9%9F-support-vector-machine-svm-%E8%A9%B3%E7%B4%B0%E6%8E%A8%E5%B0%8E-c320098a3d2e 4.Chih-Sheng Huang. '機器學習: 分類(Classification)-線性區別分析( Linear Discriminant Analysis)' (2018). Taken from https://medium.com/@chih.sheng.huang821/%E6%A9%9F%E5%99%A8%E5%AD%B8%E7%BF%92-lda%E5%88%86%E9%A1%9E%E6%BC%94%E7%AE%97%E6%B3%95-14622f29e4dc 5.Corriere, Mark, Nira Rooparinesingh, and Rita Rastogi Kalyani. 'Epidemiology of diabetes and diabetes complications in the elderly: an emerging public health burden.' Current diabetes reports 13.6 (2013): 805-813. 6.Data science beginners in R bloggers. 'Quick Tutorial On LASSO Regression With Example.' Taken from https://www.r-bloggers.com/quick-tutorial-on-lasso-regression-with-example/ 7.Green, David Marvin, and John A. Swets. Signal detection theory and psychophysics. Vol. 1. New York: Wiley, 1966. 8.Gupta, Prashant. 'Decision trees in machine learning.' Towards Data Science (2017). 9.HealthNewsReview.org. ” Understanding medical tests: sensitivity, specificity, and positive predictive value.” (2020) Taken from https://www.healthnewsreview.org/toolkit/tips-for-understanding-studies/understanding-medical-tests-sensitivity-specificity-and-positive-predictive-value/ 10.Heho Health. “What exactly is the urine test of the health check ? ” (2018) Taken from https://heho.com.tw/archives/28399 11.Heyden, Siegfried, et al. 'The relationship of weight change to changes in blood pressure, serum uric acid, cholesterol and glucose in the treatment of hypertension.' Journal of chronic diseases 38.4 (1985): 281-288. 12.Hinds, Terry D., and David E. Stec. 'Bilirubin Safeguards Cardiorenal and Metabolic Diseases: a Protective Role in Health.' Current hypertension reports 21.11 (2019): 87. 13.ITREAD01.COM. 'KNN演算法(有監督學習演算法). (2018) ' Taken from https://www.itread01.com/content/1546211528.html 14.NTUH檢驗醫學部. “閱讀檢驗報告是一門重要的學問.” (2020). 取自https://health.ntuh.gov.tw/health/hrc_v3/DataFiles/kensa.htm 15.O'Sullivan, John B. 'Age gradient in blood glucose levels: magnitude and clinical implications.' Diabetes 23.8 (1974): 713-715. 16.Quinlan, J. Ross. 'Induction of decision trees.' Machine learning 1.1 (1986): 81-106. 17.Saxena, Richa, et al. 'Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels.' Science 316.5829 (2007): 1331-1336. 18.Segre, G., G. L. Turco, and G. Vercellone. 'Modeling blood glucose and insulin kinetics in normal, diabetic and obese subjects.' Diabetes 22.2 (1973): 94-103. 19.Sun, Guang, et al. 'Altered calcium homeostasis is correlated with abnormalities of fasting serum glucose, insulin resistance, and β-cell function in the Newfoundland population.' Diabetes 54.11 (2005): 3336-3339. 20.Tibshirani, Robert. 'Regression shrinkage and selection via the lasso.' Journal of the Royal Statistical Society: Series B (Methodological) 58.1 (1996): 267-288. 21.Tommy Huang. '機器/統計學習: 羅吉斯回歸(Logistic regression).'(2018) Taken from https://medium.com/@chih.sheng.huang821/%E6%A9%9F%E5%99%A8-%E7%B5%B1%E8%A8%88%E5%AD%B8%E7%BF%92-%E7%BE%85%E5%90%89%E6%96%AF%E5%9B%9E%E6%AD%B8-logistic-regression-aff7a830fb5d 22.王斌. “膽紅素對葡萄糖氧化酶法測定血糖結果的影響.” 中國誤診學志. (2009).頁820-821 23.向泽林, et al. '体质指数, 腰围, 腰臀比与高血压, 高血糖的关系及三者对高血压, 高血糖的预测价值.' 中華疾病控制雜誌 12.3 (2008): 207-210. 24.李美月. '糖尿病人為何容易發生高血脂?.' 高醫醫訊月刊 第35卷1期.(2015) 25.洪僖黛. '典型相關(Canonical Correlation).' 洪僖黛老師教材(IEM).(2020) 26.徐嘉吟. “心血管疾病的新指標—高敏感性C-反應蛋白(hs-CRP).” (2020). 頁65-67 27.許惠恆. “高敏感度C-反應蛋白(hsCRP) 在偵測及預防心血管疾病方面的臨床應用.” (2020).頁2-3 28.黃萱, and 林毅欣. '高齡 (65 歲以上) 第 2 型糖尿病患者之處置: 文獻回顧.' 內科學誌 29.6 (2018): 374-380. 29.廖敏萍. “什麼是糖化血色素(HbA1c).”(2020). 取自http://www.yeezen.com.tw/Document/KLFile/yzkl18001.htm 30.衛生福利部統計處. “107年國民醫療保健支出統計表.” (2018) 取自https://dep.mohw.gov.tw/DOS/cp-2156-51120-113.html 31.饒孝先, 蔡昆原, 邱泓文, 徐建業. '利用類神經網路預測糖尿病發生情形. ' (2009) 32.张京, et al. '北京市居民 2 型糖尿病现况调查和危险因素研究.' 中华流行病学杂志 32.4 (2011): 357-360. 33.衛生福利部國民健康署. ’’ 糖尿病.” (2018) 取自https://www.hpa.gov.tw/Pages/List.aspx?nodeid=359 34.高雄長庚紀念醫院檢驗醫學部. “檢驗醫學部檢驗手冊_ Cl (Chloride).” (2015) 取自https://www1.cgmh.org.tw/intr/intr4/c8920/manual/N_search_Tbl.asp?no=178 | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/71160 | - |
dc.description.abstract | 本研究的目的是利用最小絕對值收斂和選擇算子(LASSO)與典型相關分析(CCA)來篩選餐後血糖(GLU_2hr_PC)、空腹血糖(GLU_AC)和糖化血紅蛋白(HbA1c)這三種血糖指數之潛在因子。同時,應用了羅吉斯回歸(LR)、K-近鄰演算法(KNN)、決策樹(DT)、線性判別分析(LDA)和支持向量機(SVM)五種不同的機器學習方法,對健檢資料集的患者是否有糖尿病進行分類。然後,將混淆矩陣建立的相關指標用於比較這五項分類器的表現。從研究結果可以發現,糖尿病的危險因素可能包括鈣離子(Ca)、氯離子(Cl)、肌酸酐 (CRE)、高敏感度C-反應蛋白(hs_CRP)、尿比重(Sp_Gr_C)、年齡、身高體重指數…等等。而在機器學習的分類結果裡,綜觀所有的評分指標,LDA在此資料集對於糖尿病病患的偵測能力最佳。這項研究將提高我們對於健康檢查的重視,並且針對降低糖尿病病例的臨床決策有所幫助。 | zh_TW |
dc.description.abstract | The objective of this study is to identify the potential factors of glycemic indices, i.e., postprandial blood glucose (GLU_2hr_PC), fasting blood glucose (GLU_AC), and glycated hemoglobin (HbA1c) by least absolute shrinkage and selection operator (LASSO) and canonical correlation analysis (CCA). At the same time, five different machine learning methods including logistic regression (LR), K-nearest neighbor classification (KNN), decision tree (DT), linear discriminant analysis (LDA), and support vector machine (SVM) were applied to build the learning system in order to classify whether the patients in the healthcare dataset have diabetes or not. Then, the relevant indicators established by the confusion matrix were used to compare the performances of these five classifiers. It can be found from the research results that the risk factors for diabetes may include calcium (Ca), chloride (Cl), creatinine(CRE), high sensitivity C-reactive protein(hs_CRP), urine Specific Gravity(Sp_Gr_C), age, Body Mass Index, etc. In the classification results of machine learning, LDA has the best detection ability for diabetic patients in this dataset among all the evaluate indicators. This work will increase people’s attention to the health checkups and aid in the clinical decision making for reducing diabetes cases. | en |
dc.description.provenance | Made available in DSpace on 2021-06-17T04:56:08Z (GMT). No. of bitstreams: 1 U0001-1908202003171300.pdf: 1276376 bytes, checksum: c7ab259a32b5c6f8b249fb1f16812baf (MD5) Previous issue date: 2020 | en |
dc.description.tableofcontents | Content 口試委員會審定書 i 摘要 ii Abstract iii List of Figures vii List of Tables ix Chapter 1 Introduction 1 Chapter 2 Literature Reviews 3 Chapter 3 Methods 5 3.1 LASSO regression 5 3.2 Canonical Correlation Analysis 8 3.3 Machine Learning Classifiers 10 3.3.1 Logistic regression 10 3.3.2 K-nearest neighbor classification 12 3.3.3 Decision tree 14 3.3.4 Linear discriminant analysis 17 3.3.5 Support vector machine 18 Chapter 4 Data Analysis 22 4.1 The dataset 22 4.2 Dimension reduction processing 26 Chapter 5 Results 34 5.1 Model of CCA 34 5.2 Machine learning classifiers 37 5.2.1 Evaluation measurement 39 5.2.2 Logistic regression 42 5.2.3 K-nearest neighbor classification 47 5.2.4 Decision tree 48 5.2.5 Linear discriminant analysis 49 5.2.6 Support vector machine 51 Chapter 6 Conclusion 54 References 59 | |
dc.language.iso | en | |
dc.title | 機器學習在血糖指數建模和預測中的應用 | zh_TW |
dc.title | Application of Machine Learning in the Modeling and the Prediction of Glycemic Indices | en |
dc.type | Thesis | |
dc.date.schoolyear | 108-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 陳定立(Ting-Li Chen),王偉仲(Wei-chung Wang) | |
dc.subject.keyword | 糖尿病,機器學習,血糖,典型相關分析,最小絕對值收斂和選擇算子, | zh_TW |
dc.subject.keyword | diabetes,machine learning,blood glucose,canonical correlation analysis,least absolute shrinkage and selection operator, | en |
dc.relation.page | 62 | |
dc.identifier.doi | 10.6342/NTU202004062 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2020-08-20 | |
dc.contributor.author-college | 共同教育中心 | zh_TW |
dc.contributor.author-dept | 統計碩士學位學程 | zh_TW |
顯示於系所單位: | 統計碩士學位學程 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
U0001-1908202003171300.pdf 目前未授權公開取用 | 1.25 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。