Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 公共衛生學院
  3. 流行病學與預防醫學研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/86254
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor張淑惠(Shu-Hui Chang)
dc.contributor.authorYu-Ting Chuangen
dc.contributor.author莊羽婷zh_TW
dc.date.accessioned2023-03-19T23:45:02Z-
dc.date.copyright2022-10-07
dc.date.issued2022
dc.date.submitted2022-08-29
dc.identifier.citationBreiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. Blanche, P., Dartigues, J. F., & Jacqmin‐Gadda, H. (2013). Estimating and comparing time‐dependent areas under receiver operating characteristic curves for censored event times with competing risks. Statistics in Medicine, 32(30), 5381-5397. Blanche, P., Proust‐Lima, C., Loubère, L., Berr, C., Dartigues, J. F., & Jacqmin Gadda, H. (2015). Quantifying and comparing dynamic predictive accuracy of joint models for longitudinal marker and time‐to‐event in presence of censoring and competing risks. Biometrics, 71(1), 102-113. Blanche, P., Kattan, M. W., & Gerds, T. A. (2019). The c-index is not proper for the evaluation of-year predicted risks. Biostatistics, 20(2), 347-357. Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1-3. Capitaine, L., Genuer, R., & Thiébaut, R. (2021). Random forests for high-dimensional longitudinal data. Statistical Methods in Medical Research, 30(1), 166-184. Dickson, E. R., Fleming, T. R., Wiesner, R. H., Baldus, W. P., Fleming, C. R., Ludwig, J., & McCall, J. T. (1985). Trial of penicillamine in advanced primary biliary cirrhosis. New England Journal of Medicine, 312(16), 1011-1015. Graf, E., Schmoor, C., Sauerbrei, W., & Schumacher, M. (1999). Assessment and comparison of prognostic classification schemes for survival data. Statistics in Medicine, 18(17‐18), 2529-2545. Heinzl, H., & Kaider, A. (1997). Gaining more flexibility in Cox proportional hazards regression models with cubic spline functions. Computer Methods and Programs in Biomedicine, 54(3), 201-208. Heagerty, P. J., & Zheng, Y. (2005). Survival model predictive accuracy and ROC curves. Biometrics, 61(1), 92-105. Harrell, F. E., Califf, R. M., Pryor, D. B., Lee, K. L., & Rosati, R. A. (1982). Evaluating the yield of medical tests. JAMA, 247(18), 2543-2546. Harrell Jr, F. E. (2015). Regression modeling strategies: with applications to linear models, logistic and ordinal regression, and survival analysis (Vol. 3). New York: Springer. Ishwaran, H., Kogalur, U. B., Blackstone, E. H., & Lauer, M. S. (2008). Random survival forests. The Annals of Applied Statistics, 2(3), 841-860. Ishwaran, H., Kogalur, U. B., Gorodeski, E. Z., Minn, A. J., & Lauer, M. S. (2010). High-dimensional variable selection for survival data. Journal of the American Statistical Association, 105(489), 205-217. Ishwaran, H., Kogalur, U. B., Chen, X., & Minn, A. J. (2011). Random survival forests for high‐dimensional data. Statistical Analysis and Data Mining: The ASA Data Science Journal, 4(1), 115-132. Kaplan, M. M. (1996). Primary biliary cirrhosis. The New England Journal of Medicine, 335(21), 1570-1580. Katzman, J. L., Shaham, U., Cloninger, A., Bates, J., Jiang, T., & Kluger, Y. (2018). DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Medical Research Methodology, 18(1), 1-12. Lin, J., Li, K., & Luo, S. (2021). Functional survival forests for multivariate longitudinal outcomes: Dynamic prediction of Alzheimer’s disease progression. Statistical Methods in Medical Research, 30(1), 99-111. Moradian, H., Yao, W., Larocque, D., Simonoff, J. S., & Frydman, H. (2021). Dynamic estimation with random forests for discrete‐time survival data. Canadian Journal of Statistics. Li, K., Chan, W., Doody, R. S., Quinn, J., Luo, S., & Alzheimer’s Disease Neuroimaging Initiative. (2017). Prediction of conversion to Alzheimer’s disease with longitudinal measures and time-to-event data. Journal of Alzheimer's Disease, 58(2), 361-371. Perperoglou, A., Sauerbrei, W., Abrahamowicz, M., & Schmid, M. (2019). A review of spline function procedures in R. BMC Medical Research Methodology, 19(1), 1-16. Tanner, K. T., Sharples, L. D., Daniel, R. M., & Keogh, R. H. (2021). Dynamic survival prediction combining landmarking with a machine learning ensemble: Methodology and empirical comparison. Journal of the Royal Statistical Society: Series A (Statistics in Society), 184(1), 3-30. van Houwelingen, H. C. (2007). Dynamic prediction by landmarking in event history analysis. Scandinavian Journal of Statistics, 34(1), 70-85. van Houwelingen, H., & Putter, H. (2011). Dynamic prediction in clinical survival analysis. Boca Raton: CRC Press. 葉憲周(2019)探討半競爭風險資料下之不同特徵點比例風險模式動態預測表現(碩士論文)‧台北:國立台灣大學公共衛生學院流行病學與預防醫學研究所。 莊子瑤(2021)半競爭風險資料之不同特徵點回歸模式的受限平均餘命估計比較(碩士論文)‧台北:國立台灣大學公共衛生學院流行病學與預防醫學研究所。
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/86254-
dc.description.abstract在長期追蹤研究中,利用電子健康紀錄 (electronic health record) 準確預測慢性病患者的預後是一項重要的精準醫學議題。在慢性病的長期病程中,首次的復發事件 (recurrent event) 為影響病患存活的重要生物標誌 (biomarker)。而在存活資料中若包含生物標誌事件(如,首次復發)資訊,以及可能的預後因子,此種資料結構即為半競爭風險資料 (semi-competing risks data)。特徵點比例風險模式可在不同的特徵時間點 (landmark times) 納入隨時間變動的共變數資訊,常使用來進行動態存活預測。本研究利用所有不同特徵時間點下半競爭風險資料之子集合組成為堆疊競爭風險資料集,進行動態存活預測。在每個特徵時間點之子集合中,病患可被分為是否發生標誌事件兩組,且只有發生標誌事件的病患才會有標誌事件時間做為預測因子。隨機存活森林是一種常用於預測的機器學習方法,但卻無法直接用於堆疊半競爭風險資料集。因此,我們將結合機器學習方法與特徵點比例風險模式,發展針對堆疊半競爭風險資料集之資料結構之三階段模式建構方法,以進行動態存活預測。於第一階段,我們使用隨機存活森林篩選重要變數。第二階段則基於第一階段被篩選出的重要變數,使用單棵存活樹得到分類資訊,並且以存活樹所得連續型變數的切點作為平滑曲線 (splines) 的節點 (knots),使得第三階段建構之特徵點比例風險模式能更具彈性。因此於第三階段,我們結合在每個特徵時間點的標誌事件狀態以及第二階段得到的分群資訊建構特徵點比例風險模式,以符合資料結構之動態存活預測。在模擬部分,我們考慮由兩種不同的時間相依比例風險模式以及雙變數對數常態模式生成資料之標誌事件時間和存活時間,並且以 AUC、C-index 和 Brier分數評估本研究所提出的各個特徵點比例風險模式以及隨機存活森林之動態存活預測。有限模擬之結果顯示我們所提出之三階段動態預測方法有良好的表現,特別是在雙變數對數常態模式情境況下,整體動態預測表現尤佳。zh_TW
dc.description.abstractIt is an important issue of precision medicine to accurately predict the prognosis of patients with chronic diseases in longitudinal follow-up studies using electronic health records. In the long-term course of a chronic disease, the first recurrent event is a key biomarker affecting the patient’s survival. The survival data including the information of the first recurrent events as the biomarker events and potential prognostic factors are referred to as semi-competing risks data. For the dynamic survival prediction, landmark proportional hazards modeling approach is often used to include the information of time-varying covariates at different landmark times. In this study, we consider the stacked semi-competing risks data across different landmark times. In such data set, patients at each landmark time are categorized into two groups, with and without the biomarker event, and the time to biomarker event can be used as a predictor only for the patient with biomarker event. Random survival forest is a popular machine learning method for prediction but cannot be directly used for the stacked semi-competing risks data. We integrate the machine learning tools and landmark proportional hazards modeling approach to develop a three-stage model construction method for dynamic survival prediction based on semi-competing data. In the first stage, we use the random survival forest to select important covariates. In the second stage, a classification tree is adopted to obtain the classification information based the selected covariates in the first stage. The cut points for continuous covariates in the second stage will be used as the knots of splines in these covariates so that their flexible trends are incorporated into the landmark models in the third stage. The third stage is to construct the landmark proportional hazards models by combining the status of biomarker event at each landmark time and the grouping information in the second stage for dynamic survival prediction to be consistent with the data structure. In simulation study, we consider that the times to biomarker event and death are generated from two different time-dependent proportional hazards models and one bivariate log-normal model. The AUC, C-index and Brier score are used to evaluate the performance of dynamic survival prediction based on the landmark proportional hazards models from the proposed methods and random survival forest. The limited simulation results show that the proposed three-stage dynamic prediction method performs well, especially in the bivariate log-normal model scenario.en
dc.description.provenanceMade available in DSpace on 2023-03-19T23:45:02Z (GMT). No. of bitstreams: 1
U0001-2908202216385000.pdf: 6011619 bytes, checksum: e7af9df2c7c75fedc53e4d06be9438cd (MD5)
Previous issue date: 2022
en
dc.description.tableofcontents誌謝 i 中文摘要 ii 英文摘要 iii 目錄 v 圖目錄 vii 表目錄 xi 第一章 研究動機與目的 1 第二章 文獻回顧 4 第一節 隨機存活森林 4 第二節 動態存活預測之特徵點模式 8 第三節 平滑曲線 10 第四節 評估方法 12 第三章 研究方法 16 第一節 模型建構 17 第二節 比較模型 22 第四章 模擬與實例 26 第一節 資料生成 26 第二節 模擬結果 32 第三節 實例分析 63 第五章 結果與討論 68 參考文獻 70 附錄 73
dc.language.isozh-TW
dc.subject特徵點模式zh_TW
dc.subject動態預測zh_TW
dc.subject隨機存活森林zh_TW
dc.subject機器學習zh_TW
dc.subjectrandom survival forestsen
dc.subjectdynamic predictionen
dc.subjectmachine learningen
dc.subjectlandmark modelen
dc.title結合機器學習的特徵點比例風險模式的半競爭風險資料動態預測zh_TW
dc.titleDynamic survival prediction for semi-competing risks data by landmark proportional hazards modeling combined with machine learningen
dc.typeThesis
dc.date.schoolyear110-2
dc.description.degree碩士
dc.contributor.oralexamcommittee陳秀熙(Hsiu-Hsi Chen),蔡政安(Chen-An Tsai)
dc.subject.keyword機器學習,隨機存活森林,動態預測,特徵點模式,zh_TW
dc.subject.keywordmachine learning,random survival forests,dynamic prediction,landmark model,en
dc.relation.page94
dc.identifier.doi10.6342/NTU202202943
dc.rights.note同意授權(全球公開)
dc.date.accepted2022-08-30
dc.contributor.author-college公共衛生學院zh_TW
dc.contributor.author-dept流行病學與預防醫學研究所zh_TW
dc.date.embargo-lift2025-09-07-
顯示於系所單位:流行病學與預防醫學研究所

文件中的檔案:
檔案 大小格式 
U0001-2908202216385000.pdf5.87 MBAdobe PDF檢視/開啟
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved