Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 公共衛生學院
  3. 健康數據拓析統計研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/102168
標題: 結合變異數數量性狀基因座篩選與懲罰張量迴歸於基因基因交互作用分析
Combining Variance Quantitative Trait Loci Selection and Penalized Tensor Regression in Gene-Gene Interactions Analysis
作者: 陳詠恕
Yong-Shu Chen
指導教授: 林菀俞
Wan-Yu Lin
關鍵字: 基因基因交互作用,變異數數量性狀基因座張量迴歸臺灣人體生物資料庫單核苷酸多態性
gene-gene interaction,variance quantitative trait locitensor regressionTaiwan Biobanksingle nucleotide polymorphism
出版年 : 2026
學位: 碩士
摘要: 許多基因遺傳變異已經被發現與人類表型或複雜疾病的形成有關。然而,基因與基因間(G×G)的交互作用卻難以被偵測出來,更遑論其效果量的估計。現今,檢測變異數數量性狀基因座(variance quantitative trait loci, vQTL)以及使用懲罰張量迴歸(penalized tensor regression, PTensor)有助於同時達成上述兩項目標。考量傳統估計G×G交互作用的效果量其時間成本較高,本次研究目的在於結合這兩種方法,建立一套兩階段分析策略。首先,在第一階段時,分別使用單變量尺度檢定(univariate scale test, UST)、分位數整合線性模型(quantile integral linear model, QUAIL)、離差迴歸模型(deviation regression model, DRM)、Levene’s檢定(Levene’s test, LT),以及雙重廣義線性模型(double generalized linear model, DGLM)等方法來篩選vQTL。接著,於第二階段時,將篩選出來的vQTL透過PTensor來估計G×G交互作用的效果量。
我們將此策略應用在臺灣人體生物資料庫(Taiwan Biobank, TWB),並將其區分為TWB1(作為複製性群體)與TWB2(作為探索性群體)兩個子群體進行分析,樣本數分別為25,200以及93,708位受試者。本次分析主要探討血紅素(hemoglobin, HB)、血球比容(hematocrit, HCT)、紅血球數量(red blood cells, RBC)、白血球數量(white blood cells, WBC)、血小板數量(platelets)等性狀的G×G交互作用。此外,也會根據運算時間和G×G交互作用從TWB2成功複製到TWB1的比例來評估各分析策略的表現。我們的結果顯示,兩階段分析策略可以減少PTensor需要估計的係數數量,因此能有效減少運算時間。同時,由於兩階段分析策略確保G×G交互作用皆是由兩個vQTL所組成,故在血紅素和紅血球數量等性狀中,能提高G×G交互作用在不同的群體中成功複製的比例。在這些兩階段分析中,特別是在血紅素和紅血球數量等性狀上,DRM-PTensor和DGLM-PTensor這兩個方法表現出最高的複製成功率。然而,如果同時考量到計算效率,則以DRM-PTensor的表現較佳,使其成為兼具高效能與實用性的首選方法。
Numerous genetic variants have been identified as contributing factors to human phenotypes and the development of complex diseases. However, detecting gene-gene (G×G) interactions is much more difficult, not to mention the challenge of estimating their effect size. Nowadays, identifying variance quantitative trait loci (vQTL) and using penalized tensor regression (PTensor) can simultaneously achieve the two goals mentioned above. Given the computational burden associated with traditional G×G effect size estimation, this study proposes a two-stage analytical strategy integrating these two methods. In the first stage, we screened for variance quantitative trait loci (vQTL) using the Univariate Scale Test (UST), Deviation Regression Model (DRM), Levene’s Test (LT), Double Generalized Linear Model (DGLM), and Quantile Integral Linear Model (QUAIL). Subsequently, in the second stage, PTensor was employed to estimate the G×G interaction effect sizes for the identified vQTL.
We applied our strategies to the Taiwan Biobank (TWB) data, which were divided into two cohorts, TWB1 (as a replication cohort) and TWB2 (as a discovery cohort), containing 25,200 and 93,708 individuals, respectively. The analysis aimed to explore G×G interactions for hemoglobin (HB), hematocrit (HCT), red blood cells (RBC), white blood cells (WBC), and platelets. In addition, the performance of each analysis strategy was evaluated based on the computation time and the replication rate of G×G interactions from TWB2 to TWB1. Our results indicated that the two-stage strategies reduced the number of coefficients to be estimated by PTensor, thereby effectively decreasing the computation time. Moreover, because the two-stage strategy ensured that G×G interactions were composed of two vQTL, it improved the replication rate of G×G interactions for traits such as HB and RBC across different cohorts. Among the two-stage analyses, the DRM-PTensor and DGLM-PTensor methods achieved the highest replication rate across multiple traits, specifically HB and RBC. However, when computational efficiency was also taken into consideration, the DRM-PTensor method performed more optimally, making it the preferred approach for both high efficacy and practical utility.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/102168
DOI: 10.6342/NTU202600429
全文授權: 未授權
電子全文公開日期: N/A
顯示於系所單位:健康數據拓析統計研究所

文件中的檔案:
檔案 大小格式 
ntu-114-1.pdf
  未授權公開取用
3.93 MBAdobe PDF
顯示文件完整紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved