Please use this Handle URI to cite this document:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/67743
Title: | cidr: A Package of Coefficient of Intrinsic Dependence (CID) and its Application of Finding the Abiotic Stress-specific Gene Modules in Arabidopsis (original title: 本質相關係數套件cidr 及其應用 - 以找尋阿拉伯芥非生物逆境專一之基因群為例) |
Author: | Po-Chih Shen (沈柏志) |
Advisor: | Li-Yu Liu (劉力瑜) |
Keywords: | coefficient of intrinsic dependence, abiotic stress, weighted gene co-expression network analysis, biplot, abiotic stress-specific gene module |
Publication year: | 2017 |
Degree: | Doctoral |
Abstract: | Identifying the key feature variables is an important part of building a model or making a decision. The coefficient of intrinsic dependence (CID) is an association measure that quantifies the relationship among variables. It has performed well in applications such as constructing gene regulatory networks and measuring the relationship between two groups of one- or multi-dimensional variables. To make CID more accessible to other users, such as biologists, we developed an R package, cidr, for computing and visualizing CID. In Chapter 3, we combined weighted gene co-expression network analysis (WGCNA) with CID to find abiotic stress-specific gene modules in Arabidopsis, and summarized the results using heatmaps and biplots. Under cold, heat, and salt stress, we identified two cold stress-specific, three heat stress-specific, and five salt stress-specific gene modules, which may provide preliminary hints about the underlying biological processes. In Chapter 4, we further used subCID values to identify the stress-specific genes within a gene module; a biplot derived from the matrix of per-gene subCID values helped to distinguish the genes specific to each stress. We hope that the cidr package and the methods described in this dissertation will help reveal the biological mechanisms hidden in large-scale genomic datasets. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/67743 |
DOI: | 10.6342/NTU201701822 |
Full-text license: | Paid license |
Appears in collections: | Department of Agronomy |
Files in this item:
File | Size | Format | |
---|---|---|---|
ntu-106-1.pdf (not currently authorized for public access) | 3.56 MB | Adobe PDF |
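Unless their copyright terms are specifically stated otherwise, all items in this system are protected by copyright, with all rights reserved.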