基因體選拔中兩種訓練集最佳化準則之比較

Po-Chun Liao; 廖柏鈞

Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/79896

Title:	基因體選拔中兩種訓練集最佳化準則之比較 A Comparison of two criteria for training set optimization in genomic selection
Authors:	Po-Chun Liao 廖柏鈞
Advisor:	廖振鐸(Chen-Tuo Liao)
Keyword:	基因體選拔,基因演算法,全基因組迴歸模式,線性混合模型,限制最大概似估值, genomic selection,genetic algorithm,whole genome regression,linear mixed effects model,restricted maximum likelihood estimate,
Publication Year :	2021
Degree:	碩士
Abstract:	"雖然次世代定序 (Next Generation Sequencing) 技術目前可協助降低基因型獲得 (genotyping) 的成本，但表現型獲得 (phenotyping) 於育種領域的執行成本上仍是一大考驗。因此，基因體選拔 (genomic selection) 可藉由篩選出使預測準確率最大化的特定訓練集 (training set) 資料來降低該訓練集於表現型獲得所需的成本並建立預測模型。在基因體選拔的過程中，較佳的訓練集能協助我們建立預測測試集數量性狀較為精準的模型。而從候選集 (candidate set) 篩選對應每組測試集 (testing set) 的最佳訓練集過程中，本論文各採用以r-score和mspe-score作為目標函數 (objective function) 的基因演算法 (genomic algorithms, GA) 來求之。透過基因演算法選出的訓練集在表現型獲得後，將可用來估計測試集個體的育種價 (genomic estimated breeding values, GEBVs)。基因演算法中採用的目標函數r-score和mspe-score可分別由測試集的表現型值與育種價間的皮爾森相關係數 (Pearson's correlation) 與均方預測誤差 (mean squared prediction error) 推導而得。此外，本論文以Tropical rice和 rice44k兩組資料作為範例，並採用一般常見的皮爾森相關係數及均方根誤差來評估預測模型的準確度；其中，由於rice44k資料的水稻個體共含六種次族群 (subpopulations) 結構，在建模過程除了比較測試集已知及未知外，還需考量次族群的影響。 "
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/79896
DOI:	10.6342/NTU202101936
Fulltext Rights:	同意授權(全球公開)
Appears in Collections:	農藝學系

Files in This Item:

File	Size	Format
U0001-3007202115234900.pdf	2.08 MB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets