Please use this Handle URI to cite this document:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40344
Title: | 預測轉錄因子上與去氧核醣酸結之區段 Prediction of DNA Binding Transcription Factor segments under Specified structure |
Author: | Jen-Jay Hsu 徐振傑 |
Advisor: | 歐陽彥正 (Yen-Jen Oyang) |
Keywords: | support vector machine (SVM), secondary structure, DNA, prediction, binding segments |
Year of publication: | 2008 |
Degree: | Master's |
Abstract: | This thesis investigates how to accurately predict whether a secondary structure segment of a protein binds to DNA. The features of each secondary structure segment are built from characteristics already known to be meaningful in protein-DNA binding, and the thesis shows how to convert the position weight matrices of secondary structure segments of different lengths into a length-independent feature table.
Two datasets were collected, each with its own biological significance. We discuss the factors that cause the performance gap between the two datasets and propose a two-stage method for the dataset on which performance is worse, with the aim of narrowing that gap as far as possible. We also show which type of secondary structure is easiest to classify as DNA-binding or not, and discuss why. On the two datasets, our method achieves, for helix-type secondary structures, 75% coverage, 80% precision, and 92% specificity, and 65% coverage, 85% precision, and 98% specificity, respectively.
This thesis discusses the design of a predictor aimed at identifying the secondary structures in a transcription factor that are involved in interaction with DNA. In particular, the design of the predictor has been optimized for identifying the alpha-helix structures involved in interaction with DNA, owing to their prevalence. The predictor employs a support vector machine (SVM), and the study reported in this thesis focuses on the features exploited for making predictions. Two datasets were used in the experiments conducted in this study. The first dataset was derived from the TF-DNA complexes deposited in the Protein Data Bank (PDB), and the second was derived from the TF sequences deposited in SWISS-PROT. With respect to identifying the alpha-helix structures involved in interaction with DNA, the predictor proposed in this thesis delivered a sensitivity of 75%, a precision of 80%, and a specificity of 92% with the first dataset, and a sensitivity of 65%, a precision of 85%, and a specificity of 98% with the second dataset. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40344 |
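Full-text authorization: | Paid authorization |
Appears in collections: | Department of Computer Science and Information Engineering |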
Files in this item:
File | Size | Format | |
---|---|---|---|
ntu-97-1.pdf (not currently authorized for public access) | 292.83 kB | Adobe PDF |
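Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.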