Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 工學院
  3. 工程科學及海洋工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/26499
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor黃乾綱
dc.contributor.authorChia-Jui Yangen
dc.contributor.author楊家瑞zh_TW
dc.date.accessioned2021-06-08T07:12:39Z-
dc.date.copyright2008-08-05
dc.date.issued2008
dc.date.submitted2008-07-29
dc.identifier.citationBennett, S. P., C. G. Nevill-Manning, et al. (2003). '3MOTIF: visualizing conserved protein sequence motifs in the protein structure database.' Bioinformatics 19(4): 541-2.
Birzele, F. and S. Kramer (2006). 'A new representation for protein secondary structure prediction based on frequent patterns.' Bioinformatics 22(21): 2628-34.
Bystroff, C. and D. Baker (1998). 'Prediction of local structure in proteins using a library of sequence-structure motifs.' J Mol Biol 281(3): 565-77.
Capra, J. A. and M. Singh (2007). 'Predicting functionally important residues from sequence conservation.' Bioinformatics 23(15): 1875-82.
Chen, S. C. and I. Bahar (2004). 'Mining frequent patterns in protein structures: a study of protease families.' Bioinformatics 20 Suppl 1: i77-85.
Espadaler, J., O. Romero-Isart, et al. (2005). 'Prediction of protein-protein interactions using distant conservation of sequence patterns and structure relationships.' Bioinformatics 21(16): 3360-8.
Fields, P. A. (2001). 'Review: Protein function at thermal extremes: balancing stability and flexibility.' Comp Biochem Physiol A Mol Integr Physiol 129(2-3): 417-31.
Halperin, E., J. Buhler, et al. (2003). 'Detecting protein sequence conservation via metric embeddings.' Bioinformatics 19 Suppl 1: i122-9.
Hvidsten, T. R., A. Kryshtafovych, et al. (2003). 'A novel approach to fold recognition using sequence-derived properties from sets of structurally similar local fragments of proteins.' Bioinformatics 19 Suppl 2: ii81-91.
Livingstone, C. D. and G. J. Barton (1993). 'Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation.' Comput Appl Biosci 9(6): 745-56.
Miguel, R. N. (2004). 'Sequence patterns derived from the automated prediction of functional residues in structurally-aligned homologous protein families.' Bioinformatics 20(15): 2380-9.
Pei, J. and N. V. Grishin (2001). 'AL2CO: calculation of positional conservation in a protein sequence alignment.' Bioinformatics 17(8): 700-12.
Schueler-Furman, O. and D. Baker (2003). 'Conserved residue clustering and protein structure prediction.' Proteins 52(2): 225-35.
Shi, S., Y. Zhong, et al. (2007). 'Searching for three-dimensional secondary structural patterns in proteins with ProSMoS.' Bioinformatics 23(11): 1331-8.
Wallace, A. C., N. Borkakoti, et al. (1997). 'TESS: a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites.' Protein Sci 6(11): 2308-23.
Wang, K. and R. Samudrala (2005). 'FSSA: a novel method for identifying functional signatures from structural alignments.' Bioinformatics 21(13): 2969-77.
Yang, A. S. and L. Y. Wang (2003). 'Local structure prediction with local structure-based sequence profiles.' Bioinformatics 19(10): 1267-74.
Ye, K., W. A. Kosters, et al. (2007). 'An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences.' Bioinformatics 23(6): 687-93.
Yuan, Z., J. Zhao, et al. (2003). 'Flexibility analysis of enzyme active sites by crystallographic temperature factors.' Protein Eng 16(2): 109-14.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/26499-
dc.description.abstract蛋白質結構分析主要可分成整體(global)結構與局部(local)結構兩個層面,其中局部結構對於蛋白質功能性的分析特別息息相關,在某一蛋白質分群中常出現的相似結構片段,可能具有蛋白質生物性或演化上的意義,生物學家稱這些局部結構為保存區域(conserved region)。然而根據PDB網站所蒐集的蛋白質結構數量,目前已突破五萬大關,在這麼多的蛋白質中結構中,如何利用資料探勘技術,擷取出有意義的局部結構,進一步鑑定是否就是所謂的保存區域,便成為生物學家熱切的研究方向。
在本論文中,以NRS(neighborhood residues sphere)的概念,利用一個球體空間來記錄蛋白質局部結構中胺基酸殘基的分佈狀況,為了達到快速比對與分群的目的,我們嘗試將蛋白質結構編碼,以一維資訊儲存成特徵值來代表每個局部結構的空間資訊,經過不斷的探討與實驗,驗證了各種不同雜湊格設計方式的優劣以及緩衝區的必要性,進而改良出最能準確紀錄局部結構資訊的編碼方式,並應用在保存區域的探索上,期望在分類於同一種酵素底下的蛋白質中,找出那些對催化作用有影響力的區域。
此外,我們也將局部結構編碼方式應用在蛋白質整體結構比對上,快速地找出整體結構間高度相似的局部區域,利用它們作為比對時的基礎,將結構轉換至同一座標系統以方便作比對,並觀察蛋白質結構中其他區域的相似度,進而延伸探討stability與flexibility的問題。
本論文的出發點,在於希望能發展出一套快速描述局部結構之空間資訊的演算法流程,為每個蛋白質建立出可能的保存區域候選結構,實驗成果以及中間所遭遇的問題都值得將來在面對同樣的議題時,提出思考或是改進的空間。
zh_TW
dc.description.abstractAnalysis of protein structure were mainly divided into two aspects – global structure and local structure,especially the latter correlated closely with analysis of protein function. Most biologists supposed when some frequent patterns reveal in certain protein structure group, it may have some meanings of protein function or evolution in these regions, biologists usually name these regions “conserved regions”. Unfortunately it is very time-consuming when we want to find these conserved regions in a huge database of protein structure, and therefore how to use technology of data mining to solve this problem has become a hot thesis of bioinformatics.
In this paper, we use concept of NRS (neighborhood residues sphere) to record distribution of amino acid residue of protein local structure. In order to cluster similar local structure quickly, we encoded every protein local structure to 1-Dimension information. Through heuristic experiments and discussions, we verified accuracy of every encoding method. Further we applied encoding method to mine possible conserved regions which may catalyze in enzyme structure classification database. Finally we also discussed the issue of flexibility and stability of global structure based on this structure encoding method scheme.
en
dc.description.provenanceMade available in DSpace on 2021-06-08T07:12:39Z (GMT). No. of bitstreams: 1
ntu-97-R95525053-1.pdf: 3578944 bytes, checksum: 71d0aeb9622ea98318cbfdb52493ce82 (MD5)
Previous issue date: 2008
en
dc.description.tableofcontents口試委員會審定書………………………………………………………………... ii
誌謝………………………………………………………………………………... iii
中文摘要…………………………………………………………………………... iv
英文摘要…………………………………………………………………………... v
Chapter 1 導論 - 1 -
1.1 動機 - 1 -
1.2 蛋白質整體結構與區域結構 - 1 -
1.3 保存區域(conserved region) - 3 -
1.4 以結構編碼方式應用在蛋白質分群 - 3 -
1.5 章節介紹 - 4 -
Chapter 2 文獻探討 - 5 -
2.1 研究背景介紹 - 5 -
2.1.1 蛋白質結構的組成 - 5 -
2.1.1 酵素(Enzyme) - 5 -
2.1.2 Protein Data Bank - 6 -
2.1.3 其他蛋白質資料庫 - 7 -
2.2 保存區域的層級 - 7 -
2.2.1 序列上的保存區域 - 7 -
2.2.2 結構上的保存區域 - 8 -
2.2.3 功能性residue與保存區域關係 - 9 -
2.2.4 Conservation與蛋白質演化、功能上的關係 - 10 -
2.3 保存區域定義與尋找 - 10 -
Chapter 3 區域結構編碼設計與討論 - 14 -
3.1 設計背景與動機 - 14 -
3.2 基本編碼方式 - 15 -
3.2.1 新座標系統轉換 - 15 -
3.2.2 鄰近區域結構編碼 - 16 -
3.3 結構編碼演算法流程 - 18 -
3.4 結構編碼方法分析與探討 - 18 -
3.5 結構編碼方法改良 - 20 -
3.6 蛋白質結構比對 - 21 -
Chapter 4 實驗與保存區域探尋 - 23 -
4.1 實驗:結構編碼方法比較 - 23 -
4.1.1 實驗設計 - 23 -
4.1.2 實驗流程 - 24 -
4.1.3 實驗結果與討論 - 25 -
4.1.4 門檻值(threshold)調整測試 - 28 -
4.2 實驗:保存區域探尋 - 30 -
4.2.1 實驗設計 - 30 -
4.2.2 實驗結果 - 30 -
4.2.3 更多的實驗結果 - 33 -
Chapter 5 應用:結構編碼方法應用於整體結構比對 - 41 -
5.1 應用背景 - 41 -
5.2 整體結構比對設計 - 42 -
5.3 演算法流程 - 44 -
5.4 實驗 - 45 -
5.4.1 實驗設計 - 45 -
5.4.2 比對數據 - 45 -
5.4.3 實驗結果探討 - 46 -
Chapter 6 結論與未來展望 - 49 -
6.1 結論與貢獻 - 49 -
6.2 未來展望 - 50 -
參考文獻 - 52 -
dc.language.isozh-TW
dc.title以改良式蛋白質結構編碼方法應用在保存區域探尋zh_TW
dc.titleMining conserved regions by an improved protein structural encoding methoden
dc.typeThesis
dc.date.schoolyear96-2
dc.description.degree碩士
dc.contributor.oralexamcommittee張瑞益,歐陽彥正,鄭貽生
dc.subject.keyword保存區域,蛋白質結構比對,編碼,索引,幾何雜湊,zh_TW
dc.subject.keywordconserved region,protein structure comparison,encoding,indexing,geometric hashing,en
dc.relation.page52
dc.rights.note未授權
dc.date.accepted2008-07-31
dc.contributor.author-college工學院zh_TW
dc.contributor.author-dept工程科學及海洋工程學研究所zh_TW
顯示於系所單位:工程科學及海洋工程學系

文件中的檔案:
檔案 大小格式 
ntu-97-1.pdf
  目前未授權公開取用
3.5 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved