利用蛋白質序列與結構關係預測酵素種類

Yu-Shing Lin; 林育星

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/32645

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	歐陽彥正
dc.contributor.author	Yu-Shing Lin	en
dc.contributor.author	林育星	zh_TW
dc.date.accessioned	2021-06-13T04:12:49Z	-
dc.date.available	2006-07-31
dc.date.copyright	2006-07-31
dc.date.issued	2006
dc.date.submitted	2006-07-24
dc.identifier.citation	1. Brenner, S.E., A tour of structural genomics. Nat Rev Genet, 2001. 2(10): p. 801-9. 2. Pellegrini, M., Computational methods for protein function analysis. Curr Opin Chem Biol, 2001. 5(1): p. 46-50. 3. Ferre, F., et al., Functional annotation by identification of local surface similarities: a novel tool for structural genomics. BMC Bioinformatics, 2005. 6: p. 194. 4. Shulman-Peleg, A., R. Nussinov, and H.J. Wolfson, SiteEngines: recognition and comparison of binding sites and protein-protein interfaces. Nucleic Acids Res, 2005. 33(Web Server issue): p. W337-41. 5. Torrance, J.W., et al., Using a library of structural templates to recognise catalytic sites and explore their evolution in homologous families. J Mol Biol, 2005. 347(3): p. 565-81. 6. Wallace, A.C., N. Borkakoti, and J.M. Thornton, TESS: a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites. Protein Sci, 1997. 6(11): p. 2308-23. 7. Campbell, S.J., et al., Ligand binding: functional site location, similarity and docking. Curr Opin Struct Biol, 2003. 13(3): p. 389-95. 8. Shin, J.M. and D.H. Cho, PDB-Ligand: a ligand database based on PDB for the automated and customized classification of ligand-binding structures. Nucleic Acids Res, 2005. 33(Database issue): p. D238-41. 9. Casari, G., C. Sander, and A. Valencia, A method to predict functional residues in proteins. Nat Struct Biol, 1995. 2(2): p. 171-8. 10. Hannenhalli, S.S. and R.B. Russell, Analysis and prediction of functional sub-types from protein sequence alignments. J Mol Biol, 2000. 303(1): p. 61-76. 11. Landgraf, R., I. Xenarios, and D. Eisenberg, Three-dimensional cluster analysis identifies interfaces and functional residue clusters in proteins. J Mol Biol, 2001. 307(5): p. 1487-502. 12. Campbell, S.J. and R.M. Jackson, Diversity in the SH2 domain family phosphotyrosyl peptide binding site. Protein Eng, 2003. 16(3): p. 217-27. 13. Berman, H.M., et al., The Protein Data Bank. Nucleic Acids Res, 2000. 28(1): p. 235-42. 14. Bairoch, A., The ENZYME data bank in 1999. Nucleic Acids Res, 1999. 27(1): p. 310-1. 15. Zuker, M. and R.L. Somorjai, The alignment of protein structures in three dimensions. Bull Math Biol, 1989. 51(1): p. 55-78. 16. Li, W., L. Jaroszewski, and A. Godzik, Clustering of highly homologous sequences to reduce the size of large protein databases. Bioinformatics, 2001. 17(3): p. 282-3. 17. Jonassen, I., et al., Structure motif discovery and mining the PDB. Bioinformatics, 2002. 18(2): p. 362-7. 18. Pennec, X. and N. Ayache, A geometric algorithm to find small but highly similar 3D substructures in proteins. Bioinformatics, 1998. 14(6): p. 516-22. 19. Hsu, T.-W., The Application of Geometric Hashing in Special Protein Structure Comparison. Master Thesis, 2006. 20. Darby Tien-Hao Chang, Y.-Z.W., Jung-Hsin Lin, Ming-Jing Hwang and Yen-Jen Oyang, Protemot:prediction of protein binding sites with automatically extracted geometrical templates. Nucleic Acids Res (accepted), 2006.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/32645	-
dc.description.abstract	蛋白質結構上的active site和binding site是蛋白質功能的主要作用區域。因此定性或定量分析active site和binding site的構形和周邊胺基酸的位置分布，一直是以生物計算法預測蛋白質功能的主要方式。目前的預測蛋白質功能預測方法大多是利用已知的protein-ligand complex進行人工或自動結構分析來取得binding site周邊原子的特殊樣式(Motif)，進而預測蛋白質的可能功能。本論文則提供另一個不同的觀點，本研究團隊中認為要決定蛋白質功能雖然是由active site或是binding site的位置而決定，但是在相同功能的蛋白質之間應該會存在一些local conserved region，而這些local conserved region會是在演化過程或是產生作用的過程中變動幅度較小的區塊。換言之，這些local conserved region雖然可能不是binding site周邊的胺基酸，卻影響了active site或是binding site的結構與作用方式。為了描述這些區塊的架構，無法只單就整個單一序列或結構上的分析而決定，而是需要進一步瞭解區塊內序列與結構兩者之間的關係一併考慮。為此，本論文利用了針對每個胺基酸周圍固定範圍圈出一個球體來代表一個局部區塊的資訊，針對每個球體進行序列與結構的分析，進而利用蛋白質分類資料庫enzyme data bank，透過相同分類的蛋白質找出屬於該分類的local conserved region，並討論這些local conserved region與功能間的關係。	zh_TW
dc.description.abstract	The active sites and binding sites of the protein structure are the main working areas of the functional proteins. Based on this assessment, the qualitative and quantitative analyses of the variant characteristics of the active site and binding site are the first steps towards predicting protein functionalities. These characteristics include conformation, bio-chemical property etc. Present methods to predict protein function mostly exploit known protein-ligand complex to conduct factitious or automatic structure analysis to obtain the Motif of surrounded atoms of binding site. However, this thesis is providing a different point of view. Our assumption is though the protein function is decided by the location of active site or binding site, there should be some local conserved regions between the proteins that possess same function. In other words, these local conserved regions might not be the surrounded amino acids of binding site, but it indeed affects the structure and the way of working for active site or binding site. To describe the framework of these areas, it can’t be decided only by a single sequence or the structure analysis. What it needs is a further understanding of the relationship between sequence and structure, and takes it into consideration. To reach this goal, we developed the following steps. First, circle each residue around 10Å as a spheroid to represent a local region. Second, use each spheroid to conduct sequence and structure analysis. Third, with enzyme data bank, we can identify the belonging local conserved region of protein through same category of protein. In the future, further discussion about the relationships between theses local conserved region and the functions should be investigated.	en
dc.description.provenance	Made available in DSpace on 2021-06-13T04:12:49Z (GMT). No. of bitstreams: 1 ntu-95-R93922103-1.pdf: 1360366 bytes, checksum: 273032581ddd57d88778de8ae5e5a93f (MD5) Previous issue date: 2006	en
dc.description.tableofcontents	1. 引言 1 1.1 前言 1 1.2 研究動機 1 1.2.1 Functional site location 2 1.2.2 解決問題 4 1.3 預期成果 5 2. 文獻探討與相關研究 6 2.1 蛋白質資料庫相關研究 6 2.1.1 PDB (Protein Data Bank)： 6 2.1.2 Protein Classification Database 6 2.1.3 SCOP (Structural Classification of Proteins) 6 2.1.4 Enzyme Data Bank 7 2.2 整體比對與局部比對 7 2.3 序列相似度:(Sequence Similarity) 8 2.3.1 動態規劃演算法(Dynamic programming) 8 2.3.2 PAM250 9 2.3.3 P-value 10 2.4 結構相似度 (Structure Similarity) 10 2.4.1 幾何雜湊(Geometric Hash)的基本原理 10 2.4.2 二維的幾何雜湊(Two-dimensional geometric hash)： 11 2.4.3 三維的幾何雜湊 11 2.4.4 幾何雜湊的雜湊表 12 2.4.5 對應殘基數( the number of equivalent residue)： 12 2.4.6 Root Mean Square Deviation, RMSD 12 2.5 分群演算法(Clustering Algorithm) 13 2.6 Motif 14 3. 研究方法： 16 3.1 基本假設 16 3.2 定義Neighborhood Residue Sequence(NRS)： 16 3.3 Sequence alignment 17 3.4 Clustering 17 3.5 Structure alignment 18 3.6 建立pattern 19 3.7 演算法流程： 20 3.7.1 建立library流程： 20 3.7.2 query 流程： 22 4. 實驗結果 24 4.1 NRS分析 24 4.2 利用SCOP database 分類 24 4.2.1 SCOP: Structural Classification of Proteins 24 4.2.2 實驗 25 4.3 利用Enzyme Data Bank分類 27 4.3.1 實驗 27 4.4 motif location 28 4.4.1 實驗 28 4.5 利用酵素分類建立conserved region database 29 4.5.1 實驗 29 4.5.2 實驗 33 4.6 利用序列與結構預測enzyme種類 33 4.6.1 預測enzyme種類 33 4.6.2 實驗 34 5. 結果與討論 36 5.1 結果 36 5.2 相關比較 37 5.3 改進方式 38 5.3.1 序列比對方面 38 5.3.2 結構比對方面 39 6. 結論與未來展望 40 6.1 結論 40 6.2 未來展望 40 7. 參考文獻 42
dc.language.iso	zh-TW
dc.subject	酵素	zh_TW
dc.subject	序列	zh_TW
dc.subject	結構	zh_TW
dc.subject	蛋白質	zh_TW
dc.subject	structure	en
dc.subject	protein	en
dc.subject	enzyme	en
dc.subject	sequence	en
dc.title	利用蛋白質序列與結構關係預測酵素種類	zh_TW
dc.title	Enzyme Class Prediction via Mining Conserved Region in Sequence and Structure	en
dc.type	Thesis
dc.date.schoolyear	94-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	趙坤茂,林榮信,黃乾綱
dc.subject.keyword	蛋白質,酵素,結構,序列,	zh_TW
dc.subject.keyword	protein,enzyme,structure,sequence,	en
dc.relation.page	43
dc.rights.note	有償授權
dc.date.accepted	2006-07-26
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	資訊工程學研究所	zh_TW
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-95-1.pdf 未授權公開取用	1.33 MB	Adobe PDF

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。