Please use this identifier to cite or link to this item:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/22282Full metadata record
| ???org.dspace.app.webui.jsptag.ItemTag.dcfield??? | Value | Language |
|---|---|---|
| dc.contributor.advisor | 陳中明(Chung-Ming Chen) | |
| dc.contributor.author | Ren-Cheng Wang | en |
| dc.contributor.author | 王人晟 | zh_TW |
| dc.date.accessioned | 2021-06-08T04:14:52Z | - |
| dc.date.copyright | 2010-08-17 | |
| dc.date.issued | 2010 | |
| dc.date.submitted | 2010-08-10 | |
| dc.identifier.citation | Perler, F.B.: InBase, the Intein Database. Nucleic Acids Res. 30, 383-384. (2002)
Pietrokovski, S.: Intein spread and extinction in evolution. Trends Genet. 17, 465–472. (2001) Liu X.Q.: PROTEIN-SPLICING INTEIN: Genetic Mobility, Origin, And Evolution. Annual Review of Genetics. 34: 61-76 (2000) Schwarzer, D., and Cole, P.A.: Protein semisynthesis and expressed protein ligation: chasing a protein's tail. Curr Opin Chem Biol 9(6): 561–9 (2005) De Grey, A.D.N.J.: Mitochondrial gene therapy: an arena for the biomedical use of inteins. Trends Biotechnol. 18(9): 394-399 (2000) Gogarten, J.P., Senejani, A.G., Zhaxybayeva, O., et al.: Inteins: structure, function, and evolution.Annu. Rev. Microbiol. 56, 263–287 (2002) Chang, C.-C. and C.-J. Lin.: LIBSVM: a library for support vector machines.Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm. (2001) Kyte, J. &Doolittle, R. F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol.IS7, 109-132. (1982) Sweet R.M., Eisenberg D.: Amino acid scale: Optimized matching hydrophobicity (OMH). J. Mol. Biol. 171:479-488 (1983). Eisenberg D., Schwarz E., Komarony M., Wall R.: Amino acid scale: Normalized consensus hydrophobicity scale. J. Mol. Biol. 179:125-142 (1984). Hoop TP and Woods KR.: Prediction of protein antigenic determinants from amino acid sequences. Proc Natl Acad Sci USA 78:3824 (1981) Guy H.R.: Amino acid scale: Hydrophobicity scale based on free energy of transfer (kcal/mole). Biophys J. 47:61-70 (1985). Shemella P. , Pereira B., Zhang Y., Roey P.V. , Belfort G., Garde S. and Nayak S.K.: Mechanism for Intein C-Terminal Cleavage: A Proposal from Quantum Mechanical Calculations. Biophysical Journal, Volume 92, Issue 3, 847-853, 1 (2007) Wood, D. W., W. Wu, G. Belfort, V. Derbyshire, and M. Belfort. A genetic system yields self-cleaving inteins for bioseparations. Nat. Biotechnol. 17:889–892 (1999) Wood, D. W., V. Derbyshire, W. Wu, M. Chartrain, M. Belfort, and G. Belfort. Optimized single-step affinity purification with a self-cleaving intein applied to human acidic fibroblast growth factor. Biotechnol. Prog. 16:1055–1063 (2000) Brown T.A. , Genomes, 2nd edition, (2002) Chevalier B.S. , Stoddard B.L. : Homing endonucleases: structural and functional insight into the catalysts of intron/intein mobility. Nucleic Acids Research,Vol. 29, No. 18 3757-3774 (2001) V. Vapnik, 'The Nature of Statistical Learning Theory. 1995,' NY Springer RandSeq - http://www.expasy.ch/tools/randseq.html Hall T.A. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT (1999) 毛健民,古紅梅。 蛋白質內含子。 生物學教學.(第28卷)第11期;50-51(2003) 魏新元。 内含肽的研究及應用進展。內含肽的研究及應用進展。 西北農林科技大學學報(自然科學版)第36卷第5期;171-177 (2008) 張相萍,屈艾,丁鐵林。 蛋白質內含子的研究進展。生物學雜誌。03;59-62 (2009) | |
| dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/22282 | - |
| dc.description.abstract | 蛋白質內含子(intein)是一種寄生基因,類似於去氧核醣核酸中的內含子(intron)。它會跟著宿主基因一同進行轉錄跟轉譯反應,在轉譯完成形成蛋白質序列後,它會產生自發性的反應,會自宿主蛋白中自切(self-splicing)出來,並且它不會對宿主蛋白原有的功能造成影響,因為前期的寄生以及在形成蛋白質後的自切行為,故命名它為蛋白質內含子(intein)。目前有超過三百筆的蛋白質內含子資料被蒐集並且公佈在蛋白質內含子資料庫中(Perler, 2002 Inbase)。由資料裡我們可以發現蛋白質內含子它遍佈於構成所有生物種類的三域系統(細菌、古細菌、真核生物)中,不管是哪種生物系統,我們均可以發現蛋白質內含子的存在。而百分之七十的蛋白質內含子是寄生在去氧核醣核酸或是核醣核酸的合成酶基因中,另外有百分之二十五是寄生在代謝相關的基因上。也因為它具有自切的生化特性,它在蛋白質工程的應用上相當廣泛,像是蛋白質合成、藥物研發或是基因治療均有不少的應用。
除了它獨特的生化特性外,另外Liu(2000)與Pietrokovski(2001)的爭論中尚未能有一個結論的焦點便是它的演化過程,而為了能夠發現更多的蛋白質內含子以及在它演化樹的探討上有所幫助,我們便設計了一個工具來幫助我們達到這樣的目的,這也是撰寫本篇論文的主要原因。 蛋白質內含子以功能區域來說,可以分為兩大區塊,一部分是自切區塊(splicing domain),另一部分則是DOD內切酶區塊(DOD Endonuclease domain)。其中DOD內切酶區塊會大約而不精確的辨識宿主去氧核醣核酸序列中14~40個左右的鹼基片段,進一步將其序列切開造成歸巢現象(homing process)跟散佈蛋白質內含子的序列。因此由功能上可以看出DOD內切酶區塊並不是一個蛋白質內含子構成必要的區塊,而約莫有百分之十的蛋白質內含子僅有自切區塊,我們稱之為微型蛋白質內含子(mini-intein)。因為DOD內切酶區塊的不具必要性,加上它因著要辨識不同的宿主基因而變化性也大,在本篇研究中,我們並不會利用到它。實驗中主要使用的序列是自切區塊內的A、B、F跟G功能域(motif)。進一步利用支持向量機(SVM),可成功的分辨出蛋白質內含子的序列。 | zh_TW |
| dc.description.abstract | An intein is a parasitic genetic element similar to self-splicing introns . Intein are able to splice itself from its host protein and rejoin other protein segments, namely exteins, without influencing the function of the host protein. The self-splicing process of an intein is a spontaneous reaction. Up to date, more than 300 distinctive inteins have been discovered and are archived in the public intein database (Inbase). Inteins can be found in various living organisms across three life domains, i.e. archaea, eubacteria and eucarya. Currently, approximate seventy percentage of discovered intein parasitize in the host genes of DNA/RNA polymerase, while other 25% or more can be retrieved in the genes of metabolism. Inteins are very versatile in the biotechnological applications, e.g., protein synthesis, drug development, gene therapy , and so on. It may be ascribed to that inteins are very efficient in protein splicing.
Important as it is, the evolutionary process of inteins remains controversy and the classification of inteins has been rarely investigated. To assist the understanding and discovery of intein sequences, we propose a tool to predict intein sequences based on known inteins. The tool can distinguish inteins from other proteins and hence may help the identification of inteins in a host protein. With this tool, we may be able to find some sequence features of inteins to boost the understanding of their evolutionary process. Intein can be functionally divided into splicing domain and DOD Endonuclease domain, as shown in Figure 2. DOD Endonuclease domain can recognize sites of 14–40 DNA residues and usually does not require a complete match with the target sequence for a homing process to spread intein. Accordingly DOD domains of inteins may vary with different target genes. Nevertheless, DOD domain is not a necessary domain for an intein. An intein without DOD domain is regarded as a mini-intein. On the other hand, splicing domain exists in all inteins and plays a very crucial role for the function of protein splicing. As a result, we adopt the A, B, F, and G motifs in splicing domain as features to characterize an intein for the purpose of classification. In this study, we adopt support vector machine (SVM) technique to classify inteins from other proteins. | en |
| dc.description.provenance | Made available in DSpace on 2021-06-08T04:14:52Z (GMT). No. of bitstreams: 1 ntu-99-R96548013-1.pdf: 4168146 bytes, checksum: ade4c3340a01a18cac7e4eae2c9772fb (MD5) Previous issue date: 2010 | en |
| dc.description.tableofcontents | 第一章 序論 1
1.1 研究背景與動機 1 1.2 研究目的 2 1.3 論文架構 3 第二章 文獻探討 4 2.1蛋白質內含子文獻探討 4 2.1.1蛋白質內含子的發現與研究史 4 2.1.2蛋白質內含子的種類與特徵 5 2.1.3蛋白質內含子的自切反應機制與歸巢現象 7 2.1.4蛋白質內含子在生物工程上的應用 9 2.1.5蛋白質內含子在演化上的特性與爭議 10 2.2演算法方面文獻 11 2.2.1支持向量機之原理與介紹 11 SVM 經過學習之後,對於未知類別的新資料,可以依照規則(2.4)分類之: 13 2.2.2特徵值的給分方式-胺基酸疏水性及序列統計資料 14 2.2.3 特徵粹取-向前選取法(Forward selection) 14 2.2.4 檢定方式-交叉驗證(Cross-validation) 15 第三章 研究材料與方法 17 3.1 研究流程 17 3.2 研究材料及前處理 18 3.2.1 蛋白質內含子序列的前處理 18 3.3 序列統計與生化分析 19 3.4 計算及決定疏水性特徵值 20 3.5 利用疏水性方法分辨蛋白質內含子所屬物種及與演化樹比較 20 3.6 特徵粹取及判定一個蛋白質序列是否含有蛋白質內含子 21 第四章 實驗結果與討論 23 4.1 序列統計分析結果 23 4.1.1 蛋白質內含子序列之間的兩兩比對之結果 23 4.1.2 每個位置上的胺基酸組成比例 24 4.1.3 蛋白質內含子的疏水性結果 27 4.1.4 三大域物種蛋白質內含子的胺基酸組成比例 29 4.2計算疏水性方式的比較 30 4.2.1 以隨機蛋白質序列作為錯誤資料決定特徵值 30 4.2.2 使用隨機重組蛋白質內含子序列作為初步驗證 33 4.3 利用疏水性方法分辨蛋白質內含子所屬物種及與演化樹比較 34 4.4特徵粹取及判定一個蛋白質序列是否含有蛋白質內含子 36 第五章 結論與未來研究方向 42 5.1結論 42 5.2未來研究方向 43 | |
| dc.language.iso | zh-TW | |
| dc.subject | 演化 | zh_TW |
| dc.subject | 生物資訊 | zh_TW |
| dc.subject | 蛋白質內含子預測 | zh_TW |
| dc.subject | 支持向量機 | zh_TW |
| dc.subject | 特徵篩選 | zh_TW |
| dc.subject | SVM | en |
| dc.subject | Evolution | en |
| dc.subject | Feature Selection | en |
| dc.subject | Bioinformatics | en |
| dc.subject | Intein Prediction | en |
| dc.title | 蛋白質內含子之預測 | zh_TW |
| dc.title | The Prediction Of Intein Sequence | en |
| dc.type | Thesis | |
| dc.date.schoolyear | 98-2 | |
| dc.description.degree | 碩士 | |
| dc.contributor.oralexamcommittee | 黃乾綱,陳倩瑜 | |
| dc.subject.keyword | 生物資訊,蛋白質內含子預測,支持向量機,特徵篩選,演化, | zh_TW |
| dc.subject.keyword | Bioinformatics,Intein Prediction,SVM,Feature Selection,Evolution, | en |
| dc.relation.page | 67 | |
| dc.rights.note | 未授權 | |
| dc.date.accepted | 2010-08-10 | |
| dc.contributor.author-college | 工學院 | zh_TW |
| dc.contributor.author-dept | 醫學工程學研究所 | zh_TW |
| Appears in Collections: | 醫學工程學研究所 | |
Files in This Item:
| File | Size | Format | |
|---|---|---|---|
| ntu-99-1.pdf Restricted Access | 4.07 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
