Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/22282
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | 陳中明 (Chung-Ming Chen) | -
dc.contributor.author | Ren-Cheng Wang | en
dc.contributor.author | 王人晟 | zh_TW
dc.date.accessioned | 2021-06-08T04:14:52Z | -
dc.date.copyright | 2010-08-17 | -
dc.date.issued | 2010 | -
dc.date.submitted | 2010-08-10 | -
dc.identifier.citation | Perler, F.B.: InBase, the Intein Database. Nucleic Acids Res. 30, 383-384 (2002)
Pietrokovski, S.: Intein spread and extinction in evolution. Trends Genet. 17, 465–472 (2001)
Liu, X.Q.: Protein-splicing intein: genetic mobility, origin, and evolution. Annual Review of Genetics 34, 61-76 (2000)
Schwarzer, D., Cole, P.A.: Protein semisynthesis and expressed protein ligation: chasing a protein's tail. Curr. Opin. Chem. Biol. 9(6), 561–569 (2005)
De Grey, A.D.N.J.: Mitochondrial gene therapy: an arena for the biomedical use of inteins. Trends Biotechnol. 18(9), 394-399 (2000)
Gogarten, J.P., Senejani, A.G., Zhaxybayeva, O., et al.: Inteins: structure, function, and evolution. Annu. Rev. Microbiol. 56, 263–287 (2002)
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm (2001)
Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105-132 (1982)
Sweet, R.M., Eisenberg, D.: Amino acid scale: Optimized matching hydrophobicity (OMH). J. Mol. Biol. 171, 479-488 (1983)
Eisenberg, D., Schwarz, E., Komaromy, M., Wall, R.: Amino acid scale: Normalized consensus hydrophobicity scale. J. Mol. Biol. 179, 125-142 (1984)
Hopp, T.P., Woods, K.R.: Prediction of protein antigenic determinants from amino acid sequences. Proc. Natl. Acad. Sci. USA 78, 3824 (1981)
Guy, H.R.: Amino acid scale: Hydrophobicity scale based on free energy of transfer (kcal/mole). Biophys. J. 47, 61-70 (1985)
Shemella, P., Pereira, B., Zhang, Y., Roey, P.V., Belfort, G., Garde, S., Nayak, S.K.: Mechanism for intein C-terminal cleavage: a proposal from quantum mechanical calculations. Biophysical Journal 92(3), 847-853 (2007)
Wood, D.W., Wu, W., Belfort, G., Derbyshire, V., Belfort, M.: A genetic system yields self-cleaving inteins for bioseparations. Nat. Biotechnol. 17, 889–892 (1999)
Wood, D.W., Derbyshire, V., Wu, W., Chartrain, M., Belfort, M., Belfort, G.: Optimized single-step affinity purification with a self-cleaving intein applied to human acidic fibroblast growth factor. Biotechnol. Prog. 16, 1055–1063 (2000)
Brown, T.A.: Genomes, 2nd edition (2002)
Chevalier, B.S., Stoddard, B.L.: Homing endonucleases: structural and functional insight into the catalysts of intron/intein mobility. Nucleic Acids Research 29(18), 3757-3774 (2001)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
RandSeq: http://www.expasy.ch/tools/randseq.html
Hall, T.A.: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT (1999)
毛健民, 古紅梅: 蛋白質內含子. 生物學教學 28(11), 50-51 (2003)
魏新元: 內含肽的研究及應用進展. 西北農林科技大學學報(自然科學版) 36(5), 171-177 (2008)
張相萍, 屈艾, 丁鐵林: 蛋白質內含子的研究進展. 生物學雜誌 03, 59-62 (2009)
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/22282 | -
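dc.description.abstract | An intein is a parasitic genetic element, analogous to an intron in DNA. It is transcribed and translated together with its host gene; once translation has produced the protein sequence, the intein spontaneously excises itself from the host protein (self-splicing) without affecting the host protein's original function. The name "intein" (protein intron) reflects this parasitic stage followed by self-splicing at the protein level. More than three hundred inteins have been collected and published in the intein database (InBase; Perler, 2002). These data show that inteins occur throughout the three-domain system that covers all forms of life (bacteria, archaea, and eukaryotes); whichever domain is examined, inteins can be found. Seventy percent of inteins reside in genes encoding DNA or RNA synthesis enzymes, and another twenty-five percent reside in metabolism-related genes. Because of their self-splicing property, inteins are widely applied in protein engineering, including protein synthesis, drug development, and gene therapy.
Beyond these distinctive biochemical properties, the evolutionary history of inteins is a point on which the debate between Liu (2000) and Pietrokovski (2001) has not yet reached a conclusion. To help discover more inteins and to support the study of their phylogenetic tree, we designed a tool for this purpose, which is the main motivation for this thesis.
Functionally, an intein can be divided into two regions: the splicing domain and the DOD endonuclease domain. The DOD endonuclease domain recognizes, approximately rather than exactly, a stretch of about 14 to 40 bases in the host DNA sequence and cleaves it, which drives the homing process and spreads the intein sequence. The DOD endonuclease domain is therefore not an essential part of an intein, and roughly ten percent of inteins have only the splicing domain; these are called mini-inteins. Because the DOD endonuclease domain is dispensable and, since it must recognize different host genes, highly variable, we do not use it in this study. The sequences used in the experiments are mainly the A, B, F, and G motifs within the splicing domain. Applying a support vector machine (SVM) to these motifs successfully distinguishes intein sequences.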
zh_TW
dc.description.abstract | An intein is a parasitic genetic element similar to a self-splicing intron. Inteins are able to splice themselves out of their host protein and rejoin the flanking protein segments, called exteins, without affecting the function of the host protein. The self-splicing of an intein is a spontaneous reaction. To date, more than 300 distinct inteins have been discovered and archived in the public intein database (InBase). Inteins are found in organisms across all three domains of life, i.e. archaea, eubacteria, and eucarya. Currently, approximately seventy percent of the discovered inteins reside in host genes for DNA/RNA polymerases, while another 25% or more are found in metabolism genes. Inteins are versatile in biotechnological applications such as protein synthesis, drug development, and gene therapy, which may be ascribed to their efficiency in protein splicing.
Important as inteins are, their evolutionary process remains controversial and their classification has rarely been investigated. To assist the understanding and discovery of intein sequences, we propose a tool that predicts intein sequences based on known inteins. The tool can distinguish inteins from other proteins and hence may help identify inteins within a host protein. With this tool, we may be able to find sequence features of inteins that improve the understanding of their evolutionary process.
An intein can be functionally divided into a splicing domain and a DOD endonuclease domain, as shown in Figure 2. The DOD endonuclease domain recognizes sites of 14–40 DNA residues and usually does not require a complete match with the target sequence to trigger the homing process that spreads the intein. Accordingly, the DOD domains of inteins may vary with different target genes. The DOD domain is nevertheless not a necessary part of an intein; an intein without a DOD domain is regarded as a mini-intein. The splicing domain, on the other hand, exists in all inteins and plays a crucial role in protein splicing. We therefore adopt the A, B, F, and G motifs of the splicing domain as the features that characterize an intein for classification. In this study, we use the support vector machine (SVM) technique to distinguish inteins from other proteins.
en
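The abstract names the A, B, F, and G splicing-domain motifs as classification features, and the thesis outline and reference list point to amino acid hydrophobicity scales such as Kyte and Doolittle (1982) for scoring them. The following is only a minimal sketch of how such a feature vector could be built, assuming one mean hydropathy value per motif; the function names and the example motif strings are hypothetical placeholders, and the thesis's actual scoring scheme may differ.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids
# (Kyte & Doolittle, 1982).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Motif order used for the feature vector (an assumption of this sketch).
MOTIF_ORDER = ("A", "B", "F", "G")


def mean_hydropathy(segment):
    """Return the average Kyte-Doolittle hydropathy of one sequence segment."""
    if not segment:
        raise ValueError("segment must not be empty")
    # A KeyError here means the segment contains a non-standard residue code.
    values = [KYTE_DOOLITTLE[residue] for residue in segment.upper()]
    return sum(values) / len(values)


def motif_features(motifs):
    """Map a dict of motif name -> sequence to a 4-value feature vector."""
    return [mean_hydropathy(motifs[name]) for name in MOTIF_ORDER]


# Made-up placeholder segments for illustration only, not real intein motifs.
example_motifs = {"A": "CLAGDT", "B": "GNLHTH", "F": "VYDLEV", "G": "VVHNS"}
features = motif_features(example_motifs)
print(features)
```

A vector like `features` is the kind of fixed-length numeric input that an SVM library such as LIBSVM expects; the label (intein or non-intein) would be supplied separately for each training sequence.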
dc.description.provenance | Made available in DSpace on 2021-06-08T04:14:52Z (GMT). No. of bitstreams: 1
ntu-99-R96548013-1.pdf: 4168146 bytes, checksum: ade4c3340a01a18cac7e4eae2c9772fb (MD5)
Previous issue date: 2010
en
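dc.description.tableofcontents | Chapter 1 Introduction
1.1 Research Background and Motivation
1.2 Research Objectives
1.3 Thesis Organization
Chapter 2 Literature Review
2.1 Literature on Inteins
2.1.1 Discovery and Research History of Inteins
2.1.2 Types and Characteristics of Inteins
2.1.3 Self-Splicing Mechanism and Homing of Inteins
2.1.4 Applications of Inteins in Bioengineering
2.1.5 Evolutionary Characteristics of Inteins and Related Controversies
2.2 Literature on Algorithms
2.2.1 Principles of and Introduction to Support Vector Machines
After training, the SVM can classify new data of unknown class according to rule (2.4):
2.2.2 Feature Scoring: Amino Acid Hydrophobicity and Sequence Statistics
2.2.3 Feature Extraction: Forward Selection
2.2.4 Validation Method: Cross-Validation
Chapter 3 Materials and Methods
3.1 Research Workflow
3.2 Materials and Preprocessing
3.2.1 Preprocessing of Intein Sequences
3.3 Sequence Statistics and Biochemical Analysis
3.4 Computing and Selecting Hydrophobicity Feature Values
3.5 Identifying the Species of Origin of Inteins Using Hydrophobicity, and Comparison with the Phylogenetic Tree
3.6 Feature Extraction and Determining Whether a Protein Sequence Contains an Intein
Chapter 4 Results and Discussion
4.1 Results of Sequence Statistical Analysis
4.1.1 Pairwise Alignment Results Among Intein Sequences
4.1.2 Amino Acid Composition at Each Position
4.1.3 Hydrophobicity Results for Inteins
4.1.4 Amino Acid Composition of Inteins Across the Three Domains
4.2 Comparison of Hydrophobicity Calculation Methods
4.2.1 Determining Feature Values Using Random Protein Sequences as Negative Data
4.2.2 Preliminary Validation Using Randomly Shuffled Intein Sequences
4.3 Identifying the Species of Origin of Inteins Using Hydrophobicity, and Comparison with the Phylogenetic Tree
4.4 Feature Extraction and Determining Whether a Protein Sequence Contains an Intein
Chapter 5 Conclusions and Future Work
5.1 Conclusions
5.2 Future Work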
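dc.language.iso | zh-TW | -
dc.subject | 演化 (Evolution) | zh_TW
dc.subject | 生物資訊 (Bioinformatics) | zh_TW
dc.subject | 蛋白質內含子預測 (Intein Prediction) | zh_TW
dc.subject | 支持向量機 (Support Vector Machine) | zh_TW
dc.subject | 特徵篩選 (Feature Selection) | zh_TW
dc.subject | SVM | en
dc.subject | Evolution | en
dc.subject | Feature Selection | en
dc.subject | Bioinformatics | en
dc.subject | Intein Prediction | en
dc.title | 蛋白質內含子之預測 | zh_TW
dc.title | The Prediction Of Intein Sequence | en
dc.type | Thesis | -
dc.date.schoolyear | 98-2 | -
dc.description.degree | 碩士 (Master's) | -
dc.contributor.oralexamcommittee | 黃乾綱, 陳倩瑜 | -
dc.subject.keyword | 生物資訊, 蛋白質內含子預測, 支持向量機, 特徵篩選, 演化 (Bioinformatics, Intein Prediction, Support Vector Machine, Feature Selection, Evolution) | zh_TW
dc.subject.keyword | Bioinformatics, Intein Prediction, SVM, Feature Selection, Evolution | en
dc.relation.page | 67 | -
dc.rights.note | 未授權 (not authorized) | -
dc.date.accepted | 2010-08-10 | -
dc.contributor.author-college | 工學院 (College of Engineering) | zh_TW
dc.contributor.author-dept | 醫學工程學研究所 (Graduate Institute of Biomedical Engineering) | zh_TW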
Appears in Collections: 醫學工程學研究所 (Graduate Institute of Biomedical Engineering)

Files in This Item:
File | Size | Format
ntu-99-1.pdf (Restricted Access) | 4.07 MB | Adobe PDF
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.