請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9088
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 莊曜宇 | |
dc.contributor.author | Siao-Han Wong | en |
dc.contributor.author | 翁小涵 | zh_TW |
dc.date.accessioned | 2021-05-20T20:08:51Z | - |
dc.date.available | 2011-07-31 | |
dc.date.available | 2021-05-20T20:08:51Z | - |
dc.date.copyright | 2009-07-31 | |
dc.date.issued | 2009 | |
dc.date.submitted | 2009-07-31 | |
dc.identifier.citation | 1. 行政院衛生署-統計資訊網. 2009;
Available from: http://www.doh.gov.tw/CHT2006/index_populace.aspx. 2. Subramanian, J. and R. Govindan, Molecular genetics of lung cancer in people who have never smoked. Lancet Oncol, 2008. 9(7): p. 676-82. 3. Schena, M., et al., Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science, 1995. 270(5235): p. 467-70. 4. Yongchao Ge, S.D., and Terence P. Speed, Resampling-based multiple testing for microarray data analysis. TEST, 2003. 12(1): p. 1-77. 5. Sandrine Dudoit, J.P.S.a.J.C.B., Multiple Hypothesis Testing in Microarray Experiments. Statistical Science, 2003. 18(1): p. 71-103. 6. Kanehisa, M. and S. Goto, KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res, 2000. 28(1): p. 27-30. 7. BioCarta pathways database. Available from: http://www.biocarta.com/genes/index.asp. 8. Dahlquist, K.D., et al., GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways. Nat Genet, 2002. 31(1): p. 19-20. 9. Category C2, Curated canonical pathways. April, 2008; Available from: http://www.broad.mit.edu/gsea/msigdb/collections.jsp#C2. 10. Harris, M.A., et al., The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res, 2004. 32(Database issue): p. D258-61. 11. Mani, R., et al., Defining genetic interaction. Proc Natl Acad Sci U S A, 2008. 105(9): p. 3461-6. 12. Daraselia, N., et al., Extracting human protein interactions from MEDLINE using a full-sentence parser. Bioinformatics, 2004. 20(5): p. 604-11. 13. von Mering, C., et al., Comparative assessment of large-scale data sets of protein-protein interactions. Nature, 2002. 417(6887): p. 399-403. 14. Hart, G.T., A.K. Ramani, and E.M. Marcotte, How complete are current yeast and human protein-interaction networks? Genome Biol, 2006. 7(11): p. 120. 15. Yuryev, A., Introduction to pathway analysis, in Pathway analysis for drug discovery - computational infrastructure and applications, A. yuryev, Editor. 2008, Wiley. p. 1-20. 16. Bader, G.D., D. Betel, and C.W. Hogue, BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res, 2003. 31(1): p. 248-50. 17. Peri, S., et al., Human protein reference database as a discovery resource for proteomics. Nucleic Acids Res, 2004. 32(Database issue): p. D497-501. 18. Zanzoni, A., et al., MINT: a Molecular INTeraction database. FEBS Lett, 2002. 513(1): p. 135-40. 19. Curtis, R.K., M. Oresic, and A. Vidal-Puig, Pathways to the analysis of microarray data. Trends Biotechnol, 2005. 23(8): p. 429-35. 20. Sivachenko, A.Y., Pathway analysis of high-throughput experimental data, in Pathway analysis for drug discovery - computational infrastructure and applications, A. yuryev, Editor. 2008, Wiley. p. 103-117. 21. Calculating and Interpreting the P-values for Functions, Pathways, and Lists in Ingenuity Pathways Analysis. 2008. 22. Ekins, S., et al., Pathway mapping tools for analysis of high content data. Methods Mol Biol, 2007. 356: p. 319-50. 23. Huang da, W., B.T. Sherman, and R.A. Lempicki, Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc, 2009. 4(1): p. 44-57. 24. Richard Simon, A.P.L., BRB-ArrayTools Version 3.7 - User's Manual. 2007. p. 69-73. 25. Subramanian, A., et al., Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A, 2005. 102(43): p. 15545-50. 26. Mootha, V.K., et al., PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet, 2003. 34(3): p. 267-73. 27. Tian, L., et al., Discovering statistically significant pathways in expression profiling studies. Proc Natl Acad Sci U S A, 2005. 102(38): p. 13544-9. 28. Pavlidis, P., D.P. Lewis, and W.S. Noble, Exploring gene expression data with class scores. Pac Symp Biocomput, 2002: p. 474-85. 29. Watts, D.J. and S.H. Strogatz, Collective dynamics of 'small-world' networks. Nature, 1998. 393(6684): p. 440-2. 30. Barabasi, A.L. and Z.N. Oltvai, Network biology: understanding the cell's functional organization. Nat Rev Genet, 2004. 5(2): p. 101-13. 31. Wu, Z., X. Zhao, and L. Chen, Identifying responsive functional modules from protein-protein interaction network. Mol Cells, 2009. 27(3): p. 271-7. 32. Ideker, T., et al., Discovering regulatory and signalling circuits in molecular interaction networks. Bioinformatics, 2002. 18 Suppl 1: p. S233-40. 33. Nacu, S., et al., Gene expression network analysis and applications to immunology. Bioinformatics, 2007. 23(7): p. 850-8. 34. Kelley, R. and T. Ideker, Systematic interpretation of genetic interactions using protein networks. Nat Biotechnol, 2005. 23(5): p. 561-6. 35. Li, Y., P. Agarwal, and D. Rajagopalan, A global pathway crosstalk network. Bioinformatics, 2008. 24(12): p. 1442-7. 36. Lee, E., et al., Inferring pathway activity toward precise disease classification. PLoS Comput Biol, 2008. 4(11): p. e1000217. 37. Chuang, H.Y., et al., Network-based classification of breast cancer metastasis. Mol Syst Biol, 2007. 3: p. 140. 38. Su, L.J., et al., Selection of DDX5 as a novel internal control for Q-RT-PCR from microarray data using a block bootstrap re-sampling scheme. BMC Genomics, 2007. 8: p. 140. 39. Edgar, R., M. Domrachev, and A.E. Lash, Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res, 2002. 30(1): p. 207-10. 40. Stark, C., et al., BioGRID: a general repository for interaction datasets. Nucleic Acids Res, 2006. 34(Database issue): p. D535-9. 41. Karp, P.D., et al., The EcoCyc Database. Nucleic Acids Res, 2002. 30(1): p. 56-8. 42. Inc., P., Partek Genomics Suite. 2009, Partek Inc.: St. Louis. 43. Cline, M.S., et al., Integration of biological networks and gene expression data using Cytoscape. Nat Protoc, 2007. 2(10): p. 2366-82. 44. Barsky, A., et al., Cerebral: a Cytoscape plugin for layout of and interaction with biological networks using subcellular localization annotation. Bioinformatics, 2007. 23(8): p. 1040-2. 45. Color Objects in KEGG Pathways. Available from: http://www.genome.jp/kegg/tool/color_pathway.html. 46. QuickGO : a web-based browser for Gene Ontology terms and annotations. Available from: http://www.ebi.ac.uk/QuickGO/. 47. GeneChip® Human Genome Arrays. 48. Irizarry, R.A., et al., Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res, 2003. 31(4): p. e15. 49. Bolstad, B.M., et al., A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics, 2003. 19(2): p. 185-93. 50. Daniel Holder, R.F.R., V. Bill Pikounis, Vladimir Svetnik, Keith Soper. Statistical analysis of high density oligonucleotide arrays: a SAFER approach. in Proceedings of the ASA Annual Meeting 2001. 2001. 51. Bland, J.M. and D.G. Altman, Multiple significance tests: the Bonferroni method. BMJ, 1995. 310(6973): p. 170. 52. Storey, J.D. and R. Tibshirani, Statistical significance for genomewide studies. Proc Natl Acad Sci U S A, 2003. 100(16): p. 9440-5. 53. Draghici, S., et al., A systems biology approach for pathway level analysis. Genome Res, 2007. 17(10): p. 1537-45. 54. 圖20. 97年主要癌症死亡率, 97年死因統計結果分析.doc, Editor, Department of Health, Executive Yuan, R.O.C. (Taiwan). | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9088 | - |
dc.description.abstract | 近年來,基因晶片 (microarray) 的數據分析已經從傳統上僅利用統計分析的方法,轉向加入更多已知基因功能性註解來協助分析。生物反應路徑分析法 (pathway analysis) 針對資料庫中每一個已定義好的反應路徑 (pathway) ,測量於實驗設計中其是否存在訊息核糖核酸 (mRNA) 層級上的顯著變異。而另外一種網絡圖譜分析法 (network analysis) 則旨在搜尋一個由基因間所有可能存在的交互作用所組成的圖譜 (global interaction network) ,看其中是否有顯著改變的子圖譜 (subnetwork) 。這兩種分析方法各擅勝場,且可望補足對方的缺點:前者僅分析已知的反應路徑,故將結果局限在熟知的生物知識中;後者雖蘊藏了許多可能的基因互動途徑,但直接從 global interaction network 中搜尋容易找到無法以現有生物知識呈現其一致性生物意義的subnetwork。
本文提出一個能基於pathway analysis但更進一步結合network analysis優點的分析方式來改善現有的分析方式,值得一提的是此一結合概念在目前的分析領域中並不常見。 此方法首先利用 Tian et al. 發展的pathway analysis方式測定有顯著變異的pathway。接著以這些pathway的成員作為出發點,採用Nacu et al. 的network analysis方法進行一個目的導向式的subnetwork搜尋。 本篇論文將此方法應用在台大醫院肺癌病人的基因晶片數據上。一開始嘗試在有變異的pathway中尋找其最具代表性的成分 (subnetwork) 。這組數據產生的subnetwork在另一組台北榮總肺癌病人的數據中亦得到了呈現高度一致的結果。此外,我們針對找到的subnetwork進行其成員基因的功能性分析,發現從原本完整pathway縮減到subnetwork的過程中,整體的功能由原本具有的多樣性,專一化到特定的功能上。這暗示了該pathway的某部分功能在實驗中是明顯地被改變的,而這樣的改變得以用這篇論文的方法被察覺。更進一步,我們展現了本方法承繼了network analysis而來的優點。立基於pathway的已知成員去搜尋其可能有互動的鄰近基因,我們得到的subnetwork是以此pathway為出發點,但包含許多未被認識的交互作用,這樣的結果可以協助研究者對於未知的部分設計實驗做更深入的探究。從另外一方面來說,這樣的結果也有異於傳統network analysis的方式,它能使得研究可以立基於研究者感興趣的pathway,基於已知的生物知識去拓展相關未知的可能性。 總結所有分析的結果,它們從不同方向指出了此分析方法不同於以往的許多優點:它可以從顯著改變的pathway中萃取出一個最重要且大小適合研究的subnetwork、也可以針對研究者感興趣的pathway或特定的調控機制進行主題式的深入探討、此外除了立基於原有生物知識外,它亦具有開發基因間新的互動機制的能力。 | zh_TW |
dc.description.abstract | Currently the analysis of microarray data had turned into integrating with prior biological knowledge: pathway analysis interprets transcriptomic data on pathway level and identified predefined groups of genes with dysregulation; network analysis takes gene-gene interactions information into consideration and searches for modules associated to the phenotypes under study. The two analyses have its own advantages respectively and they complement the weaknesses of each other: pathway analysis provides little clues to directly explore new biological knowledge and network analysis usually yields modules including few consistent biological information. In this study an analytical methodology was developed to integrate current pathway analysis method with network analysis methods.
Initially, dysregulated pathways are identified by modified pathway analysis method in Tian et al.. Subsequently, a focus-oriented investigation on dysregulated pathways are performed by network analysis following the work of Nacu et al., and this step is using modules within or related to members of the pathways to be further investigated. Several improvements were made, such as the scoring functions and the module identification algorithms. To illustrate the benefits of this methodology, a lung cancer study with 30 paired cancer and normal tissues was explored. The results derived within dysregulated pathways were also identified consistently in another public dataset GSE7670. Furthermore, GO term enrichment analysis was applied to show that the modules have a specialized functionality than the original pathways. In brief, original large modules were reduced from the entire pathway to a smaller size of relevant interconnected members, which are much easier to be manipulated but still remain their biological information. Moreover, the ability of this methodology to explore novel interactions related to pathway members were also demonstrated by extending the module search algorithm beyond the pre-defined pathways. This would not be achieved by traditional pathway analysis methods, which usually don’t include biomolecular interaction information. Yet, modules identified in this methodology were based on dysregulated pathways with specific biological meaning since their members were mainly associated. In conclusion, these data all indicated the advantages to integrate both pathway and network information during microarray analysis: to uncover manageable size of molecular interaction networks important for pathway dysregulation, to focus on interested pathways, functions or even specific regulatory events, and to possess the potential of performing exploratory researches on mechanisms that are not yet well understood. Undoubtedly, this concept could be extensively applied to other array experiments of similar design regardless of the disease under study. | en |
dc.description.provenance | Made available in DSpace on 2021-05-20T20:08:51Z (GMT). No. of bitstreams: 1 ntu-98-R96945033-1.pdf: 5236141 bytes, checksum: efd4574345b9c885728ba89a70dc0de5 (MD5) Previous issue date: 2009 | en |
dc.description.tableofcontents | 口試委員會審定書 I
謝誌 II 摘要 III ABSTRACT V Chapter 1 Introduction 1 1.1 Lung cancer 1 1.2 Microarray 2 1.3 Data analysis 2 1.3.1 Single gene analysis 3 1.3.2 Biological knowledge database 4 1.3.3 Pathway analysis 8 1.3.4 Network analysis 11 1.3.5 Methodology in this work 13 Chapter 2 Materials 17 2.1 Lung cancer datasets 17 2.2 Databases 17 2.3 Program environment and public/ commercial tools 19 Chapter 3 Methods 22 3.1 Database construction 22 3.2 Single gene analysis 24 3.3 Pathway analysis 25 3.4 Network analysis 27 3.5 Results demonstration 30 Chapter 4 Results 31 4.1 Database 31 4.2 Single gene analysis 32 4.3 Pathway analysis 34 4.4 Network analysis - within pathways 36 4.5 Result demonstration and comparison 39 4.5.1 Mapping the main component and leading edge subset on KEGG pathways 39 4.5.2 GO term enrichment analysis 41 4.6 Network analysis – protruding pathways 48 Chapter 5 Discussion 50 6.1 Input matrix creation 50 6.2 Pathway analysis 51 6.3 Network analysis 55 6.4 Future perspectives 60 Chapter 6 Conclusions 63 REFERENCES 66 APPENDIX 70 | |
dc.language.iso | en | |
dc.title | 利用已知基因傳遞機制及蛋白質交互作用圖譜來開發新的基因互動途徑 | zh_TW |
dc.title | Integrate pathway information and protein interaction network to explore possible interactions between genes | en |
dc.type | Thesis | |
dc.date.schoolyear | 97-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 賴亮全,高成炎,蔡孟勳 | |
dc.subject.keyword | 基因晶片,數據分析, | zh_TW |
dc.subject.keyword | microarray,pathway analysis,network analysis,module, | en |
dc.relation.page | 85 | |
dc.rights.note | 同意授權(全球公開) | |
dc.date.accepted | 2009-07-31 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 生醫電子與資訊學研究所 | zh_TW |
顯示於系所單位: | 生醫電子與資訊學研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-98-1.pdf | 5.11 MB | Adobe PDF | 檢視/開啟 |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。