請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/22122
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 莊曜宇(Eric Y. Chuang) | |
dc.contributor.author | Chien-Yueh Lee | en |
dc.contributor.author | 李建樂 | zh_TW |
dc.date.accessioned | 2021-06-08T04:03:41Z | - |
dc.date.copyright | 2018-08-08 | |
dc.date.issued | 2018 | |
dc.date.submitted | 2018-08-01 | |
dc.identifier.citation | 1. Schena, M., et al., Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science, 1995. 270(5235): p. 467-70.
2. Bumgarner, R., Overview of DNA microarrays: types, applications, and their future. Curr Protoc Mol Biol, 2013. Chapter 22: p. Unit 22 1. 3. Shendure, J., et al., Accurate multiplex polony sequencing of an evolved bacterial genome. Science, 2005. 309(5741): p. 1728-32. 4. Metzker, M.L., Sequencing technologies - the next generation. Nat Rev Genet, 2010. 11(1): p. 31-46. 5. Dewey, F.E., et al., DNA sequencing: clinical applications of new DNA sequencing technologies. Circulation, 2012. 125(7): p. 931-44. 6. Bainbridge, M.N., et al., Whole exome capture in solution with 3 Gbp of data. Genome Biol, 2010. 11(6): p. R62. 7. Wang, Z., M. Gerstein, and M. Snyder, RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet, 2009. 10(1): p. 57-63. 8. Creighton, C.J., J.G. Reid, and P.H. Gunaratne, Expression profiling of microRNAs by deep sequencing. Brief Bioinform, 2009. 10(5): p. 490-7. 9. Montero-Mendieta, S., et al., A practical guide to build de-novo assemblies for single tissues of non-model organisms: the example of a Neotropical frog. PeerJ, 2017. 5: p. e3702. 10. Ji, P., et al., MetaSort untangles metagenome assembly by reducing microbial community complexity. Nat Commun, 2017. 8: p. 14306. 11. Park, P.J., ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet, 2009. 10(10): p. 669-80. 12. Lu, T.P., et al., miRSystem: an integrated system for characterizing enriched functions and pathways of microRNA targets. PLoS One, 2012. 7(8): p. e42390. 13. Lee, Y.F., et al., CellExpress: a comprehensive microarray-based cancer cell line and clinical sample gene expression analysis online system. Database (Oxford), 2018. 2018. 14. Lee, C.-Y., et al., Common applications of next-generation sequencing technologies in genomic research. Translational Cancer Research, 2013. 2(1): p. 33-45. 15. Lee, C.Y., et al., Whole-genome de novo sequencing reveals unique genes that contributed to the adaptive evolution of the Mikado pheasant. Gigascience, 2018. 7(5). 16. Kryukov, K. and T. Imanishi, Human Contamination in Public Genome Assemblies. PLoS One, 2016. 11(9): p. e0162424. 17. Merchant, S., D.E. Wood, and S.L. Salzberg, Unexpected cross-species contamination in genome sequencing projects. PeerJ, 2014. 2: p. e675. 18. Sheikhizadeh, S. and D. de Ridder, ACE: accurate correction of errors using K-mer tries. Bioinformatics, 2015. 31(19): p. 3216-8. 19. Simao, F.A., et al., BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics, 2015. 31(19): p. 3210-2. 20. Alfoldi, J. and K. Lindblad-Toh, Comparative genomics as a tool to understand evolution and disease. Genome Res, 2013. 23(7): p. 1063-8. 21. Carthew, R.W. and E.J. Sontheimer, Origins and Mechanisms of miRNAs and siRNAs. Cell, 2009. 136(4): p. 642-55. 22. Leung, A.K. and P.A. Sharp, MicroRNA functions in stress responses. Mol Cell, 2010. 40(2): p. 205-15. 23. Havelange, V. and R. Garzon, MicroRNAs: emerging key regulators of hematopoiesis. Am J Hematol, 2010. 85(12): p. 935-42. 24. Ma, S., et al., Low-dose radiation-induced responses: focusing on epigenetic regulation. Int J Radiat Biol, 2010. 86(7): p. 517-28. 25. Harris, A., S.M. Krams, and O.M. Martinez, MicroRNAs as immune regulators: implications for transplantation. Am J Transplant, 2010. 10(4): p. 713-9. 26. Port, J.D. and C. Sucharov, Role of microRNAs in cardiovascular disease: therapeutic challenges and potentials. J Cardiovasc Pharmacol, 2010. 56(5): p. 444-53. 27. Ferland-McCollough, D., et al., The involvement of microRNAs in Type 2 diabetes. Biochem Soc Trans, 2010. 38(6): p. 1565-70. 28. Farazi, T.A., et al., miRNAs in human cancer. J Pathol, 2011. 223(2): p. 102-15. 29. John, B., et al., Human MicroRNA targets. PLoS Biol, 2004. 2(11): p. e363. 30. Lewis, B.P., C.B. Burge, and D.P. Bartel, Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell, 2005. 120(1): p. 15-20. 31. Grimson, A., et al., MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell, 2007. 27(1): p. 91-105. 32. Friedman, R.C., et al., Most mammalian mRNAs are conserved targets of microRNAs. Genome Res, 2009. 19(1): p. 92-105. 33. Maragkakis, M., et al., Accurate microRNA target prediction correlates with protein repression levels. BMC Bioinformatics, 2009. 10: p. 295. 34. Maragkakis, M., et al., DIANA-microT web server: elucidating microRNA functions through target prediction. Nucleic Acids Res, 2009. 37(Web Server issue): p. W273-6. 35. Tsang, J.S., M.S. Ebert, and A. van Oudenaarden, Genome-wide dissection of microRNA functions and cotargeting networks using gene set signatures. Mol Cell, 2010. 38(1): p. 140-53. 36. Krek, A., et al., Combinatorial microRNA target predictions. Nat Genet, 2005. 37(5): p. 495-500. 37. Miranda, K.C., et al., A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell, 2006. 126(6): p. 1203-17. 38. Kertesz, M., et al., The role of site accessibility in microRNA target recognition. Nat Genet, 2007. 39(10): p. 1278-84. 39. Papadopoulos, G.L., et al., DIANA-mirPath: Integrating human and mouse microRNAs in pathways. Bioinformatics, 2009. 25(15): p. 1991-3. 40. Bang-Berthelsen, C.H., et al., Independent component and pathway-based analysis of miRNA-regulated gene expression in a model of type 1 diabetes. BMC Genomics, 2011. 12: p. 97. 41. Draghici, S., et al., A systems biology approach for pathway level analysis. Genome Res, 2007. 17(10): p. 1537-45. 42. Griffiths-Jones, S., et al., miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res, 2006. 34(Database issue): p. D140-4. 43. Betel, D., et al., The microRNA.org resource: targets and expression. Nucleic Acids Res, 2008. 36(Database issue): p. D149-53. 44. Enright, A.J., et al., MicroRNA targets in Drosophila. Genome Biol, 2003. 5(1): p. R1. 45. Papadopoulos, G.L., et al., The database of experimentally supported targets: a functional update of TarBase. Nucleic Acids Res, 2009. 37(Database issue): p. D155-8. 46. Xiao, F., et al., miRecords: an integrated resource for microRNA-target interactions. Nucleic Acids Res, 2009. 37(Database issue): p. D105-10. 47. Carbon, S., et al., AmiGO: online access to ontology and annotation data. Bioinformatics, 2009. 25(2): p. 288-9. 48. Kanehisa, M., et al., KEGG for linking genomes to life and the environment. Nucleic Acids Res, 2008. 36(Database issue): p. D480-4. 49. Schaefer, C.F., et al., PID: the Pathway Interaction Database. Nucleic Acids Res, 2009. 37(Database issue): p. D674-9. 50. Matthews, L., et al., Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res, 2009. 37(Database issue): p. D619-22. 51. Enerly, E., et al., miRNA-mRNA integrated analysis reveals roles for miRNAs in primary breast tumors. PLoS One, 2011. 6(2): p. e16915. 52. Gutierrez, N.C., et al., Deregulation of microRNA expression in the different genetic subtypes of multiple myeloma and correlation with gene expression profiling. Leukemia, 2010. 24(3): p. 629-37. 53. Edgar, R., M. Domrachev, and A.E. Lash, Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res, 2002. 30(1): p. 207-10. 54. Jundt, F., et al., Jagged1-induced Notch signaling drives proliferation of multiple myeloma cells. Blood, 2004. 103(9): p. 3511-5. 55. Bommert, K., R.C. Bargou, and T. Stuhmer, Signalling and survival pathways in multiple myeloma. Eur J Cancer, 2006. 42(11): p. 1574-80. 56. Schwarzer, R., et al., Notch inhibition blocks multiple myeloma cell-induced osteoclast activation. Leukemia, 2008. 22(12): p. 2273-7. 57. Harrison, C.N., Platelets and thrombosis in myeloproliferative diseases. Hematology Am Soc Hematol Educ Program, 2005: p. 409-15. 58. Wagner, D.D. and P.C. Burger, Platelets in inflammation and thrombosis. Arterioscler Thromb Vasc Biol, 2003. 23(12): p. 2131-7. 59. Kristinsson, S.Y., Thrombosis in multiple myeloma. Hematology Am Soc Hematol Educ Program, 2010. 2010: p. 437-44. 60. Zangari, M., et al., Thrombosis in multiple myeloma. Expert Rev Anticancer Ther, 2007. 7(3): p. 307-15. 61. Roy, A.M., M.S. Baliga, and S.K. Katiyar, Epigallocatechin-3-gallate induces apoptosis in estrogen receptor-negative human breast carcinoma cells via modulation in protein expression of p53 and Bax and caspase-3 activation. Mol Cancer Ther, 2005. 4(1): p. 81-90. 62. Ciucci, A., et al., Induction of apoptosis in estrogen receptor-negative breast cancer cells by natural and synthetic cyclopentenones: role of the IkappaB kinase/nuclear factor-kappaB pathway. Mol Pharmacol, 2006. 70(5): p. 1812-21. 63. Lu, T.P., et al., Identification of a novel biomarker, SEMA5A, for non-small cell lung carcinoma in nonsmoking women. Cancer Epidemiol Biomarkers Prev, 2010. 19(10): p. 2590-7. 64. Capparuccia, L. and L. Tamagnone, Semaphorin signaling in cancer cells and in cells of the tumor microenvironment--two sides of a coin. J Cell Sci, 2009. 122(Pt 11): p. 1723-36. 65. Neufeld, G., et al., Semaphorins in cancer. Front Biosci, 2005. 10: p. 751-60. 66. Yang, J.M., et al., Consensus scoring criteria for improving enrichment in virtual screening. J Chem Inf Model, 2005. 45(4): p. 1134-46. 67. Hsu, D.F., Y.-S. Chung, and K.B. S, Combinatorial Fusion Analysis: Methods and Practice of Combining Multiple Scoring Systems. 2006, Advanced Data Mining Technologies in Bioinformatics, IdealGroup Inc. p. pp32-62. 68. Hsu, D.F., K.B. S, and C. Schweikert, Rank-Score Characteristics (RSC) Function and Cognitive Diversity. 2010, Brain Informatics. p. 42-54. 69. Reinhart, B.J., et al., MicroRNAs in plants. Genes Dev, 2002. 16(13): p. 1616-26. 70. Dai, X. and P.X. Zhao, psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res, 2011. 39(Web Server issue): p. W155-9. 71. Lamb, J., et al., The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science, 2006. 313(5795): p. 1929-35. 72. Git, A., et al., Systematic comparison of microarray profiling, real-time PCR, and next-generation sequencing technologies for measuring differential microRNA expression. RNA, 2010. 16(5): p. 991-1006. 73. Roh, S.W., et al., Comparing microarrays and next-generation sequencing technologies for microbial ecology research. Trends Biotechnol, 2010. 28(6): p. 291-9. 74. Sirbu, A., et al., RNA-Seq vs dual- and single-channel microarray data: sensitivity analysis for differential expression and clustering. PLoS One, 2012. 7(12): p. e50986. 75. Li, R., et al., The sequence and de novo assembly of the giant panda genome. Nature, 2010. 463(7279): p. 311-7. 76. Vallender, E.J., Expanding whole exome resequencing into non-human primates. Genome Biol, 2011. 12(9): p. R87. 77. Voelkerding, K.V., S. Dames, and J.D. Durtschi, Next generation sequencing for clinical diagnostics-principles and application to targeted resequencing for hypertrophic cardiomyopathy: a paper from the 2009 William Beaumont Hospital Symposium on Molecular Pathology. J Mol Diagn, 2010. 12(5): p. 539-51. 78. Friedlander, M.R., et al., Discovering microRNAs from deep sequencing data using miRDeep. Nat Biotechnol, 2008. 26(4): p. 407-15. 79. Xu, G., et al., Characterization of the small RNA transcriptomes of androgen dependent and independent prostate cancer cell line by deep sequencing. PLoS One, 2010. 5(11): p. e15519. 80. Zywicki, M., K. Bakowska-Zywicka, and N. Polacek, Revealing stable processing products from ribosome-associated small RNAs by deep-sequencing data analysis. Nucleic Acids Res, 2012. 40(9): p. 4013-24. 81. Zhang, J., et al., The impact of next-generation sequencing on genomics. J Genet Genomics, 2011. 38(3): p. 95-109. 82. Li, H., J. Ruan, and R. Durbin, Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res, 2008. 18(11): p. 1851-8. 83. Homer, N., B. Merriman, and S.F. Nelson, BFAST: an alignment tool for large scale genome resequencing. PLoS One, 2009. 4(11): p. e7767. 84. Novocraft. Novoalign. Available from: http://www.novocraft.com. 85. Li, H. and R. Durbin, Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics, 2010. 26(5): p. 589-95. 86. Burrows, M. and D.J. Wheeler, A block-sorting lossless data compression algorithm. HP Labs Technical Reports, 1994: p. SRC-RR-124. 87. Liu, C.M., et al., SOAP3: ultra-fast GPU-based parallel alignment tool for short reads. Bioinformatics, 2012. 28(6): p. 878-9. 88. Treangen, T.J. and S.L. Salzberg, Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet, 2012. 13(1): p. 36-46. 89. Miller, J.R., S. Koren, and G. Sutton, Assembly algorithms for next-generation sequencing data. Genomics, 2010. 95(6): p. 315-27. 90. Jeck, W.R., et al., Extending assembly of short DNA sequences to handle error. Bioinformatics, 2007. 23(21): p. 2942-4. 91. Margulies, M., et al., Genome sequencing in microfabricated high-density picolitre reactors. Nature, 2005. 437(7057): p. 376-80. 92. Zerbino, D.R. and E. Birney, Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res, 2008. 18(5): p. 821-9. 93. DePristo, M.A., et al., A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet, 2011. 43(5): p. 491-8. 94. Li, H., et al., The Sequence Alignment/Map format and SAMtools. Bioinformatics, 2009. 25(16): p. 2078-9. 95. Koboldt, D.C., et al., VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics, 2009. 25(17): p. 2283-5. 96. Koboldt, D.C., et al., VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res, 2012. 22(3): p. 568-76. 97. Larson, D.E., et al., SomaticSniper: identification of somatic point mutations in whole genome sequencing data. Bioinformatics, 2012. 28(3): p. 311-7. 98. Roth, A., et al., JointSNVMix: a probabilistic model for accurate detection of somatic mutations in normal/tumour paired next-generation sequencing data. Bioinformatics, 2012. 28(7): p. 907-13. 99. Feuk, L., A.R. Carson, and S.W. Scherer, Structural variation in the human genome. Nat Rev Genet, 2006. 7(2): p. 85-97. 100. Chen, K., et al., BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods, 2009. 6(9): p. 677-81. 101. Hormozdiari, F., et al., Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery. Bioinformatics, 2010. 26(12): p. i350-7. 102. Zeitouni, B., et al., SVDetect: a tool to identify genomic structural variations from paired-end and mate-pair sequencing data. Bioinformatics, 2010. 26(15): p. 1895-6. 103. Korbel, J.O., et al., PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data. Genome Biol, 2009. 10(2): p. R23. 104. Grabherr, M.G., et al., Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol, 2011. 29(7): p. 644-52. 105. Robertson, G., et al., De novo assembly and analysis of RNA-seq data. Nat Methods, 2010. 7(11): p. 909-12. 106. Schulz, M.H., et al., Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels. Bioinformatics, 2012. 28(8): p. 1086-92. 107. Langmead, B., et al., Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol, 2009. 10(3): p. R25. 108. Langmead, B. and S.L. Salzberg, Fast gapped-read alignment with Bowtie 2. Nat Methods, 2012. 9(4): p. 357-9. 109. Trapnell, C., L. Pachter, and S.L. Salzberg, TopHat: discovering splice junctions with RNA-Seq. Bioinformatics, 2009. 25(9): p. 1105-11. 110. Thorvaldsdottir, H., J.T. Robinson, and J.P. Mesirov, Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform, 2012. 111. Robinson, J.T., et al., Integrative genomics viewer. Nat Biotechnol, 2011. 29(1): p. 24-6. 112. Anders, S., P.T. Pyl, and W. Huber, HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics, 2015. 31(2): p. 166-9. 113. Bullard, J.H., et al., Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics, 2010. 11: p. 94. 114. Marioni, J.C., et al., RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res, 2008. 18(9): p. 1509-17. 115. Anders, S. and W. Huber, Differential expression analysis for sequence count data. Genome Biol, 2010. 11(10): p. R106. 116. Hardcastle, T.J. and K.A. Kelly, baySeq: empirical Bayesian methods for identifying differential expression in sequence count data. BMC Bioinformatics, 2010. 11: p. 422. 117. Robinson, M.D., D.J. McCarthy, and G.K. Smyth, edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 2010. 26(1): p. 139-40. 118. Roberts, A., et al., Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biol, 2011. 12(3): p. R22. 119. Trapnell, C., et al., Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol, 2010. 28(5): p. 511-5. 120. Roberts, A., et al., Identification of novel transcripts in annotated genomes using RNA-Seq. Bioinformatics, 2011. 27(17): p. 2325-9. 121. Trapnell, C., et al., Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat Biotechnol, 2012. 31(1): p. 46-53. 122. Miller, A.C., et al., RNA-seq-based mapping and candidate identification of mutations from forward genetic screens. Genome Res, 2013. 123. Canovas, A., et al., SNP discovery in the bovine milk transcriptome using RNA-Seq technology. Mamm Genome, 2010. 21(11-12): p. 592-8. 124. Chepelev, I., et al., Detection of single nucleotide variations in expressed exons of the human genome using RNA-Seq. Nucleic Acids Res, 2009. 37(16): p. e106. 125. Hill, J.T., et al., MMAPPR: Mutation Mapping Analysis Pipeline for Pooled RNA-seq. Genome Res, 2013. 126. Seila, A.C., et al., Divergent transcription from active promoters. Science, 2008. 322(5909): p. 1849-51. 127. Affymetrix, E.T.P. and E.T.P. Cold Spring Harbor Laboratory, Post-transcriptional processing generates a diversity of 5'-modified long and short RNAs. Nature, 2009. 457(7232): p. 1028-32. 128. Taft, R.J., et al., Tiny RNAs associated with transcription start sites in animals. Nat Genet, 2009. 41(5): p. 572-8. 129. Guttman, M., et al., lincRNAs act in the circuitry controlling pluripotency and differentiation. Nature, 2011. 477(7364): p. 295-300. 130. Ng, J.H. and H.H. Ng, LincRNAs join the pluripotency alliance. Nat Genet, 2010. 42(12): p. 1035-6. 131. Jacquier, A., The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs. Nat Rev Genet, 2009. 10(12): p. 833-44. 132. McGettigan, P.A., Transcriptomics in the RNA-seq era. Curr Opin Chem Biol, 2013. 133. Lab, H. FASTX-Toolkit. Available from: http://hannonlab.cshl.edu/fastx_toolkit/. 134. Buffalo, V. scythe: A 3'-end Aadapter Contaminant Trimmer. Available from: https://github.com/vsbuffalo/scythe. 135. Martin, M., Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal, 2011. 17(1). 136. Patel, R., K. and M. Jain, NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data. PLoS One, 2012. 7(2): p. e30619. 137. Bioinformatics, B. FastQC: A quality control tool for high throughput sequence data.; Available from: http://www.bioinformatics.babraham.ac.uk/. 138. Buffalo, V., qrqc: Quick Read Quality Control. 2012. 139. Burge, S.W., et al., Rfam 11.0: 10 years of RNA families. Nucleic Acids Res, 2013. 41(Database issue): p. D226-32. 140. Kozomara, A. and S. Griffiths-Jones, miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res, 2011. 39(Database issue): p. D152-7. 141. Hendrix, D., M. Levine, and W. Shi, miRTRAP, a computational method for the systematic identification of miRNAs from high throughput sequencing data. Genome Biol, 2010. 11(4): p. R39. 142. Mathelier, A. and A. Carbone, MIReNA: finding microRNAs with high accuracy and no learning at genome scale and from deep sequencing data. Bioinformatics, 2010. 26(18): p. 2226--2234. 143. Guan, D.-G., et al., mirExplorer: detecting microRNAs from genome and next generation sequencing data using the AdaBoost method with transition probability matrix and combined features. RNA Biol, 2011. 8(5): p. 922--934. 144. Hackenberg, M., N. Rodriguez-Ezpeleta, and A.M. Aransay, miRanalyzer: an update on the detection and analysis of microRNAs in high-throughput sequencing experiments. Nucleic Acids Res, 2011. 39(Web Server issue): p. W132-8. 145. Friedlander, M.R., et al., Discovering microRNAs from deep sequencing data using miRDeep. Nat Biotech, 2008. 26(4): p. 407--415. 146. Friedlander, M.R., et al., miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res, 2012. 40(1): p. 37--52. 147. Huang, P.-J., et al., DSAP: deep-sequencing small RNA analysis pipeline. Nucleic Acids Res, 2010. 38(Web Server issue): p. W385-91. 148. Yang, X. and L. Li, miRDeep-P: a computational tool for analyzing the microRNA transcriptome in plants. Bioinformatics, 2011. 27(18): p. 2614--2615. 149. Barzon, L., et al., Applications of Next-Generation Sequencing Technologies to Diagnostic Virology. International Journal of Molecular Sciences, 2011. 12(11): p. 7861--7884. 150. Vodovar, N., et al., In silico reconstruction of viral genomes from small RNAs improves virus-derived small interfering RNA profiling. J Virol, 2011. 85(21): p. 11016--11021. 151. Li, Y., et al., Performance comparison and evaluation of software tools for microRNA deep-sequencing data analysis. Nucleic Acids Res, 2012. 40(10): p. 4298--4305. 152. Williamson, V., et al., Detecting miRNAs in deep-sequencing data: a software performance comparison and evaluation. Brief Bioinform, 2013. 14(1): p. 36--45. 153. Calin, G.A. and C.M. Croce, MicroRNA signatures in human cancers. Nat Rev Cancer, 2006. 6(11): p. 857--866. 154. Pritchard, C.C., H.H. Cheng, and M. Tewari, MicroRNA profiling: approaches and considerations. Nat Rev Genet, 2012. 13(5): p. 358--369. 155. Eipper-Mains, J.E., B.A. Eipper, and R.E. Mains, Global approaches to the role of miRNAs in drug-induced changes in gene expression. Frontiers in Genetics, 2012. 3(109). 156. Git, A., et al., Systematic comparison of microarray profiling, real-time PCR, and next-generation sequencing technologies for measuring differential microRNA expression. RNA, 2010. 16(5): p. 991--1006. 157. Linsen, S.E.V., et al., Limitations and possibilities of small RNA digital gene expression profiling. Nat Meth, 2009. 6(7): p. 474--476. 158. Tian, G., et al., Sequencing bias: comparison of different protocols of microRNA library construction. BMC Biotechnol, 2010. 10: p. 64. 159. Jayaprakash, A.D., et al., Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing. Nucleic Acids Res, 2011. 39(21): p. e141. 160. Illumina. Flicker 3.0. Available from: http://support.illumina.com/sequencing/sequencing_software/flicker_30_small_rna_analysis.ilmn. 161. Goodwin, E.C. and D. DiMaio, Repression of human papillomavirus oncogenes in HeLa cervical carcinoma cells causes the orderly reactivation of dormant tumor suppressor pathways. Proc Natl Acad Sci U S A, 2000. 97(23): p. 12513-8. 162. Harima, Y., et al., Prediction of outcome of advanced cervical cancer to thermoradiotherapy according to expression profiles of 35 genes selected by cDNA microarray analysis. Int J Radiat Oncol Biol Phys, 2004. 60(1): p. 237-48. 163. Ross, D.T., et al., Systematic variation in gene expression patterns in human cancer cell lines. Nat Genet, 2000. 24(3): p. 227-35. 164. Forbes, S.A., et al., COSMIC: exploring the world's knowledge of somatic mutations in human cancer. Nucleic Acids Res, 2015. 43(Database issue): p. D805-11. 165. Barretina, J., et al., The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 2012. 483(7391): p. 603-7. 166. Shoemaker, R.H., The NCI60 human tumour cell line anticancer drug screen. Nat Rev Cancer, 2006. 6(10): p. 813-23. 167. Roth, R.B., et al., Gene expression analyses reveal molecular relationships among 20 regions of the human CNS. Neurogenetics, 2006. 7(2): p. 67-80. 168. Sandberg, R. and I. Ernberg, Assessment of tumor characteristic gene expression in cell lines using a tissue similarity index (TSI). Proc Natl Acad Sci U S A, 2005. 102(6): p. 2052-7. 169. Sharma, S.V., D.A. Haber, and J. Settleman, Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents. Nature reviews cancer, 2010. 10(4): p. 241-253. 170. Hanahan, D. and R.A. Weinberg, Hallmarks of cancer: the next generation. cell, 2011. 144(5): p. 646-674. 171. Rajski, M., et al., Global gene expression analysis of the interaction between cancer cells and osteoblasts to predict bone metastasis in breast cancer. PloS one, 2012. 7(1): p. e29743. 172. Goodspeed, A., et al., Tumor-derived cell lines as molecular models of cancer pharmacogenomics. Molecular Cancer Research, 2016. 14(1): p. 3-13. 173. Brazma, A., et al., ArrayExpress--a public repository for microarray gene expression data at the EBI. Nucleic Acids Res, 2003. 31(1): p. 68-71. 174. Koleti, A., et al., Data Portal for the Library of Integrated Network-based Cellular Signatures (LINCS) program: integrated access to diverse large-scale cellular perturbation response data. Nucleic Acids Res, 2018. 46(D1): p. D558-D566. 175. Artimo, P., et al., ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res, 2012. 40(Web Server issue): p. W597-603. 176. Barber, R.D., et al., GAPDH as a housekeeping gene: analysis of GAPDH mRNA expression in a panel of 72 human tissues. Physiological Genomics, 2005. 21(3): p. 389-395. 177. de Kok, J.B., et al., Normalization of gene expression measurements in tumor tissues: comparison of 13 endogenous control genes. Laboratory Investigation, 2005. 85(1): p. 154-159. 178. Müllner, D., Modern hierarchical, agglomerative clustering algorithms. arXiv preprint arXiv:1109.2378, 2011. 179. Tipping, M.E. and C.M. Bishop, Mixtures of probabilistic principal component analyzers. Neural Computation, 1999. 11(2): p. 443-482. 180. Bolstad, B.M., et al., A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics, 2003. 19(2): p. 185-93. 181. Pelz, C.R., et al., Global rank-invariant set normalization (GRSN) to reduce systematic distortions in microarray data. BMC Bioinformatics, 2008. 9: p. 520. 182. Iwamoto, T., et al., Estrogen receptor (ER) mRNA and ER-related gene expression in breast cancers that are 1% to 10% ER-positive by immunohistochemistry. J Clin Oncol, 2012. 30(7): p. 729-34. 183. Rapoport, A.P., et al., Flavopiridol induces apoptosis and caspase-3 activation of a newly characterized Burkitt's lymphoma cell line containing mutant p53 genes. Blood Cells Mol Dis, 2001. 27(3): p. 610-24. 184. Rae, J.M., et al., MDA-MB-435 cells are derived from M14 melanoma cells--a loss for breast cancer, but a boon for melanoma research. Breast Cancer Res Treat, 2007. 104(1): p. 13-9. 185. Ikediobi, O.N., et al., Mutation analysis of 24 known cancer genes in the NCI-60 cell line set. Mol Cancer Ther, 2006. 5(11): p. 2606-12. 186. Leroy, B., et al., Analysis of TP53 mutation status in human cancer cell lines: a reassessment. Hum Mutat, 2014. 35(6): p. 756-65. 187. Scudiero, D.A., A. Monks, and E.A. Sausville, Cell line designation change: multidrug-resistant cell line in the NCI anticancer screen. J Natl Cancer Inst, 1998. 90(11): p. 862. 188. Parker, J.S., et al., Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol, 2009. 27(8): p. 1160-7. 189. Holst, F., et al., Estrogen receptor alpha (ESR1) gene amplification is frequent in breast cancer. Nature Genetics, 2007. 39(5): p. 655-660. 190. Robinson, D.R., et al., Activating ESR1 mutations in hormone-resistant metastatic breast cancer. Nat Genet, 2013. 45(12): p. 1446-51. 191. van't Veer, L.J., et al., Gene expression profiling predicts clinical outcome of breast cancer. Nature, 2002. 415(6871): p. 530-536. 192. McGrath, E.L., et al., Differential Responses of Human Fetal Brain Neural Stem Cells to Zika Virus Infection. Stem Cell Reports, 2017. 8(3): p. 715-727. 193. Sun, Y. and Q. Liu, Deciphering the Correlation between Breast Tumor Samples and Cell Lines by Integrating Copy Number Changes and Gene Expression Profiles. Biomed Research International, 2015. 194. Rhodes, D.R., et al., ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia, 2004. 6(1): p. 1-6. 195. Bridgman, C.L., Habitat use, distribution and conservation status of the mikado pheasant (Syrmaticus mikado) in Taiwan. Ecology and Evolutionary Biology. Vol. Doctor of Philosophy. 2002: The University of Tennessee. 113. 196. Severinghaus, S.R., A study of the Swinhoe's and Mikado pheasants in Taiwan with recommendations for their conservation. 1977: Cornell University, May. 197. McGowan, P.J. and P.J. Garson, Pheasants: status survey and conservation action plan 1995-1999. Vol. 24. 1995: IUCN. 198. Fuller, R.A., Pheasants: status survey and conservation action plan 2000-2004. Vol. 51. 2000: IUCN. 199. Yu, L., et al., Genomic analysis of snub-nosed monkeys (Rhinopithecus) identifies genes and processes related to high-altitude adaptation. Nat Genet, 2016. 48(8): p. 947-52. 200. Le Duc, D., et al., Kiwi genome provides insights into evolution of a nocturnal lifestyle. Genome Biol, 2015. 16: p. 147. 201. Li, S., et al., Genomic signatures of near-extinction and rebirth of the crested ibis and other endangered bird species. Genome Biol, 2014. 15(12): p. 557. 202. Hung, C.M., et al., Drastic population fluctuations explain the rapid extinction of the passenger pigeon. Proc Natl Acad Sci U S A, 2014. 111(29): p. 10636-41. 203. Qiu, Q., et al., The yak genome and adaptation to life at high altitude. Nat Genet, 2012. 44(8): p. 946-9. 204. Qu, Y., et al., Ground tit genome reveals avian adaptation to living at high altitudes in the Tibetan plateau. Nat Commun, 2013. 4: p. 2071. 205. Zhan, X.J. and Z.W. Zhang, Molecular phylogeny of avian genus Syrmaticus based on the mitochondrial cytochrome B gene and control region. Zoolog Sci, 2005. 22(4): p. 427-35. 206. Lee, P., et al., A wildlife distribution database in Taiwan. Council of Agriculture, Taipei, 1998. 207. Bolger, A.M., M. Lohse, and B. Usadel, Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 2014. 30(15): p. 2114-20. 208. Leggett, R.M., et al., NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries. Bioinformatics, 2014. 30(4): p. 566-8. 209. Zimin, A | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/22122 | - |
dc.description.abstract | 由於定序技術的成本降低,每天都產生了大量的資料,使得基因體的研究進入了一個新的時代。然而如何有效、正確和方便地處理和分析這些高通量的資料仍是一項挑戰,有鑑於此,本論文目標在利用生物資訊學的方法,解決高通量實驗技術在基因體研究上所遇到的問題。具體而言,本研究包含二大方向:發展全面性的線上分析系統,以及應用次世代定序的分析流程(pipeline),並分成四個主題進行探討。
在第一個主題中,為了解決在多個預測演算法中出現不一致的預測結果和微型核糖核酸(miRNA)頻繁改變的命名方式,本論文發展了一個線上工具-miRSystem,它同時具備預測miRNA目標基因和進行生物功能與調控途徑分析的能力。本程式從整合的7個預測演算法和2個經過實驗驗證的資料中,來找出有相同結果的miRNA目標基因,接著利用這些基因並透過2種不同的富集(enrichment)演算法進一步尋找出下游的生物功能和調控途徑。此外,利用附屬在本系統下的程式miRConverter可以消除miRNA在命名上的潛在差異,並可以從給定的序列中搜尋出相似的miRNA。 在第二個主題中,研究旨在探討次世代定序的各種技術及廣泛應用,並從中討論、提出資料處理的分析流程和給予在基因體學、轉錄體學與miRNA研究中合適工具的選擇建議。 在第三個主題中,研究著重在細胞株和臨床樣本的內生性基因表現。「如何選擇一個合適的細胞株作為實驗模型」、「如何在橫跨數個龐大的資料集當中,進行全面性的分析與有效的視覺化其結果」,這些問題往往是各生物實驗室的主要挑戰。有鑑於此,本研究開發了CellExpress線上系統,具有基因表現查詢、相似性評估、基因標誌探索和使用者資料分析等四大功能,將有助於使用者從指定的細胞株或是臨床樣本中,進行有正規化過的基因表現量查詢、比較樣本之間的差異性或相似性、和找到有顯著改變的基因,此外,可針對使用者所上傳的資料,去跟現行系統的資料庫進行比較,並尋找出相似的基因表現圖譜(profiling)。 在第四個主題中,本論文建立了一條全新基因體組裝的分析流程,並且完成了第一個帝雉的全基因體定序。帝雉的基因體草圖包含約10億個鹼基對和15,972個註解基因,並顯示其能量代謝、氧氣運輸、血紅素結合、輻射反應、免疫反應和DNA修復等功能的基因,受到在演化適應上的正向選擇與基因數量擴張的影響。此外,在一條22.7萬個鹼基對的主要組織相容性複合體(MHC)序列的連續區域中,利用手動整理註解出39個可能的基因,比較原雞相同的基因座發現具有高程度的相似性,但也發現了2個基因翻轉點在TAPBP和TAP1-TAP2基因之間。接著對帝雉粒線體DNA進行定序和組裝,並且跟其他4種長尾雉做比較,從分子時鐘的分析推測帝雉的祖先大約是在347萬年前從北方遷徙到台灣。 整體而言,本論文提出了在基因體學上的應用及方法。結果表明所提出的工具不但都可以有效地解決基因體上的問題,而且所提出的分析流程亦成功地呈現了帝雉基因體對適應高海拔的見解。 | zh_TW |
dc.description.abstract | Owing to the reduced cost of sequencing technology, a massive amount of data has been generated every day and made genomic research enter a new era. However, it is a challenge to process and analyze these kinds of high-throughput data effectively, correctly, and conveniently. The dissertation aimed to address the issue of high-throughput experimental techniques in genomic research using bioinformatics approaches. Specifically, development of comprehensive and web-based analysis systems and application of NGS pipelines were the two main approaches which were further divided into four topics of investigations.
In the first topic, miRSystem was developed to overcome the issues of inconsistently predicted results across multiple prediction algorithms and the nomenclature of microRNAs (miRNAs) with a frequent change. MiRSystem was a web-based system to support miRNA target prediction and analysis of biological functions and pathways simultaneously. Seven prediction algorithms and two experimentally validated data sources were integrated into the program to reveal genes consistently targeted by miRNAs. These identified target genes were further investigated downstream biological functions and pathways by two enrichment algorithms. In addition, miRConverter, an accessory of the system, was proposed to remove the potential discrepancies in the nomenclature of miRNAs and to search similar miRNAs from a given sequence. To investigate broad applications of NGS, in the second topic, the author discussed the available implementations of NGS technologies, presented guidelines for data processing pipelines, and made suggestions for selecting suitable tools in genomics, transcriptomics, and small RNA research. In the third topic, the study focused on endogenous gene expression in cell lines and clinical samples. In a biological laboratory, it still poses a major challenge in how to select an appropriate cell line to serve as an experimental model and how to perform comprehensive analysis and efficient visualization results across several large datasets at once. To address these issues, an online system, CellExpress, was proposed with four functions, including gene expression search, similarity assessment, gene signature explorer, and user data analysis. These functions were a great benefit to query normalized gene expression values, to compare the difference or similarity, to identify significantly changed genes from specified cell lines and clinical samples, and to compare gene expression profiling with user uploaded data and existing datasets in the system, respectively. In the fourth topic, the study established an NGS analysis pipeline for de novo genome assembly and completed the first whole-genome sequencing of the Mikado pheasant. The draft genome of the Mikado pheasant, which consisted of 1.04 Gb sequences and 15,972 annotated protein-coding genes, and displayed expansion and positive selection of genes related to features that contributed to its adaptive evolution, such as energy metabolism, oxygen transport, hemoglobin binding, radiation response, immune response, and DNA repair. Furthermore, the major histocompatibility complex (MHC) region which contained 39 putative genes spanning 227 kb on a contiguous region were annotated and manually curated. The MHC loci of the pheasant revealed a high level of synteny and two inversions of TAPBP and TAP1-TAP2 genes compared with the same loci in the chicken. The complete mitochondrial genome was also sequenced, assembled, and compared against four other long-tailed pheasants. The result from molecular clock analysis suggested that ancestors of the Mikado pheasant might migrate from the north to Taiwan about 3.47 million years ago. In conclusion, this dissertation presented a comprehensive approach for genomic research and applications. The results demonstrated that the proposed tools all effectively addressed genomic issues as well as proposed pipelines successfully revealed insights into the adaptation to high altitude from the Mikado pheasant genome. | en |
dc.description.provenance | Made available in DSpace on 2021-06-08T04:03:41Z (GMT). No. of bitstreams: 1 ntu-107-D00945006-1.pdf: 18346849 bytes, checksum: bfad765a8756e3c25c468723f8491b99 (MD5) Previous issue date: 2018 | en |
dc.description.tableofcontents | 誌謝 i
中文摘要 ii Abstract iv Table of Contents vii List of Figures x List of Tables xii Chapter 1. Introduction 1 1.1. High-throughput experimental techniques 1 1.1.1. Microarray technology 1 1.1.2. Next-generation sequencing 2 1.2. Applications of bioinformatics approaches to two web-based systems 4 1.3. The analysis pipeline of NGS 5 1.4. De novo genome assembly 7 1.5. Figure 11 Chapter 2. miRSystem: An integrated system for characterizing enriched functions and pathways of microRNA targets 12 2.1. Abstract 12 2.2. Introduction 13 2.3. Materials and methods 16 2.3.1. Overview of miRSystem workflow 16 2.3.2. Database construction and contents 17 2.3.3. Database characteristics in predicting miRNA-gene relationships 18 2.3.4. Simulation of the null baselines for identifying biological functions that target genes are enriched 19 2.3.5. Weighted pathway-ranking method for identifying enriched biological functions 19 2.4. Results 20 2.4.1. Website interface 20 2.4.2. Example 1: Conversion of miRNA IDs across different annotation versions 21 2.4.3. Example 2: Prediction of functions that grouped miRNAs are enriched 21 2.4.4. Example 3: Demonstration of analyzing datasets in public domain 23 2.5. Discussion 25 2.6. Figures 28 2.7. Tables 37 Chapter 3. Common applications of next-generation sequencing technologies in genomic research 45 3.1. Abstract 45 3.2. Introduction 45 3.3. DNA sequencing data analysis 47 3.3.1. Preprocessing procedures 47 3.3.2. Read alignment 48 3.3.3. De novo assembly 49 3.3.4. Single nucleotide variant (SNV) detection 50 3.3.5. Structural variation detection 50 3.4. RNA sequencing data analysis 51 3.4.1. De novo transcriptome assembly 51 3.4.2. Expression profiling analysis 52 3.4.3. Variant calling and transcriptomic epigenetics 54 3.5. Small RNA sequencing 55 3.5.1. General workflow 56 3.5.2. Small RNA prediction 57 3.5.3. MiRNA characterization 58 3.6. Future perspectives 59 3.7. Figures 60 3.8. Tables 62 Chapter 4. CellExpress: A comprehensive microarray-based cancer cell line and clinical sample gene expression analysis online system 67 4.1. Abstract 67 4.2. Introduction 68 4.3. Materials and methods 71 4.3.1. System implementation 71 4.3.2. Gene expression search 72 4.3.3. Statistical analysis of gene signature in the CellExpress system 73 4.3.4. Similarity assessment of gene expression and user data analysis 73 4.4. Results 75 4.4.1. Web interface 75 4.4.2. Overview of the major functions 76 4.4.3. Example 1 – Comparison of ESR1 expression between ER and ER cell lines 77 4.4.4. Example 2 – Similarity assessment of gene expression profiles in different cell lines 78 4.4.5. Example 3 – The gene signature associated with estrogen receptor expression in breast cancer 79 4.4.6. Example 4 – The classification of stem cells 81 4.5. Discussion 82 4.6. Figures 85 4.7. Tables 93 Chapter 5. Whole-genome de novo sequencing reveals unique genes that contributed to the adaptive evolution of the Mikado pheasant 100 5.1. Abstract 100 5.2. Introduction 101 5.3. Materials and methods 104 5.3.1. Sample preparation and sequencing 104 5.3.2. De novo genome assembly 104 5.3.3. Evaluation of assembly quality 106 5.3.4. Genome comparison 107 5.3.5. Gene prediction and annotation 108 5.3.6. Gene families 109 5.3.7. Examination of genes under positive selection and enrichment analysis 111 5.3.8. Mitochondrial genome assembly 113 5.4. Results 114 5.4.1. Genome assembly and annotation 114 5.4.2. Assessment of the assembly quality 115 5.4.3. Genome comparison 116 5.4.4. Phylogenetic relationships of the Mikado pheasant 118 5.4.5. Gene family evolution 119 5.4.6. Positive selection 120 5.4.7. Identification of the MHC-B region of the Mikado pheasant 122 5.4.8. Evolutionary history of Syrmaticus pheasants 124 5.4.9. Amino acid substitution analysis in Mikado pheasant hemoglobin genes 125 5.5. Discussion 125 5.6. Figures 133 5.7. Tables 151 Chapter 6. Perspectives 168 6.1. Future directions 168 6.2. Conclusion 172 References 173 Appendix 192 | |
dc.language.iso | en | |
dc.title | 建立基因體研究之多元分析系統及其應用 | zh_TW |
dc.title | Development of Comprehensive Analysis Systems for Genomic Research and Applications | en |
dc.type | Thesis | |
dc.date.schoolyear | 106-2 | |
dc.description.degree | 博士 | |
dc.contributor.oralexamcommittee | 王弘毅(Hurng-Yi Wang),蔡懷寬(Huai-Kuang Tsai),陳沛隆(Pei-Lung Chen),曾新穆(Vincent S. Tseng),洪志銘(Chih-Ming Hung) | |
dc.subject.keyword | 基因體研究,微陣列晶片,次世代定序,線上系統,次世代定序分析流程,微型核糖核酸,帝雉, | zh_TW |
dc.subject.keyword | Genomic research,Microarray,Next-generation sequencing,Online system,NGS pipeline,miRNA,Mikado pheasant, | en |
dc.relation.page | 224 | |
dc.identifier.doi | 10.6342/NTU201802367 | |
dc.rights.note | 未授權 | |
dc.date.accepted | 2018-08-02 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 生醫電子與資訊學研究所 | zh_TW |
顯示於系所單位: | 生醫電子與資訊學研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-107-1.pdf 目前未授權公開取用 | 17.92 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。