Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 管理學院
  3. 資訊管理學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/34202
Full metadata record
???org.dspace.app.webui.jsptag.ItemTag.dcfield???ValueLanguage
dc.contributor.advisor李瑞庭(Anthony J. T. Lee)
dc.contributor.authorChieh-Chun Chenen
dc.contributor.author陳玠均zh_TW
dc.date.accessioned2021-06-13T05:57:59Z-
dc.date.available2007-07-03
dc.date.copyright2006-07-03
dc.date.issued2006
dc.date.submitted2006-06-27
dc.identifier.citation[1] Rakesh Agrawal, Ramakrishnan Srikant, Mining sequential patterns, In Proceedings of the IEEE 7th International Conference on Data Engineering, 1995, pp. 3-14.
[2] Orly Alter, Patrick O. Brown and David Botstein, Singular value decomposition for genome-wide expression data processing and modeling, In Proceedings of the National Academy of Sciences, vol. 96, 2000, pp. 10101-10106.
[3] Dhammika Amaratunga and Javier Cabrera, Exploration and analysis of DNA microarray and protein array data, Wiley series in probability and statistics, New Jersey, USA, 2004.
[4] Yoseph Barash and Nir Friedman, Context-specific Bayesian clustering for gene expression data, In Proceedings of the Fifth Annual International Conference on Research in Computational Molecular Biology, Montreal, Canada, ACM Press, New York, 2001, pp. 12-21.
[5] Bryan Bergeron, Microarrays, Bioinformatics Computing, Prentice Hall, New Jersey, USA, 2003, pp. 222-257.
[6] Ting Chen, Vladimir Filkov, Steven S. Skiena, Identifying gene regulatory networks from experimental data, In Proceedings of the Third Annual International Conference on Research in Computational Molecular Biology, Lyon, France, ACM Press, 1999, pp. 94-103.
[7] Yen-Liang Chen, Shih-Sheng Chen and Ping-Yu Hsu, Mining hybrid sequential patterns and sequential rules, Information Systems, vol. 27, 2002, pp. 345-362.
[8] Yizong Cheng and George M. Church, Biclustering of expression data, In Proceedings of the 8th International Conference on Interlligent Systems for Molecular Biology, AAAI Press, 2000, pp. 93-103.
[9] Gao Cong, Anthony K. H. Tung and Jiong Yang, FARMER: finding interesting rule groups in microarray datasets, In Proceedings of International Conference on Special Interest Group on Management of Data, 2004, pp. 143-154.
[10] Gao Cong, Kian-Lee Tan, Anthony K. H. Tung and Xin Xu, Mining top-k covering rule groups for gene expression data, In Proceedings of the 2005 International Conference on Special Interest Group on Management of Data, 2005, pp. 670-681.
[11] Chad Creighton and Samir Hanash, Mining gene expression database for association rules, Bioinformatics, Vol. 19, 2003, pp. 79-86.
[12] Vladmir Filkov, Steven Skiena, Jizu Zhi, Analysis techniques for microarray time-series data, In Proceedings of the Fifth Annual International Conference on Computational Biology, 2001, pp. 124-131.
[13] Liping Ji and Kian-Lee Tan, Mining gene expression data for positive and negative co-regulaed gene clusters, Bioinformatics, vol.20, 2004, pp. 2711-2718 .
[14] Liping Ji and Kian-Lee Tan, Identifying time-lagged gene clusters using gene expression data, Bioinformatics, vol. 21, 2005, pp. 509-516.
[15] Andrew T. Kwon, Holger H. Hoos, and Raymond Ng, Inference of transcriptional regulation relationships from gene expression data, Bioinformatics, vol. 19, 2003, pp. 905-912.
[16] Arthur M. Lest, Introduction to Bioinformatics, Oxford University Press, New York USA, 2002, pp. 68-69.
[17] Alan Wee-Chung Liew, Hong Yen, Mengsu Yang, Pattern recognition techniques for the emerging field of bioinformatics: A review, Pattern Recognition, vol.38, 2005, pp. 2055-2073.
[18] Mamoru Kato, Tstsuhiko Tsunoda and Toshihisa Takagi, Lag analysis of genetic networks in the cell cycle of budding yeast, Genome Informatics, vol.12, 2001, pp. 266-267.
[19] Saul B. Needleman and Christian D. Wunsch, A general method applicable to the search for similarities in the amino acid sequence of two proteins, Journal of Molecular Biology, vol. 48, 1970, pp. 443-453.
[20] De Pan, Fei Wang, Jiankui Guo and Jianhua Ding, A new method to mine gene regulation relationship information, In Proceedings of the Second International Conference on Fuzzy Systems and Knowledge Discovery, 2005, pp. 1051-1060.
[21] Karl Pearson, Mathematical contribution to the theory of evolution, VII: on the correlation of characters not quantitatively measurable, Philosophical Transactions of the Royal Society of London, London UK, 1900
[22] Gregory Piatetsky-Shapiro and Pablo Tamayo, Microarray data mining: facing the challenges, SIGKDD Explorations, vol. 5, no. 2, 2002, pp. 1-5.
[23] Alexander Schliep, Alexander Schonhuth and Christine Steinhoff, Using hidden markov models to analyze gene expression time course data, Bioinformatics, vol. 19, 2003, pp. 264-272.
[24] Pablo Tamayo, Donna Slonim, Jill Mesirov, Qing Zhu, Sutisak Kitareewan, Ethan Dmitrovsky, Eric S. Lander and Todd R. Golub, Interpreting patterns of gene expression with self organizing maps: Methods and applications to hematopoietic differentiation, In Proceedings of the National Academy of Sciences, vol.96, 1999, pp. 2907-2912.
[25] Hannu Toivonen, Sampling large databases for association rules, In Proceedings of 22th International Conference on Very Large Data Bases, 1996, pp. 134-145.
[26] Jing Yu, V. Anne Smith, Paul P. Wang, Alexander J. Hartemink and Erich D. Jarvis, Advances to Bayesian network inference for generating causal networks from observational biological data, Bioinformatics, vol. 20, 2004, pp. 3594-3603.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/34202-
dc.description.abstract基因表現之時間序列資料分析能夠找出基因與基因群集之間的調節關係。然而,現有的分析方法如event method和q-cluster method都有其限制。Event method只能找出基因對之間的調節關係,而無法提供調節關係詳細的資訊;q-cluster method則受限於調節樣式長度限制。因此,我們提出一個有效率的資料探勘方法,以找出所有重要的調控樣式而且不受到樣式長度的限制。利用所找出的樣式資訊,我們可以進一步分析基因與基因群集之間的調節關係。
首先,我們將生物晶片資料矩陣轉換為基因改變傾向矩陣。然後,我們將在某一連續時間內擁有相同調控樣式的基因分群,並記錄其詳細資訊,包括基因編號以及發生時間。藉由這些詳細資訊,利用逐層式的組合,我們能夠進一步擴展調控樣式的長度,找到所有重要的調控樣式。最後,我們分析這些基因群集的特性並找出彼此之間的調節關係。
為了評估所提出方法,我們進行兩個實驗。首先,我們利用模擬的資料來評估所提出方法的效率及擴充性。接著,我們利用基因本體論(Gene Ontology)以及439個已被生物學家證實的調控基因對來評估所提出方法的效能。結果顯示,我們所提出的方法不僅具有效率及擴充性,並且可以有效的找出基因群集之間的調節關係。
zh_TW
dc.description.abstractAnalyzing time series gene expression data provides a great opportunity to discover regulation relationships among genes and gene clusters. However, existing methods of mining gene regulation relationships, such as event method and q-cluster method, have their own limitations. The event method can only identify the relationships between gene pairs without the detailed time-lagged information and the q-cluster method limited by its pattern length can only find localized patterns. Therefore, in this thesis, we propose an approach that can efficiently mine all frequent regulation patterns without the limitation of pattern length and discover the regulation relationships among gene clusters with the detailed time-lagged information.
We first transform the raw data into a tendency matrix. Next, we group together genes sharing the same expression tendency over certain consecutive time points, and obtain their patterns and detailed information. Then, we extend the patterns obtained into longer patterns by a level-wise combination approach. Finally, we can analyze the characteristics of gene clusters and infer the regulation relationships among them. The experimental result demonstrates that our proposed method is efficient and scalable. Moreover, we use Gene Ontology and 439 regulation relationships proved by biologists to evaluate the effectiveness of our proposed method. The experimental result shows that our proposed method can reliably find those regulation relationships among gene clusters.
en
dc.description.provenanceMade available in DSpace on 2021-06-13T05:57:59Z (GMT). No. of bitstreams: 1
ntu-95-R93725007-1.pdf: 674112 bytes, checksum: ed35a3bf48461d34e373dcd464b2d229 (MD5)
Previous issue date: 2006
en
dc.description.tableofcontentsTABLE OF CONTENTS i
LIST OF FIGURES ii
LIST OF TABLES iii
CHAPTER 1 INTRODUCTION 1
CHAPTER 2 PRELIMINARIES AND PROBLEM DEFINITION 4
2.1 MICROARRAY DATASET 4
2.2 PROPERTIES OF CO-REGULATED GENES 4
2.3 PROBLEM DEFINITION 6
CHAPTER 3 OUR PROPOSED APPROACH 10
3.1 TRANSFORMING GENE EXPRESSION DATA INTO SYMBOLIC LEVELS 10
3.2 GENERATION OF SEED-PATTERN TABLE 12
3.3 EXTENDING FREQUENT PATTERNS BY COMBINATION 15
3.3.1 The combination algorithm 15
3.3.2 An example for combination 18
3.4 GENERATE CO-REGULATED RELATIONSHIPS 21
CHAPTER 4 PERFORMANCE ANALYSIS 25
4.1 PERFORMANCE EVALUATION 25
4.2 REAL DATA EXPERIMENTS 31
4.2.1 Data Sets 31
4.2.2 Regulation Relationships 32
CHAPTER 5 CONCLUDING REMARKS 44
REFERENCES 45
dc.language.isoen
dc.subject基因群集調節關係zh_TW
dc.subject資料探勘zh_TW
dc.subject基因群集zh_TW
dc.subject時間性基因表現資料分析zh_TW
dc.subjectTime-series gene expression dataen
dc.subjectdata miningen
dc.subjectgene regulation relationshipsen
dc.subjectgene clusteringen
dc.title由基因表現之時間序列資料探勘基因群集之調節關係zh_TW
dc.titleMining Regulation Relationships between Gene Clusters by Using Time-Series Gene Expression Dataen
dc.typeThesis
dc.date.schoolyear94-2
dc.description.degree碩士
dc.contributor.oralexamcommittee陳彥良(Yen-Liang Chen),劉敦仁(Duen-Ren Liu)
dc.subject.keyword時間性基因表現資料分析,基因群集,基因群集調節關係,資料探勘,zh_TW
dc.subject.keywordTime-series gene expression data,gene clustering,gene regulation relationships,data mining,en
dc.relation.page47
dc.rights.note有償授權
dc.date.accepted2006-06-28
dc.contributor.author-college管理學院zh_TW
dc.contributor.author-dept資訊管理學研究所zh_TW
Appears in Collections:資訊管理學系

Files in This Item:
File SizeFormat 
ntu-95-1.pdf
  Restricted Access
658.31 kBAdobe PDF
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved