TRIPS 樹探勘演算法的改進

Chien-Sheng Sher; 佘健生

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9566

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	顏嗣鈞
dc.contributor.author	Chien-Sheng Sher	en
dc.contributor.author	佘健生	zh_TW
dc.date.accessioned	2021-05-20T20:29:00Z	-
dc.date.available	2011-08-04
dc.date.available	2021-05-20T20:29:00Z	-
dc.date.copyright	2008-08-04
dc.date.issued	2008
dc.date.submitted	2008-08-01
dc.identifier.citation	[1] M.J. Zaki and C.C. Aggarwal. XRules: an effective structural classifier for XML data. Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 316-325, 2003. [2] M.J. Zaki. E_ciently mining frequent trees in a forest. Proceedings of the eighth ACM SIGKDD conference on Knowledge discovery and data mining, 2002. [3] T. Asai, H. Arimura, T. Uno, S. Nakano (2003): Discovering Frequent Substructures in Large Unordered Trees. Proceedings of the International Conference on Discovery Science, Japan. [4] S. Nijssen, JN Kok (2003): Efficient discovery of frequent unordered trees. Proceedings of the International Workshop on Mining Graphs, Trees, and Sequences, Croatia. [5] S. Tatikonda, S. Parthasarathy, and T. Kurc. Trips and tides:New algorithms for tree mining. Technical Report ftp://ftp.cse.ohio-state.edu/pub/tech-report/2006/TR68.pdf, (OSU-CISRC-7/06-TR68), 2006. [6] M.J. Zaki (2005): Efficiently Mining Frequent Embedded Unordered Trees. Fundamenta Informaticae vol. 65, pp. 1-20. [7] H. Prufer. Neuer Beweis eines Satzesuber Permutationen. Archiv fur Mathematik und Physik, 27:742-744, 1918. [8] S. Tatikonda, S. Parthasarathy. An Efficient Parallel Tree Mining Algorithm for Emerging Commodity Processors.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/9566	-
dc.description.abstract	在處理大量資料時，將會需要一些特別的技巧來從資料中獲得比較有用的資訊，資料探勘（Data mining）便是一項從大量資料中尋找出隱藏在這些資料裡有用、相關資訊的技巧。在目前許多對於資料探勘的研究上，進展的方向也從探勘頻繁出現的物件集合逐漸朝向更加複雜的結構進行探勘，例如樹結構（tree）或圖（graph）等。在本文中，我們所提到的這個探勘頻繁子樹的問題，在過去已經被證實了可用於許多廣範圍的應用上，像是生物資訊（bioinformatics）、XML處理、計算語言學（computational linguistics）和web使用率探勘上。TRIPS是一個從樹資料庫中探勘子樹的新演算法，這個演算法比起一些過去的演算法還快，並且可以很廣泛的用於針對嵌入或歸納子樹，而子樹可以是有標記、無標記、有序、無序等。本文中主要會針對TRIPS演算法對於嵌入、有序子樹和嵌入、無序子樹上提出改進的做法，在實驗結果上也可以看到我們的做法在速度上所獲得的提升。在本文中，我們會先稍微介紹樹探勘演算法的概要與應用，然後說明介紹相關的基礎定義，再來詳細介紹一些有關的樹探勘演算法，並且之後會針對Shirsh Tatikonda提出的TRIPS [1] 演算法，提出我們改進的地方，以及在實驗上所驗證出來的結果。	zh_TW
dc.description.provenance	Made available in DSpace on 2021-05-20T20:29:00Z (GMT). No. of bitstreams: 1 ntu-97-R94921086-1.pdf: 4357352 bytes, checksum: 3d83e14211af122ac4ec7d22958963e3 (MD5) Previous issue date: 2008	en
dc.description.tableofcontents	摘要……………………………………………………………………iii 第一章序論……………………………………………………… 1 第二章理論基礎………………………………………………… 6 2.1 基本定義…………………………………………………… 6 2.2 問題定義…………………………………………………… 12 第三章樹探勘演算法…………………………………………… 14 3.1 頻繁子集問題……………………………………………… 14 3.2 TreeMiner & PatternMatcher演算法…………………… 16 3.3 SLEUTH演算法……………………………………………… 24 第四章 TRIPS演算法改進處 …………………………………… 28 4.1 TRIPS演算法 ……………………………………………… 28 4.2 針對TRIPS演算法在資料結構上的改進 ………………… 36 4.2.1 針對歸納、有序子樹的改進做法……………………… 36 4.2.2針對歸納、無序子樹的改進做法 ……………………… 38 4.2.3針對嵌入、有序子樹的改進做法 ……………………… 39 4.2.4針對嵌入、無序子樹的改進做法 ……………………… 43 第五章實驗結果與分析………………………………………… 46 5.1實驗環境 …………………………………………………… 46 5.2實驗方法與結果 …………………………………………… 47 第六章結論及未來工作展望…………………………………… 60 參考文獻 ………………………………………………………… 61
dc.language.iso	zh-TW
dc.title	TRIPS 樹探勘演算法的改進	zh_TW
dc.title	An Improved TRIPS Tree Mining Algorithm	en
dc.type	Thesis
dc.date.schoolyear	96-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	郭斯彥,雷欽隆,莊仁輝,黃秋煌
dc.subject.keyword	資料探勘,樹探勘,演算法,	zh_TW
dc.subject.keyword	Data mining,Tree mining,Algorithm,	en
dc.relation.page	62
dc.rights.note	同意授權(全球公開)
dc.date.accepted	2008-08-01
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	電機工程學研究所	zh_TW
顯示於系所單位：	電機工程學系

文件中的檔案：

檔案	大小	格式
ntu-97-1.pdf	4.26 MB	Adobe PDF	檢視/開啟

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。