請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31110完整後設資料紀錄
| DC 欄位 | 值 | 語言 |
|---|---|---|
| dc.contributor.advisor | 顏嗣鈞 | |
| dc.contributor.author | YU-FANG WU | en |
| dc.contributor.author | 吳於芳 | zh_TW |
| dc.date.accessioned | 2021-06-13T02:30:19Z | - |
| dc.date.available | 2007-02-02 | |
| dc.date.copyright | 2007-02-02 | |
| dc.date.issued | 2007 | |
| dc.date.submitted | 2007-01-25 | |
| dc.identifier.citation | [1] P. Bille. A Survey on Tree Edit Distance and Related Problems. Theoretical Computer Science, 337(1-3): 217-239, 2005.
[2] Kaizhong Zhang and Dennis Shasha. Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM Journal of Computing, 18: 1245–1262, 1989. [3] P.N. Klein. Computing the Edit-distance between Unrooted Ordered Trees. Proceedings of the 6th annual European Symposium on Algorithms (ESA) 1998., pages 91–102. Springer-Verlag, 1998. [4] A. Micheli and D. Rossin. Edit Distance Between Unlabeled Ordered Trees. RAIRO - Theoretical Informatics and Applications, 40: 593-609, 2006, [5] http://www.w3.org/XML [6] M. Garofalakis and A. Kumar. XML Stream Processing Using Tree-Edit Distance Embeddings. ACM Transactions on Database Systems, 30(1): 279-332, 2005. [7] G. Cormode and S. Muthukrishnan. The String Edit Distance Matching Problem with Moves. Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, 667-676, 2002. [8] T. Grust and M. van Keulen. Tree Awareness for Relational DBMS Kernels: Staircase Join. Intelligent Search on XML Data, 231-245, 2003. [9] P. Boncz, T. Grust, M. van Keulen, S. Manegold, J. Rittinger, and J. Teubner. MonetDB/XQuery: A Fast XQuery Processor Powered by a Relational Engine. Proceedings of the 2006 ACM SIGMOD international conference on Management of data, 479-490, 2006. [10] P. Boncz, S. Manegold, J. Rittinger. Updating the Pre/Post Plane in MonetDB/XQuery. Informal Proceedings of the Second International Workshop on XQuery Implementation, Experience, and Perspectives (XIME-P), June 16- 17, 2005. | |
| dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31110 | - |
| dc.description.abstract | 在資訊科學研究中,樹狀結構比較問題廣泛地出現在各種領域,其中也包括了可擴展標記語言文件的處理。而在解決樹狀結構比較問題時,樹狀結構編輯距離是一個常見且重要,並可用來量化樹狀結構差異的方法。因此,在比較大量串流可擴展標記語言文件時,有效率的樹狀結構嵌入演算法就相當重要。
在本篇論文中,我們提出一個新的樹狀距離嵌入演算法,用來比較從串流可擴展標記語言文件中取得的無標記有序樹狀結構。與最近的研究相較,我們的貢獻是在不增加時間及空間複雜度的情況下,簡化得到樹狀距離的過程。同時我們也分析了這個演算法在樹狀結構編輯距離上造成失真的上下界以及誤差的機率。 | zh_TW |
| dc.description.abstract | The problem of comparing tree structures occurs in various areas in computer science and engineering, including the application to XML data processing. To solve this problem, tree edit distance is a common and significant measurement defining the difference between two tree structures quantitatively. Efficient tree edit distance embedding algorithms are therefore of significant importance in comparing large streaming XML document trees. In this thesis, we propose a new algorithm to obtain edit distance between unlabeled ordered trees derived from streaming XML data. In comparison with the previous work, our contribution lies in simplifying the procedure of obtaining the tree edit distance without increasing the time and space complexities. The upper and lower bounds as well as the error probability of our algorithm are also analyzed in this thesis. | en |
| dc.description.provenance | Made available in DSpace on 2021-06-13T02:30:19Z (GMT). No. of bitstreams: 1 ntu-96-R93921142-1.pdf: 513817 bytes, checksum: 5bb5b6f526607ee067e7cf975a6ac8da (MD5) Previous issue date: 2007 | en |
| dc.description.tableofcontents | Acknowledgement……i
Abstract (Chinese)……ii Abstract (English)……iii Contents……iv List of Figures……vi List of Tables……viii Chapter 1 Introduction……1 Chapter 2 Preliminaries……5 Section 2.1 Tree Edit Distance……5 2.1.1 Definitions……6 2.1.2 Algorithms to Obtain Tree Edit Distance……9 2.1.3 Edit Distance between Unlabeled Trees……9 Section 2.2 XML Stream Processing……11 2.2.1 XML and Tree Structures……12 2.2.2 Data Stream and Pseudorandom Sketching……13 2.2.3 A Tree Edit Distance Embedding Algorithm……15 Section 2.3 Staircase Join……20 2.3.1 Pre/Post Plane Encoding of XML Document Trees……20 2.3.2 Relational XML Algebra……23 Chapter 3 Modified Tree Edit Distance Embedding Algorithm……27 Section 3.1 Motif of Modification……27 Section 3.2 The First Modified Algorithm……28 Section 3.3 Upper and Lower Bounds……29 3.3.1 Effect from Insertion and Deletion……30 3.3.2 Effect from Subtree-Moving……34 3.3.3 Ambiguities……40 Section 3.4 The Second Modified Algorithm……43 3.4.1 Definition……43 3.4.2 Analysis of Error Probabilities……44 3.4.3 Comparison with the Original Algorithm……47 Chapter 4 Tree Edit Distance on Pre/Post Plane Encoding……48 Section 4.1 Effect from Insertion and Deletion……48 Section 4.2 Effect from Subtree-Swapping……50 Section 4.3 Analysis……52 Chapter 5 Combination of Modified Tree Edit Distance Embedding Algorithm and Pre/Post Plane Encoding……55 Chapter 6 Conclusions and Future Works……60 References……62 | |
| dc.language.iso | en | |
| dc.subject | 可擴展標記語言文件 | zh_TW |
| dc.subject | 無標記有序樹狀結構 | zh_TW |
| dc.subject | 串流 | zh_TW |
| dc.subject | 演算法 | zh_TW |
| dc.subject | 樹狀結構編輯距離 | zh_TW |
| dc.title | 樹狀結構編輯距離在可擴展標記語言文件上之分析研究 | zh_TW |
| dc.title | Analysis of Tree Edit Distance on XML Data | en |
| dc.type | Thesis | |
| dc.date.schoolyear | 95-1 | |
| dc.description.degree | 碩士 | |
| dc.contributor.oralexamcommittee | 雷欽隆,王勝德,郭斯彥,王柏堯 | |
| dc.subject.keyword | 樹狀結構編輯距離,可擴展標記語言文件,演算法,串流,無標記有序樹狀結構, | zh_TW |
| dc.subject.keyword | Tree Edit Distance,XML,Algorithm,Streaming,Unlabeled Ordered Trees, | en |
| dc.relation.page | 64 | |
| dc.rights.note | 有償授權 | |
| dc.date.accepted | 2007-01-25 | |
| dc.contributor.author-college | 電機資訊學院 | zh_TW |
| dc.contributor.author-dept | 電機工程學研究所 | zh_TW |
| 顯示於系所單位: | 電機工程學系 | |
文件中的檔案:
| 檔案 | 大小 | 格式 | |
|---|---|---|---|
| ntu-96-1.pdf 未授權公開取用 | 501.77 kB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。
