Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 電機工程學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31110
Full metadata record
???org.dspace.app.webui.jsptag.ItemTag.dcfield???ValueLanguage
dc.contributor.advisor顏嗣鈞
dc.contributor.authorYU-FANG WUen
dc.contributor.author吳於芳zh_TW
dc.date.accessioned2021-06-13T02:30:19Z-
dc.date.available2007-02-02
dc.date.copyright2007-02-02
dc.date.issued2007
dc.date.submitted2007-01-25
dc.identifier.citation[1] P. Bille. A Survey on Tree Edit Distance and Related Problems. Theoretical Computer Science, 337(1-3): 217-239, 2005.
[2] Kaizhong Zhang and Dennis Shasha. Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM Journal of Computing, 18: 1245–1262, 1989.
[3] P.N. Klein. Computing the Edit-distance between Unrooted Ordered Trees. Proceedings of the 6th annual European Symposium on Algorithms (ESA) 1998., pages 91–102. Springer-Verlag, 1998.
[4] A. Micheli and D. Rossin. Edit Distance Between Unlabeled Ordered Trees. RAIRO - Theoretical Informatics and Applications, 40: 593-609, 2006,
[5] http://www.w3.org/XML
[6] M. Garofalakis and A. Kumar. XML Stream Processing Using Tree-Edit Distance Embeddings. ACM Transactions on Database Systems, 30(1): 279-332, 2005.
[7] G. Cormode and S. Muthukrishnan. The String Edit Distance Matching Problem with Moves. Proceedings of the thirteenth annual ACM-SIAM symposium on Discrete algorithms, 667-676, 2002.
[8] T. Grust and M. van Keulen. Tree Awareness for Relational DBMS Kernels: Staircase Join. Intelligent Search on XML Data, 231-245, 2003.
[9] P. Boncz, T. Grust, M. van Keulen, S. Manegold, J. Rittinger, and J. Teubner. MonetDB/XQuery: A Fast XQuery Processor Powered by a Relational Engine. Proceedings of the 2006 ACM SIGMOD international conference on Management of data, 479-490, 2006.
[10] P. Boncz, S. Manegold, J. Rittinger. Updating the Pre/Post Plane in MonetDB/XQuery. Informal Proceedings of the Second International Workshop on XQuery Implementation, Experience, and Perspectives (XIME-P), June 16- 17, 2005.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/31110-
dc.description.abstract在資訊科學研究中,樹狀結構比較問題廣泛地出現在各種領域,其中也包括了可擴展標記語言文件的處理。而在解決樹狀結構比較問題時,樹狀結構編輯距離是一個常見且重要,並可用來量化樹狀結構差異的方法。因此,在比較大量串流可擴展標記語言文件時,有效率的樹狀結構嵌入演算法就相當重要。
在本篇論文中,我們提出一個新的樹狀距離嵌入演算法,用來比較從串流可擴展標記語言文件中取得的無標記有序樹狀結構。與最近的研究相較,我們的貢獻是在不增加時間及空間複雜度的情況下,簡化得到樹狀距離的過程。同時我們也分析了這個演算法在樹狀結構編輯距離上造成失真的上下界以及誤差的機率。
zh_TW
dc.description.abstractThe problem of comparing tree structures occurs in various areas in computer science and engineering, including the application to XML data processing. To solve this problem, tree edit distance is a common and significant measurement defining the difference between two tree structures quantitatively. Efficient tree edit distance embedding algorithms are therefore of significant importance in comparing large streaming XML document trees. In this thesis, we propose a new algorithm to obtain edit distance between unlabeled ordered trees derived from streaming XML data. In comparison with the previous work, our contribution lies in simplifying the procedure of obtaining the tree edit distance without increasing the time and space complexities. The upper and lower bounds as well as the error probability of our algorithm are also analyzed in this thesis.en
dc.description.provenanceMade available in DSpace on 2021-06-13T02:30:19Z (GMT). No. of bitstreams: 1
ntu-96-R93921142-1.pdf: 513817 bytes, checksum: 5bb5b6f526607ee067e7cf975a6ac8da (MD5)
Previous issue date: 2007
en
dc.description.tableofcontentsAcknowledgement……i
Abstract (Chinese)……ii
Abstract (English)……iii
Contents……iv
List of Figures……vi
List of Tables……viii
Chapter 1 Introduction……1
Chapter 2 Preliminaries……5
Section 2.1 Tree Edit Distance……5
2.1.1 Definitions……6
2.1.2 Algorithms to Obtain Tree Edit Distance……9
2.1.3 Edit Distance between Unlabeled Trees……9
Section 2.2 XML Stream Processing……11
2.2.1 XML and Tree Structures……12
2.2.2 Data Stream and Pseudorandom Sketching……13
2.2.3 A Tree Edit Distance Embedding Algorithm……15
Section 2.3 Staircase Join……20
2.3.1 Pre/Post Plane Encoding of XML Document Trees……20
2.3.2 Relational XML Algebra……23
Chapter 3 Modified Tree Edit Distance Embedding Algorithm……27
Section 3.1 Motif of Modification……27
Section 3.2 The First Modified Algorithm……28
Section 3.3 Upper and Lower Bounds……29
3.3.1 Effect from Insertion and Deletion……30
3.3.2 Effect from Subtree-Moving……34
3.3.3 Ambiguities……40
Section 3.4 The Second Modified Algorithm……43
3.4.1 Definition……43
3.4.2 Analysis of Error Probabilities……44
3.4.3 Comparison with the Original Algorithm……47
Chapter 4 Tree Edit Distance on Pre/Post Plane Encoding……48
Section 4.1 Effect from Insertion and Deletion……48
Section 4.2 Effect from Subtree-Swapping……50
Section 4.3 Analysis……52
Chapter 5 Combination of Modified Tree Edit Distance Embedding Algorithm and Pre/Post Plane Encoding……55
Chapter 6 Conclusions and Future Works……60
References……62
dc.language.isoen
dc.subject可擴展標記語言文件zh_TW
dc.subject無標記有序樹狀結構zh_TW
dc.subject串流zh_TW
dc.subject演算法zh_TW
dc.subject樹狀結構編輯距離zh_TW
dc.title樹狀結構編輯距離在可擴展標記語言文件上之分析研究zh_TW
dc.titleAnalysis of Tree Edit Distance on XML Dataen
dc.typeThesis
dc.date.schoolyear95-1
dc.description.degree碩士
dc.contributor.oralexamcommittee雷欽隆,王勝德,郭斯彥,王柏堯
dc.subject.keyword樹狀結構編輯距離,可擴展標記語言文件,演算法,串流,無標記有序樹狀結構,zh_TW
dc.subject.keywordTree Edit Distance,XML,Algorithm,Streaming,Unlabeled Ordered Trees,en
dc.relation.page64
dc.rights.note有償授權
dc.date.accepted2007-01-25
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept電機工程學研究所zh_TW
Appears in Collections:電機工程學系

Files in This Item:
File SizeFormat 
ntu-96-1.pdf
  Restricted Access
501.77 kBAdobe PDF
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved