請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/64074
標題: | 圖形資料庫中封閉性頻繁圖形序列樣式之資料探勘 Mining Closed Frequent Sequential Graph Patterns in Graph Databases |
作者: | Chuan-Yuan Lin 林傳淵 |
指導教授: | 李瑞庭 |
關鍵字: | 動態網路,圖形序列樣式,資料探勘, dynamic networks,sequential graph patterns,data mining, |
出版年 : | 2012 |
學位: | 碩士 |
摘要: | 在社群網站中,人們在某段時間中的互動關係可以用一個圖形來表示,其中,每個點代表一個人,兩點之間的邊代表人與人之間的互動關係。然而,點和邊會因為時間變化而有所不同,而這種會隨著時間改變的網路稱作動態網路。從動態網路中探勘頻繁樣式,可以幫助我們分析社群網站中的互動行為。因此,在這篇論文中,我們提出一個有效率的探勘演算法叫「SGP-Mine」,以找出在圖形資料庫中封閉性頻繁圖形序列樣式。SGP-Mine演算法主要可分成兩個階段。第一階段,我們先找出所有長度為一的頻繁樣式。在第二階段,我們以深度優先搜尋的方式遞迴產生所有的頻繁樣式。在產生的過程中,我們利用修剪策略刪除不必要的候選樣式,並用封閉性檢查移除非封閉性的樣式。因此,SGP-Mine能有效率地從圖形資料庫中找出封閉性頻繁圖形序列樣式。實驗結果顯示,不論在合成或真實資料庫中,我們所提出的方法皆優於改良式的Apriori演算法。 In social networks, a snapshot of interactions among users can be modeled by a graph, where a vertex stands for a user and an edge denotes an interaction between two users. The users and interactions may change over time and form a dynamic network. Mining sequential graph patterns from a dynamic network can help us analyze interaction behaviors on social networks. Therefore, in this thesis, we propose a novel algorithm, SGP-Mine (Sequential Graph Pattern-Mine), to mine the closed frequent sequential graph patterns in graph databases. The proposed algorithm consists of two phases. First, we use gSpan to find all frequent SGPatterns of length one (1-SGPatterns) from the database and record the occurrences for each 1-SGPattern found. Second, we recursively generate frequent SGPatterns in a DFS manner until no more frequent SGPatterns can be found. During the mining process, we employ three pruning strategies to prune unnecessary candidates and a closure checking scheme to eliminate non-closed patterns. Therefore, the proposed method can efficiently mine closed frequent sequential graph patterns in a graph database. The experiment results show the SGP-Mine algorithm outperforms the modified Apriori algorithm. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/64074 |
全文授權: | 有償授權 |
顯示於系所單位: | 資訊管理學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-101-1.pdf 目前未授權公開取用 | 578.36 kB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。