  1. NTU Theses and Dissertations Repository
  2. College of Electrical Engineering and Computer Science
  3. Graduate Institute of Electronics Engineering
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/89129
Title: 去中心化前沿佇列提升廣度優先搜尋在圖形處理器上的可擴展性
Improving the Scalability of Breadth-First Search on GPUs via Frontier Queue Decentralization
Authors: 鄭博修
Po-Hsiu Cheng
Advisor: 郭斯彥
Sy-Yen Kuo
Keyword: GPU, parallel computing, breadth-first search
Publication Year: 2023
Degree: Master's
Abstract: Graphs are a common data structure, widely used in navigation, speech recognition, and recommendation systems. Breadth-First Search (BFS) is a fundamental graph traversal algorithm and plays a crucial role in obtaining various graph properties. The Graphics Processing Unit (GPU) is a commonly used hardware accelerator with remarkable computing power and storage capacity. Many BFS algorithms, such as the Parallel BFS (PBFS) algorithm, have been ported to GPUs to improve performance.
This study proposes an approach to improving the scalability of the traditional PBFS algorithm. It combines four designs: Decentralized Frontier Queue, Real-time Queue Draining, Two-level Neighbor Visiting, and Atomic Status Array Scanning. These mechanisms alleviate contention on GPUs, reduce memory consumption, and address load imbalance, achieving competitive execution speed while improving the scalability of the PBFS algorithm on GPUs. This thesis presents the design and evaluation of the approach, along with directions for future work.
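For orientation only, the general idea of a frontier queue in level-synchronous BFS can be sketched as below. The abstract does not describe the thesis's implementation, so this is not the author's method: it is a sequential Python sketch, assuming a simple scheme in which each simulated worker appends newly discovered vertices to its own local queue rather than to one shared queue. The names `bfs_levels`, `adj`, and `num_workers` are hypothetical.

```python
def bfs_levels(adj, source, num_workers=4):
    """Level-synchronous BFS that returns {vertex: depth}.

    adj         -- dict mapping each vertex to an iterable of neighbours
    num_workers -- hypothetical number of simulated workers, each of
                   which owns a private frontier queue (an assumption,
                   not a detail taken from the thesis)
    """
    depth = {source: 0}
    frontier = [source]
    level = 0
    while frontier:
        level += 1
        # One private queue per simulated worker instead of a single
        # shared queue that every worker would have to contend for.
        local_queues = [[] for _ in range(num_workers)]
        for i, u in enumerate(frontier):
            worker = i % num_workers  # round-robin assignment of vertices
            for v in adj.get(u, ()):
                # On a GPU this test-and-mark step would need an atomic
                # operation; here the loop is sequential, so a plain
                # membership check is enough.
                if v not in depth:
                    depth[v] = level
                    local_queues[worker].append(v)
        # The next frontier is the concatenation of all local queues.
        frontier = [v for queue in local_queues for v in queue]
    return depth
```

Calling `bfs_levels({0: [1, 2], 1: [3], 2: [3], 3: [4], 4: []}, 0)` assigns depth 0 to the source, depth 1 to vertices 1 and 2, depth 2 to vertex 3, and depth 3 to vertex 4. The result does not depend on `num_workers`, since the local queues only change where discovered vertices are buffered, not which vertices are discovered.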
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/89129
DOI: 10.6342/NTU202302373
Fulltext Rights: Not authorized
Appears in Collections: Graduate Institute of Electronics Engineering

Files in This Item:
File: ntu-111-2.pdf (Restricted Access)
Size: 926.58 kB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
