Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 電子工程學研究所
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101036
Full metadata record
???org.dspace.app.webui.jsptag.ItemTag.dcfield???ValueLanguage
dc.contributor.advisor盧奕璋zh_TW
dc.contributor.advisorYi-Chang Luen
dc.contributor.author林家弘zh_TW
dc.contributor.authorChia-Hung Linen
dc.date.accessioned2025-11-26T16:33:36Z-
dc.date.available2025-11-27-
dc.date.copyright2025-11-26-
dc.date.issued2025-
dc.date.submitted2025-09-03-
dc.identifier.citation[1] S.B.NeedlemanandC.D.Wunsch.“Ageneralmethodapplicabletothesearch for similarities in the amino acid sequence of two proteins”. In: Journal of molecular biology 48.3 (1970), pp. 443–453.
[2] T. F. Smith, M. S. Waterman, et al. “Identification of common molecular subsequences”. In: Journal of molecular biology 147.1 (1981), pp. 195–197.
[3] S.Marco-Sola, J. C. Moure, M.Moreto,andA.Espinosa. “Fast gap-affine pairwise alignment using the wavefront algorithm”. In: Bioinformatics 37.4 (Sept. 2020), pp. 456–463. ISSN: 1367-4803. DOI: 10.1093/bioinformatics/btaa777.
[4] A.M.Wenger, P. Peluso, W. J. Rowell, P.-C. Chang, R. J. Hall, G. T. Concepcion, J. Ebler, A. Fungtammasan, A. Kolesnikov, N. D. Olson, A. Töpfer, M. Alonge, M. Mahmoud, Y. Qian, C. Chin, A. M. Phillippy, M. C. Schatz, E. W. Myers, M. A. DePristo, J. Ruan, T. Marschall, F. Sedlazeck, J. M. Zook, H. Li, S. Koren, A. Carroll, D. R. Rank, and M. W. Hunkapiller. “Accurate circular consensus longread sequencing improves variant detection and assembly of a human genome”. In: Nature Biotechnology 37.10 (2019), pp. 1155–1162. DOI: 10.1038/s41587-0190217-9.
[5] Q. Fan, X. Zhao, J. Li, et al. “De novo non-canonical nanopore basecalling enables private communication using heavily-modified DNA data at single-molecule level”. In: Nature Communications 16 (2025), p. 4099. DOI: 10.1038/s41467-02559357-2.
[6] D. R. Bentley, S. Balasubramanian, H. P. Swerdlow, G. P. Smith, J. Milton, C. G. Brown, K. P. Hall, D. J. Evers, C. L. Barnes, H. R. Bignell, and et al. “Accurate whole human genome sequencing using reversible terminator chemistry”. In: Nature 456.7218 (Nov. 2008), pp. 53–59. DOI: 10.1038/nature07517.
[7] C.Chin,D.H.Alexander,P.Marks,A.A.Klammer,J.Drake,C.Heiner,C.Alkan, et al. “Nonhybrid, finished microbial genome assemblies from long read SMRT sequencing data”. In: Nature Methods 10.6 (2013), pp. 563–569. DOI: 10.1038/ nmeth.2474.
[8] S.Marco-Sola,J.M.Eizenga,A.Guarracino,B.Paten,E.Garrison,andM.Moreto.“Optimal gap-affine alignment in O(s) space”. In: Bioinformatics 39.2 (Feb. 2023), btad074. ISSN: 1367-4811. DOI: 10.1093/bioinformatics/btad074.
[9] Xilinx.DMA/BridgeSubsystemforPCIExpressv4.1.ProductGuidePG195.https: //www.xilinx.com/support/documentation/ip_documentation/xdma/v4_1/ pg195-pcie-dma.pdf. 2022.
[10] Y.Ono,M.Hamada,andK.Asai.“PBSIM3:asimulatorforalltypesofPacBioand ONTlongreads”. In: NAR Genomics and Bioinformatics 4.4 (Dec. 2022), lqac092. ISSN: 2631-9268. DOI: 10.1093/nargab/lqac092.
[11] S. Walia, C. Ye, A. Bera, D. Lodhavia, and Y. Turakhia. “TALCO: Tiling Genome Sequence Alignment Using Convergence of Traceback Pointers”. In: 2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA). 2024, pp. 91–107. DOI: 10.1109/HPCA57654.2024.00044.
[12] Y. Turakhia, S. D. Goenka, G. Bejerano, and W. J. Dally. “Darwin-WGA: A Coprocessor Provides Increased Sensitivity in Whole Genome Alignments with High Speedup”. In: 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA). 2019,pp.359–372.DOI:10.1109/HPCA.2019.00050.
-
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101036-
dc.description.abstract隨著基因檢測技術的進步,基因序列比對的長度經常達到10Kbp以上,甚至可達100Kbp。傳統動態規劃演算法在時間與空間複雜度為O(mn),使得比對長度增加時資源需求急遽上升,難以負荷現今的運算需求。本研究基於高記憶體效率的雙向波前比對演算法,將其結合本論文提出的內容感知平鋪技術,設計並實現一款硬體加速器。本加速器主要包含兩個階段:序列比對與序列回溯。
本研究提出的平鋪技術將長序列切分為512×512大小的區塊,並採用內容感知方式將區塊銜接,大幅降低硬體資源需求。實驗結果顯示,即使在基因序列錯誤率高達30%的高變異區域,平鋪準確率仍可達99.9%,且本平舖技術最高能夠減少78%的記憶體用量。而在每個512×512 的區塊內,我們用128個運算元件實現了256倍平行化的雙向波前比對演算法,並透過動態平行指派運算元件,並行處理演算法切分出的各子區域,以提升運算效能。序列回溯階段則採用雙向路徑回溯策略,進一步加速回溯過程。
本研究的主要貢獻在於提出一種適用於波前比對演算法的內容感知平鋪技術,並成功在現場可程式化邏輯閘陣列(FPGA)與特定應用積體電路(ASIC)上實現長讀序列比對硬體加速器。該平台輸入參考基因序列與待比對序列後,能高效輸出相對應的比對結果。與雙向波前比對演算法運行於AMD Ryzen 9 7950X(32緒CPU)上相比,FPGA與ASIC 實作分別可實現最高324倍與1181倍的效能加速;與其他硬體加速器相比,本研究的ASIC版本加速器使用最小的面積取得最快的運算速度。
zh_TW
dc.description.abstractWith the advancement of genome sequencing technologies, the length of sequence alignment has increased significantly, often exceeding 10 kbp and even reaching 100 kbp. Traditional dynamic programming algorithms, with a time and space complexity of O(mn), impose exponentially increasing resource demands as the equence length grows, making them unsuitable for modern computing needs. In this work, we present a hardware accelerator based on the memory-efficient Bidirectional Wavefront Alignment (BiWFA) algorithm, enhanced with a content-aware tiling technique to further extend its capability to handle longer genomic sequences. The accelerator consists of two major stages: sequence alignment and traceback.
The proposed tiling technique partitions long sequences into tiles of 512bp × 512bp each and connects them using a content-aware scheme, substantially reducing hardware resource consumption. Experimental results show that even in high-variant regions with sequence error rates as high as 30%, the tiling accuracy remains 99.9%. This tiling technique effectively reduces memory footprint by up to 78% .Within each 512×512 tile, we implement a 256-way parallel BiWFA using 128 processing elements (PE) and dynamically allocate PEs to concurrently process sub-regions within the tile, significantly improving performance. The traceback stage adopts a bidirectional traceback strategy to further accelerate the process.
The primary contribution of this work lies in the content-aware tiling technique tailored for wavefront-based alignment algorithms, along with the implementation of a long-read sequence alignment accelerator on both FPGA and ASIC platforms. Given a reference and a query sequence as input, the accelerator efficiently outputs the corresponding alignment results. Compared to a software BiWFA implementation running on a 32-thread AMD Ryzen 9 7950X CPU, our FPGA and ASIC implementations achieve up to 324× and 1181× speedups, respectively. Compared with the other hardware accelerators, the ASIC implementation proposed in this work achieves the highest processing speed while occupying the smallest silicon area.
en
dc.description.provenanceSubmitted by admin ntu (admin@lib.ntu.edu.tw) on 2025-11-26T16:33:36Z
No. of bitstreams: 0
en
dc.description.provenanceMade available in DSpace on 2025-11-26T16:33:36Z (GMT). No. of bitstreams: 0en
dc.description.tableofcontents口試委員會審定書 iii
誌謝 v
摘要 vii
Abstract ix
Table of Contents xi
List of Figures xiii
List of Tables xvi
1緒論 1
1.1論文貢獻. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2論文架構. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2背景知識 5
2.1序列比對. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1.1計分方式. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.1.2編碼. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2動態規劃. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.3波前比對演算法. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.3.1動態規劃. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.3.2輸出格式. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.3.3序列回溯. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3演算法 15
3.1雙向波前比對演算法. . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.1.1波前比對序列回溯. . . . . . . . . . . . . . . . . . . . . . . . 22
3.2內容感知平舖技術. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.2.1端對邊界全域比對. . . . . . . . . . . . . . . . . . . . . . . . 25
3.2.2內容感知. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.3 BWFA-CAT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
4硬體設計 33
4.1硬體系統概觀. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
4.2平鋪控制器. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
4.2.1 XDMAAXI-stream介面. . . . . . . . . . . . . . . . . . . . . 35
4.2.2區塊控制. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
4.3序列比對處理器. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
4.3.1運算單元. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
4.3.2運算單元控制器. . . . . . . . . . . . . . . . . . . . . . . . . . 42
4.3.3序列回溯模組. . . . . . . . . . . . . . . . . . . . . . . . . . . 45
5實驗結果 49
5.1實驗資料與參數. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
5.1.1實驗資料. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
5.1.2實驗參數. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
5.2實驗硬體. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
5.2.1硬體實作. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
5.3實驗結果. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
5.3.1內容感知平鋪技術. . . . . . . . . . . . . . . . . . . . . . . . 53
5.3.2吞吐量計算. . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
6結論與展望 61
6.1結論. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
6.2未來展望. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
A Chapter 63
References 65
-
dc.language.isozh_TW-
dc.subject序列比對-
dc.subject序列回溯-
dc.subject內容感知重疊平鋪-
dc.subject動態平行指派運算元件-
dc.subject硬體加速器-
dc.subjectsequence alignment-
dc.subjectsequence traceback-
dc.subjectcontent-aware tiling-
dc.subjectdynamic PE arrangement-
dc.subjecthardware accelerator-
dc.title具內容感知平舖技術之雙向波前比對演算法與其硬體加速器zh_TW
dc.titleBWFA-CAT:Bidirectional Wavefront Sequence Alignment with Content-Aware Tiling and Its Hardware Acceleratoren
dc.typeThesis-
dc.date.schoolyear114-1-
dc.description.degree碩士-
dc.contributor.oralexamcommittee劉宗德;陳和麟zh_TW
dc.contributor.oralexamcommitteeTsung-Te Liu;Ho-Lin Chenen
dc.subject.keyword序列比對,序列回溯內容感知重疊平鋪動態平行指派運算元件硬體加速器zh_TW
dc.subject.keywordsequence alignment,sequence tracebackcontent-aware tilingdynamic PE arrangementhardware acceleratoren
dc.relation.page66-
dc.identifier.doi10.6342/NTU202503534-
dc.rights.note同意授權(全球公開)-
dc.date.accepted2025-09-03-
dc.contributor.author-college電機資訊學院-
dc.contributor.author-dept電子工程學研究所-
dc.date.embargo-lift2025-11-27-
Appears in Collections:電子工程學研究所

Files in This Item:
File SizeFormat 
ntu-114-1.pdf18.68 MBAdobe PDFView/Open
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved