請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101036完整後設資料紀錄
| DC 欄位 | 值 | 語言 |
|---|---|---|
| dc.contributor.advisor | 盧奕璋 | zh_TW |
| dc.contributor.advisor | Yi-Chang Lu | en |
| dc.contributor.author | 林家弘 | zh_TW |
| dc.contributor.author | Chia-Hung Lin | en |
| dc.date.accessioned | 2025-11-26T16:33:36Z | - |
| dc.date.available | 2025-11-27 | - |
| dc.date.copyright | 2025-11-26 | - |
| dc.date.issued | 2025 | - |
| dc.date.submitted | 2025-09-03 | - |
| dc.identifier.citation | [1] S.B.NeedlemanandC.D.Wunsch.“Ageneralmethodapplicabletothesearch for similarities in the amino acid sequence of two proteins”. In: Journal of molecular biology 48.3 (1970), pp. 443–453.
[2] T. F. Smith, M. S. Waterman, et al. “Identification of common molecular subsequences”. In: Journal of molecular biology 147.1 (1981), pp. 195–197. [3] S.Marco-Sola, J. C. Moure, M.Moreto,andA.Espinosa. “Fast gap-affine pairwise alignment using the wavefront algorithm”. In: Bioinformatics 37.4 (Sept. 2020), pp. 456–463. ISSN: 1367-4803. DOI: 10.1093/bioinformatics/btaa777. [4] A.M.Wenger, P. Peluso, W. J. Rowell, P.-C. Chang, R. J. Hall, G. T. Concepcion, J. Ebler, A. Fungtammasan, A. Kolesnikov, N. D. Olson, A. Töpfer, M. Alonge, M. Mahmoud, Y. Qian, C. Chin, A. M. Phillippy, M. C. Schatz, E. W. Myers, M. A. DePristo, J. Ruan, T. Marschall, F. Sedlazeck, J. M. Zook, H. Li, S. Koren, A. Carroll, D. R. Rank, and M. W. Hunkapiller. “Accurate circular consensus longread sequencing improves variant detection and assembly of a human genome”. In: Nature Biotechnology 37.10 (2019), pp. 1155–1162. DOI: 10.1038/s41587-0190217-9. [5] Q. Fan, X. Zhao, J. Li, et al. “De novo non-canonical nanopore basecalling enables private communication using heavily-modified DNA data at single-molecule level”. In: Nature Communications 16 (2025), p. 4099. DOI: 10.1038/s41467-02559357-2. [6] D. R. Bentley, S. Balasubramanian, H. P. Swerdlow, G. P. Smith, J. Milton, C. G. Brown, K. P. Hall, D. J. Evers, C. L. Barnes, H. R. Bignell, and et al. “Accurate whole human genome sequencing using reversible terminator chemistry”. In: Nature 456.7218 (Nov. 2008), pp. 53–59. DOI: 10.1038/nature07517. [7] C.Chin,D.H.Alexander,P.Marks,A.A.Klammer,J.Drake,C.Heiner,C.Alkan, et al. “Nonhybrid, finished microbial genome assemblies from long read SMRT sequencing data”. In: Nature Methods 10.6 (2013), pp. 563–569. DOI: 10.1038/ nmeth.2474. [8] S.Marco-Sola,J.M.Eizenga,A.Guarracino,B.Paten,E.Garrison,andM.Moreto.“Optimal gap-affine alignment in O(s) space”. In: Bioinformatics 39.2 (Feb. 2023), btad074. ISSN: 1367-4811. DOI: 10.1093/bioinformatics/btad074. [9] Xilinx.DMA/BridgeSubsystemforPCIExpressv4.1.ProductGuidePG195.https: //www.xilinx.com/support/documentation/ip_documentation/xdma/v4_1/ pg195-pcie-dma.pdf. 2022. [10] Y.Ono,M.Hamada,andK.Asai.“PBSIM3:asimulatorforalltypesofPacBioand ONTlongreads”. In: NAR Genomics and Bioinformatics 4.4 (Dec. 2022), lqac092. ISSN: 2631-9268. DOI: 10.1093/nargab/lqac092. [11] S. Walia, C. Ye, A. Bera, D. Lodhavia, and Y. Turakhia. “TALCO: Tiling Genome Sequence Alignment Using Convergence of Traceback Pointers”. In: 2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA). 2024, pp. 91–107. DOI: 10.1109/HPCA57654.2024.00044. [12] Y. Turakhia, S. D. Goenka, G. Bejerano, and W. J. Dally. “Darwin-WGA: A Coprocessor Provides Increased Sensitivity in Whole Genome Alignments with High Speedup”. In: 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA). 2019,pp.359–372.DOI:10.1109/HPCA.2019.00050. | - |
| dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101036 | - |
| dc.description.abstract | 隨著基因檢測技術的進步,基因序列比對的長度經常達到10Kbp以上,甚至可達100Kbp。傳統動態規劃演算法在時間與空間複雜度為O(mn),使得比對長度增加時資源需求急遽上升,難以負荷現今的運算需求。本研究基於高記憶體效率的雙向波前比對演算法,將其結合本論文提出的內容感知平鋪技術,設計並實現一款硬體加速器。本加速器主要包含兩個階段:序列比對與序列回溯。
本研究提出的平鋪技術將長序列切分為512×512大小的區塊,並採用內容感知方式將區塊銜接,大幅降低硬體資源需求。實驗結果顯示,即使在基因序列錯誤率高達30%的高變異區域,平鋪準確率仍可達99.9%,且本平舖技術最高能夠減少78%的記憶體用量。而在每個512×512 的區塊內,我們用128個運算元件實現了256倍平行化的雙向波前比對演算法,並透過動態平行指派運算元件,並行處理演算法切分出的各子區域,以提升運算效能。序列回溯階段則採用雙向路徑回溯策略,進一步加速回溯過程。 本研究的主要貢獻在於提出一種適用於波前比對演算法的內容感知平鋪技術,並成功在現場可程式化邏輯閘陣列(FPGA)與特定應用積體電路(ASIC)上實現長讀序列比對硬體加速器。該平台輸入參考基因序列與待比對序列後,能高效輸出相對應的比對結果。與雙向波前比對演算法運行於AMD Ryzen 9 7950X(32緒CPU)上相比,FPGA與ASIC 實作分別可實現最高324倍與1181倍的效能加速;與其他硬體加速器相比,本研究的ASIC版本加速器使用最小的面積取得最快的運算速度。 | zh_TW |
| dc.description.abstract | With the advancement of genome sequencing technologies, the length of sequence alignment has increased significantly, often exceeding 10 kbp and even reaching 100 kbp. Traditional dynamic programming algorithms, with a time and space complexity of O(mn), impose exponentially increasing resource demands as the equence length grows, making them unsuitable for modern computing needs. In this work, we present a hardware accelerator based on the memory-efficient Bidirectional Wavefront Alignment (BiWFA) algorithm, enhanced with a content-aware tiling technique to further extend its capability to handle longer genomic sequences. The accelerator consists of two major stages: sequence alignment and traceback.
The proposed tiling technique partitions long sequences into tiles of 512bp × 512bp each and connects them using a content-aware scheme, substantially reducing hardware resource consumption. Experimental results show that even in high-variant regions with sequence error rates as high as 30%, the tiling accuracy remains 99.9%. This tiling technique effectively reduces memory footprint by up to 78% .Within each 512×512 tile, we implement a 256-way parallel BiWFA using 128 processing elements (PE) and dynamically allocate PEs to concurrently process sub-regions within the tile, significantly improving performance. The traceback stage adopts a bidirectional traceback strategy to further accelerate the process. The primary contribution of this work lies in the content-aware tiling technique tailored for wavefront-based alignment algorithms, along with the implementation of a long-read sequence alignment accelerator on both FPGA and ASIC platforms. Given a reference and a query sequence as input, the accelerator efficiently outputs the corresponding alignment results. Compared to a software BiWFA implementation running on a 32-thread AMD Ryzen 9 7950X CPU, our FPGA and ASIC implementations achieve up to 324× and 1181× speedups, respectively. Compared with the other hardware accelerators, the ASIC implementation proposed in this work achieves the highest processing speed while occupying the smallest silicon area. | en |
| dc.description.provenance | Submitted by admin ntu (admin@lib.ntu.edu.tw) on 2025-11-26T16:33:36Z No. of bitstreams: 0 | en |
| dc.description.provenance | Made available in DSpace on 2025-11-26T16:33:36Z (GMT). No. of bitstreams: 0 | en |
| dc.description.tableofcontents | 口試委員會審定書 iii
誌謝 v 摘要 vii Abstract ix Table of Contents xi List of Figures xiii List of Tables xvi 1緒論 1 1.1論文貢獻. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2論文架構. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 2背景知識 5 2.1序列比對. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.1.1計分方式. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2.1.2編碼. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.2動態規劃. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.3波前比對演算法. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 2.3.1動態規劃. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.2輸出格式. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 2.3.3序列回溯. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 3演算法 15 3.1雙向波前比對演算法. . . . . . . . . . . . . . . . . . . . . . . . . . . 15 3.1.1波前比對序列回溯. . . . . . . . . . . . . . . . . . . . . . . . 22 3.2內容感知平舖技術. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 3.2.1端對邊界全域比對. . . . . . . . . . . . . . . . . . . . . . . . 25 3.2.2內容感知. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 3.3 BWFA-CAT. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 4硬體設計 33 4.1硬體系統概觀. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 4.2平鋪控制器. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 4.2.1 XDMAAXI-stream介面. . . . . . . . . . . . . . . . . . . . . 35 4.2.2區塊控制. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36 4.3序列比對處理器. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 4.3.1運算單元. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 4.3.2運算單元控制器. . . . . . . . . . . . . . . . . . . . . . . . . . 42 4.3.3序列回溯模組. . . . . . . . . . . . . . . . . . . . . . . . . . . 45 5實驗結果 49 5.1實驗資料與參數. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 5.1.1實驗資料. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 5.1.2實驗參數. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 5.2實驗硬體. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 5.2.1硬體實作. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 5.3實驗結果. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 5.3.1內容感知平鋪技術. . . . . . . . . . . . . . . . . . . . . . . . 53 5.3.2吞吐量計算. . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 6結論與展望 61 6.1結論. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 6.2未來展望. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62 A Chapter 63 References 65 | - |
| dc.language.iso | zh_TW | - |
| dc.subject | 序列比對 | - |
| dc.subject | 序列回溯 | - |
| dc.subject | 內容感知重疊平鋪 | - |
| dc.subject | 動態平行指派運算元件 | - |
| dc.subject | 硬體加速器 | - |
| dc.subject | sequence alignment | - |
| dc.subject | sequence traceback | - |
| dc.subject | content-aware tiling | - |
| dc.subject | dynamic PE arrangement | - |
| dc.subject | hardware accelerator | - |
| dc.title | 具內容感知平舖技術之雙向波前比對演算法與其硬體加速器 | zh_TW |
| dc.title | BWFA-CAT:Bidirectional Wavefront Sequence Alignment with Content-Aware Tiling and Its Hardware Accelerator | en |
| dc.type | Thesis | - |
| dc.date.schoolyear | 114-1 | - |
| dc.description.degree | 碩士 | - |
| dc.contributor.oralexamcommittee | 劉宗德;陳和麟 | zh_TW |
| dc.contributor.oralexamcommittee | Tsung-Te Liu;Ho-Lin Chen | en |
| dc.subject.keyword | 序列比對,序列回溯內容感知重疊平鋪動態平行指派運算元件硬體加速器 | zh_TW |
| dc.subject.keyword | sequence alignment,sequence tracebackcontent-aware tilingdynamic PE arrangementhardware accelerator | en |
| dc.relation.page | 66 | - |
| dc.identifier.doi | 10.6342/NTU202503534 | - |
| dc.rights.note | 同意授權(全球公開) | - |
| dc.date.accepted | 2025-09-03 | - |
| dc.contributor.author-college | 電機資訊學院 | - |
| dc.contributor.author-dept | 電子工程學研究所 | - |
| dc.date.embargo-lift | 2025-11-27 | - |
| 顯示於系所單位: | 電子工程學研究所 | |
文件中的檔案:
| 檔案 | 大小 | 格式 | |
|---|---|---|---|
| ntu-114-1.pdf | 18.68 MB | Adobe PDF | 檢視/開啟 |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。
