  1. NTU Theses and Dissertations Repository
  2. College of Electrical Engineering and Computer Science
  3. Graduate Institute of Electronics Engineering
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/20128
Title: Acceleration of Variant Discovery Tool in GATK (GATK變異尋找工具的硬體加速)
Authors: Zi-Yuan Lin (林子源)
Advisor: Sao-Jie Chen (陳少傑)
Keywords: Genome Sequencing, GATK (Genome Analysis Tool-Kit), DNA Variant Calling
Publication Year: 2018
Degree: Master's
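Abstract: This thesis proposes a hybrid software and hardware acceleration approach that improves the Variant Discovery workflow of GATK (Genome Analysis Tool-Kit), an application for DNA sequencing analysis. In recent years, progress in biomedical research and the invention of Next Generation Sequencing have brought substantial breakthroughs in sequencing technology. Among current sequencing analysis software, the GATK toolkit developed by the Broad Institute of MIT and Harvard is one of the best known and is widely used in biological and medical research. However, software of this kind still has many shortcomings: its performance is limited by its software development environment, the algorithms behind some functions are inefficient, and its memory requirements are high. An alternative implementation of GATK is therefore urgently needed to address these problems.
In this thesis, we redesign the variant discovery workflow of GATK using a software language (C++) and a hardware description language (Verilog HDL). The redesign simplifies the algorithms in the workflow to reduce computational complexity and uses a parallel hardware architecture for acceleration. The hardware design is verified on a Field Programmable Gate Array (FPGA). In hardware and software simulation, the design currently achieves a speed-up of about 6.2 times over the GATK software while using less than 10% of the original memory.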
This work presents a digital hardware design to accelerate HaplotypeCaller, a tool in the Variant Discovery phase of the Genome Analysis Tool-Kit (GATK) [1], a software package for analyzing genetic sequencing data.
Progress in the biomedical field and the emergence of Next Generation Sequencing (NGS) [2] have brought a breakthrough in DNA sequencing throughput, and many software tools have been developed for DNA sequencing analysis. This thesis focuses on GATK, a well-known Java-based command-line toolkit used by many biomedical scientists. However, tools of this kind suffer from low performance caused by their software development environment, and some of their algorithms may not work well in certain special cases. A new design using another language and platform is therefore needed for further clinical analysis and research.
In our work, we redesign HaplotypeCaller, the most important tool in the Variant Discovery phase of GATK. The work uses a software and hardware co-design environment of C++ and Verilog, with the hardware part implemented on an FPGA. Overall, our co-design platform achieves a speed-up of 6.2 times over the GATK software.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/20128
DOI: 10.6342/NTU201800688
Fulltext Rights: Not authorized
Appears in Collections: Graduate Institute of Electronics Engineering

Files in This Item:
File: ntu-107-1.pdf (Restricted Access)
Size: 3.37 MB
Format: Adobe PDF