Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 生命科學院
  3. 基因體與系統生物學學位學程
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/15348
Title: 利用深度學習建構跨細胞株模型預測增強子之細胞株特異性活性
Predicting cell type-specific enhancer activities by cross-cell type modeling with deep learning
Authors: Yi-An Tung
童翊安
Advisor: 陳倩瑜(Chien-Yu Chen)
Co-Advisor: 歐陽彥正(Yen-Jen Oyang)
Keyword: 機器學習,深度學習,增強子,基因調控網路,跨細胞株預測,增強子預測模型,
accuEnhancer,Machine learning,deep learning,enhancer,Gene regulatory network,cross-cell type prediction,
Publication Year : 2020
Degree: 博士
Abstract: 增強子是一類重要的調節元件,過去許多研究中已顯示出增強子是輔助啟動子調節細胞基因表達的關鍵角色。目前,人類基因體中,增強子的數目及其在不同細胞中的活性,仍存在有許多未知。在過往的研究發現,增強子的活性與一些功能性的資料相關,例如:組蛋白修飾、序列特徵以及染色質的結構與開合程度等等。在本論文中,我們主要利用DNase以及其他組蛋白修飾數據建立了深度學習模型,並且以H3K27ac峰值作為所選細胞類型中的增強子位置進行訓練與預測。此外,本研究還設計了結合多種細胞類型的聯合訓練(Joint training),用以提高模型預測性能。透過我們所提出的深度學習模型accuEnhancer,我們展示了利用完整特徵資料集以及深度學習於預測單一種類細胞株內增強子活性的可行性,其準確性和F1可以達到0.97和0.9。為了更進一步進行跨細胞株的增強子活性預測,本論文提出通過整合來自不同細胞類型的數據來提高跨細胞類型預測的性能。隨著結合來自不同細胞類型的更多訓練數據,預測獨立細胞類型的F1從0.3上升至0.80。結果表明,通過合併更多的跨細胞類型的數據集,深度學習模型可以捕獲複雜的調控模式並提供更好的性能。最後,本研究測試了accuEnhancer模型在預測VISTA實驗驗證的增強子數據庫的有效性。結果顯示accuEnhancer在預測經過實驗驗證的增強子能勝過前人的其他方法,本論文因此探討了跨細胞株,乃至於跨物種預測的可行性。
Enhancers are one class of the regulatory elements that have been shown to act as key components to assist promoters in modulating the gene expression in living cells. At present, the number of enhancers in the human genome as well as their activities in different cell types are still largely unknown. Previous studies have shown that enhancer activities are associated with some functional data, such as histone modification, sequence motifs, and chromatin accessibilities. This study utilized DNase data to build a deep learning model for predicting the H3K27ac peaks as the active enhancers. Moreover, this thesis proposed joint training of multiple cell types to boost the model performance. The analyses conducted in this thesis first demonstrated the general feasibility of accuEnhancer to predict within-cell type enhancer activities, where the accuracy and the F1 score can achieve 0.97 and 0.9, respectively.
To further predict cell type-specific enhancers by cross-cell type modeling, we integrated the training data from different cell types to boost the model performance. The F1 score increased from 0.3 to 0.80 as the model combined more training data from different cell types. The results demonstrated that by incorporating more datasets across cell types, the complex regulatory patterns could be captured by the deep neural networks to deliver better performances. Lastly, this study tested the effectiveness of the model on predicting experimentally validated enhancers in the VISTA database. The results indicated that accuEnhancer outperforms the previous works in predicting cell type-specific enhancer activities by cross-cell type modeling.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/15348
DOI: 10.6342/NTU202001200
Fulltext Rights: 未授權
Appears in Collections:基因體與系統生物學學位學程

Files in This Item:
File SizeFormat 
U0001-2906202023270600.pdf
  Restricted Access
12.39 MBAdobe PDF
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved