請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/78407| 標題: | 利用分程深度特徵提取預測分子生成熱 Range-Separated Deep Learning Feature Extraction for Heat of Formation of Molecules |
| 作者: | Ting-Wei Hsu 許鼎威 |
| 指導教授: | 李奕霈(Yi-Pei Li) |
| 關鍵字: | 增量理論,熱力學性質,機器學習,可加成性,圖像卷積神經網絡(graph convolutional neural networks, GCNNs),泛化能力, increment theory,molecular property,machine learning,additivity,graph convolutional neural networks(GCNNs),formation enthalpy, |
| 出版年 : | 2021 |
| 學位: | 碩士 |
| 摘要: | 增量理論(Increment theory)基於分子中存在的片段來估算其熱力學性質,並且藉由能進行線性加成的特性來預測難以從實驗或理論計算中得出的大分子的熱力學性質,然而增量理論的應用僅限於欲分析的分子有包含預先定義的原子團或成分,如果分子中存在新的化學片段,則需要定義新的原子團並收集新的數據以定義出這些新的化學單元對整體分子性質的貢獻,為了解決這個問題,我們引入了一種機器學習方法,該方法是利用圖像卷積神經網絡(GCNNs)並基於包含原子周圍的局部化學環境來學習每個原子的貢獻,並對每個原子貢獻求和以得出分子的總體性質,由於此方法遵循增量理論的可加成性,因此可以輕鬆地將其推廣到訓練數據中不存在的大分子。除此之外,我們還引入了另一種機器學習架構,該架構可以分別學習局部和全局的分子結構信息使得我們不僅可以得到每個原子的特性,還可以確定整個分子結構對分子性質的貢獻。經過一系列的探討、研究和比較,我們所建構的改良圖像卷積神經網絡在預測熱力學性質的準確度上可以勝過傳統的增量理論。相對於其他現行已開發的熱力學參數預測模型,我們所提出的架構可以提供更精準的預測效能,即使在進行大分子的性質預測或泛化能力檢測時,也可以提供高準確性的熱力學參數評估。 The increment theory estimates the property of a species based on the fragments presented in a molecule and can naturally be scaled up to predict properties of large molecules that are difficult to derive from experiments or ab initio calculations. However, the application of increment theory is limited to the species containing the pre-defined groups or components. If new chemical fragments exist in the molecules, it is required to define new groups and to collect new data to determine the contribution of these new chemical units. To address this issue, we introduce a machine learning approach that can learn the contribution of each atom based on the local environment that encompassing the atom using graph convolutional neural networks (GCNNs), and sum over the atomic contributions to derive the property of the molecule. Because this approach follows the additivity scheme of increment theory, it can be easily generalized to larger molecules that are not present in the training data. In addition, we also introduce a machine learning architecture that can separately learn local and global structural information, and thus can determine not only the properties of each individual atom but also the contribution of the overall molecular structure. We examined the performance of the proposed models on formation enthalpy predictions and found that these models can outperform the conventional increment theory. In addition, they also perform competitively with previously reported machine learning-based thermochemistry estimator, and can achieve a higher accuracy when tested on out-of-domain test sets containing molecules that are significantly larger than those in the training set. |
| URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/78407 |
| DOI: | 10.6342/NTU202100230 |
| 全文授權: | 有償授權 |
| 電子全文公開日期: | 2026-01-29 |
| 顯示於系所單位: | 化學工程學系 |
文件中的檔案:
| 檔案 | 大小 | 格式 | |
|---|---|---|---|
| U0001-2801202115285400.pdf 未授權公開取用 | 1.98 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。
