Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
    • 指導教授
  • 搜尋 TDR
  • 授權 Q&A
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 工學院
  3. 化學工程學系
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/78407
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor李奕霈(Yi-Pei Li)
dc.contributor.authorTing-Wei Hsuen
dc.contributor.author許鼎威zh_TW
dc.date.accessioned2021-07-11T14:55:22Z-
dc.date.available2026-01-29
dc.date.copyright2021-03-08
dc.date.issued2021
dc.date.submitted2021-01-29
dc.identifier.citation(1) Collobert, R.; Weston, J.; Bottou, L.; Karlen, M.; Kavukcuoglu, K.; Kuksa, P., Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res. 2011, 12, 2493-2537.
(2) Ker, J.; Wang, L. P.; Rao, J.; Lim, T., Deep Learning Applications in Medical Image Analysis. IEEE Access 2018, 6, 9375-9389.
(3) Deng, L.; Li, X., Machine Learning Paradigms for Speech Recognition: An Overview. IEEE Trans. Audio Speech Lang. Process. 2013, 21 (5), 1060-1089.
(4) Nguyen, T. T. T.; Armitage, G., A Survey of Techniques for Internet Traffic Classification using Machine Learning. IEEE Commun. Surv. Tutor. 2008, 10 (4), 56-76.
(5) Huck, N., Large data sets and machine learning: Applications to statistical arbitrage. Eur. J. Oper. Res. 2019, 278 (1), 330-342.
(6) Krauss, C.; Do, X. A.; Huck, N., Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S P 500. Eur. J. Oper. Res. 2017, 259 (2), 689-702.
(7) Foster, K. R.; Koprowski, R.; Skufca, J. D., Machine learning, medical diagnosis, and biomedical engineering research - commentary. Biomed. Eng. Online 2014, 13, 9.
(8) Butler, K. T.; Davies, D. W.; Cartwright, H.; Isayev, O.; Walsh, A., Machine learning for molecular and materials science. Nature 2018, 559 (7715), 547-555.
(9) Zhang, L.; Chen, H.; Tao, X.; Cai, H.; Liu, J.; Ouyang, Y.; Peng, Q.; Du, Y., Machine learning reveals the importance of the formation enthalpy and atom-size difference in forming phases of high entropy alloys. Materials Design 2020, 108835.
(10) Balabin, R. M.; Lomakina, E. I., Neural network approach to quantum-chemistry data: Accurate prediction of density functional theory energies. J. Chem. Phys. 2009, 131 (7), 8.
(11) Yang, G. Y.; Wu, J.; Chen, S. G.; Zhou, W. J.; Sun, J.; Chen, G. H., Size-independent neural networks based first-principles method for accurate prediction of heat of formation of fuels. J. Chem. Phys. 2018, 148 (24), 7.
(12) Hu, L. H.; Wang, X. J.; Wong, L. H.; Chen, G. H., Combined first-principles calculation and neural-network correction approach for heat of formation. J. Chem. Phys. 2003, 119 (22), 11501-11507.
(13) Suleimanov, Y. V.; Green, W. H., Automated Discovery of Elementary Chemical Reaction Steps Using Freezing String and Berny Optimization Methods. J. Chem. Theory Comput. 2015, 11 (9), 4248-4259.
(14) Khachatryan, L.; Burcat, A.; Dellinger, B., An elementary reaction-kinetic model for the gas-phase formation of 1,3,6,8-and 1,3,7,9-tetrachlorinated dibenzo-p-dioxins from 2,4,6-trichlorophenol. Combust. Flame 2003, 132 (3), 406-421.
(15) Green, W. H.; Barton, P. I.; Bhattacharjee, B.; Matheu, D. M.; Schwer, D. A.; Song, J.; Sumathi, R.; Carstensen, H. H.; Dean, A. M.; Grenda, J. M., Computer construction of detailed chemical kinetic models for gas-phase reactors. Ind. Eng. Chem. Res. 2001, 40 (23), 5362-5370.
(16) Benson, S. W.; Buss, J. H., ADDITIVITY RULES FOR THE ESTIMATION OF MOLECULAR PROPERTIES - THERMODYNAMIC PROPERTIES. J. Chem. Phys. 1958, 29 (3), 546-572.
(17) Constantinou, L.; Gani, R., NEW GROUP-CONTRIBUTION METHOD FOR ESTIMATING PROPERTIES OF PURE COMPOUNDS. Aiche J. 1994, 40 (10), 1697-1710.
(18) Joback, K. G.; Reid, R. C., ESTIMATION OF PURE-COMPONENT PROPERTIES FROM GROUP-CONTRIBUTIONS. Chem. Eng. Commun. 1987, 57 (1-6), 233-243.
(19) Benson, S. W.; Cruickshank, F.; Golden, D.; Haugen, G. R.; O'Neal, H. E.; Rodgers, A.; Shaw, R.; Walsh, R., Additivity rules for the estimation of thermochemical properties. Chemical Reviews 1969, 69 (3), 279-324.
(20) Zhao, Q. Y.; Savoie, B. M., Self-Consistent Component Increment Theory for Predicting Enthalpy of Formation. J. Chem Inf. Model. 2020, 60 (4), 2199-2207.
(21) Pedley, J. B., Thermochemical data of organic compounds. Springer Science Business Media: 2012.
(22) Cohen, N.; Benson, S. W., ESTIMATION OF HEATS OF FORMATION OF ORGANIC-COMPOUNDS BY ADDITIVITY METHODS. Chemical Reviews 1993, 93 (7), 2419-2438.
(23) Savoie, B. M.; Webb, M. A.; Miller III, T. F., Enhancing cation diffusion and suppressing anion diffusion via Lewis-acidic polymer electrolytes. The journal of physical chemistry letters 2017, 8 (3), 641-646.
(24) Han, K. H.; Jamal, A.; Grambow, C. A.; Buras, Z. J.; Green, W. H., An Extended Group Additivity Method for Polycyclic Thermochemistry Estimation. Int. J. Chem. Kinet. 2018, 50 (4), 294-303.
(25) Lay, T. H.; Yamada, T.; Tsai, P. L.; Bozzelli, J. W., Thermodynamic parameters and group additivity ring corrections for three- to six-membered oxygen heterocyclic hydrocarbons. J. Phys. Chem. A 1997, 101 (13), 2471-2477.
(26) Coley, C. W.; Barzilay, R.; Green, W. H.; Jaakkola, T. S.; Jensen, K. F., Convolutional Embedding of Attributed Molecular Graphs for Physical Property Prediction. J. Chem Inf. Model. 2017, 57 (8), 1757-1772.
(27) Duvenaud, D. K.; Maclaurin, D.; Iparraguirre, J.; Bombarell, R.; Hirzel, T.; Aspuru-Guzik, A.; Adams, R. P. In Convolutional networks on graphs for learning molecular fingerprints, Advances in neural information processing systems, 2015; pp 2224-2232.
(28) Kearnes, S.; McCloskey, K.; Berndl, M.; Pande, V.; Riley, P., Molecular graph convolutions: moving beyond fingerprints. J. Comput.-Aided Mol. Des. 2016, 30 (8), 595-608.
(29) Yang, K.; Swanson, K.; Jin, W. G.; Coley, C.; Eiden, P.; Gao, H.; Guzman-Perez, A.; Hopper, T.; Kelley, B.; Mathea, M.; Palmer, A.; Settels, V.; Jaakkola, T.; Jensen, K.; Barzilay, R., Analyzing Learned Molecular Representations for Property Prediction. J. Chem Inf. Model. 2019, 59 (8), 3370-3388.
(30) Wu, Z. Q.; Ramsundar, B.; Feinberg, E. N.; Gomes, J.; Geniesse, C.; Pappu, A. S.; Leswing, K.; Pande, V., MoleculeNet: a benchmark for molecular machine learning. Chem. Sci. 2018, 9 (2), 513-530.
(31) Zanchini, E., On the definition of extensive property energy by the first postulate of thermodynamics. Foundations of physics 1986, 16 (9), 923-935.
(32) Cemič, L., Thermodynamics in mineral sciences. Springer: 2005.
(33) Li, Y. P.; Han, K. H.; Grambow, C. A.; Green, W. H., Self-Evolving Machine: A Continuously Improving Model for Molecular Thermochemistry. J. Phys. Chem. A 2019, 123 (10), 2142-2152.
(34) Grambow, C. A.; Li, Y. P.; Green, W. H., Accurate Thermochemistry with Small Data Sets: A Bond Additivity Correction and Transfer Learning Approach. J. Phys. Chem. A 2019, 123 (27), 5826-5835.
(35) Ramakrishnan, R.; Dral, P. O.; Rupp, M.; von Lilienfeld, O. A., Quantum chemistry structures and properties of 134 kilo molecules. Scientific Data 2014, 1, 7.
(36) Scalia, G.; Grambow, C. A.; Pernici, B.; Li, Y.-P.; Green, W. H., Evaluating Scalable Uncertainty Estimation Methods for Deep Learning-Based Molecular Property Prediction. J. Chem Inf. Model. 2020.
(37) NIST Thermodynamics Research Center NIST/TRC Table Database. CD-ROM; 2004.
(38) Zhang, J. Accurate Calculations of Molecular Properties with Explicitly Correlated Methods. Virginia Tech, 2014.
(39) Zhang, J.; Valeev, E. F., Prediction of reaction barriers and thermochemical properties with explicitly correlated coupled-cluster methods: a basis set assessment. J. Chem. Theory Comput. 2012, 8 (9), 3175-3186.
(40) Császár, A. G.; Leininger, M. L.; Szalay, V., The standard enthalpy of formation of CH 2. The Journal of chemical physics 2003, 118 (23), 10631-10642.
(41) Peterson, K. A.; Feller, D.; Dixon, D. A., Chemical accuracy in ab initio thermochemistry and spectroscopy: current strategies and future challenges. Theoretical Chemistry Accounts 2012, 131 (1), 1079.
(42) Gao, C. W.; Allen, J. W.; Green, W. H.; West, R. H., Reaction Mechanism Generator: Automatic construction of chemical kinetic mechanisms. Computer Physics Communications 2016, 203, 212-225.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/78407-
dc.description.abstract增量理論(Increment theory)基於分子中存在的片段來估算其熱力學性質,並且藉由能進行線性加成的特性來預測難以從實驗或理論計算中得出的大分子的熱力學性質,然而增量理論的應用僅限於欲分析的分子有包含預先定義的原子團或成分,如果分子中存在新的化學片段,則需要定義新的原子團並收集新的數據以定義出這些新的化學單元對整體分子性質的貢獻,為了解決這個問題,我們引入了一種機器學習方法,該方法是利用圖像卷積神經網絡(GCNNs)並基於包含原子周圍的局部化學環境來學習每個原子的貢獻,並對每個原子貢獻求和以得出分子的總體性質,由於此方法遵循增量理論的可加成性,因此可以輕鬆地將其推廣到訓練數據中不存在的大分子。除此之外,我們還引入了另一種機器學習架構,該架構可以分別學習局部和全局的分子結構信息使得我們不僅可以得到每個原子的特性,還可以確定整個分子結構對分子性質的貢獻。經過一系列的探討、研究和比較,我們所建構的改良圖像卷積神經網絡在預測熱力學性質的準確度上可以勝過傳統的增量理論。相對於其他現行已開發的熱力學參數預測模型,我們所提出的架構可以提供更精準的預測效能,即使在進行大分子的性質預測或泛化能力檢測時,也可以提供高準確性的熱力學參數評估。zh_TW
dc.description.abstractThe increment theory estimates the property of a species based on the fragments presented in a molecule and can naturally be scaled up to predict properties of large molecules that are difficult to derive from experiments or ab initio calculations. However, the application of increment theory is limited to the species containing the pre-defined groups or components. If new chemical fragments exist in the molecules, it is required to define new groups and to collect new data to determine the contribution of these new chemical units. To address this issue, we introduce a machine learning approach that can learn the contribution of each atom based on the local environment that encompassing the atom using graph convolutional neural networks (GCNNs), and sum over the atomic contributions to derive the property of the molecule. Because this approach follows the additivity scheme of increment theory, it can be easily generalized to larger molecules that are not present in the training data. In addition, we also introduce a machine learning architecture that can separately learn local and global structural information, and thus can determine not only the properties of each individual atom but also the contribution of the overall molecular structure. We examined the performance of the proposed models on formation enthalpy predictions and found that these models can outperform the conventional increment theory. In addition, they also perform competitively with previously reported machine learning-based thermochemistry estimator, and can achieve a higher accuracy when tested on out-of-domain test sets containing molecules that are significantly larger than those in the training set.en
dc.description.provenanceMade available in DSpace on 2021-07-11T14:55:22Z (GMT). No. of bitstreams: 1
U0001-2801202115285400.pdf: 2031648 bytes, checksum: 4f8bb43fa47e9893ee6d06738ead2b81 (MD5)
Previous issue date: 2021
en
dc.description.tableofcontents
口試委員會審定書 2
誌謝 3
中文摘要 5
Abstract 6
Contents 8
Figure Index 10
Table Index 11
1. Introduction 12
2. Methodology 17
2.1 Model architecture 17
2.2 Atomic fingerprint 19
2.3 Hybrid fingerprint 21
2.4 Dimension 0 summation 22
2.5 Dimension 1 summation 23
2.6 Transfer learning on high accuracy dataset 27
2.7 Molecule features selection 29
2.8 Data preparation 30
3. Results and Discussion 31
3.1 D-MPNN and Fingerprints 32
3.2 Structure and Functional Group. 37
3.3 Generalization 40
3.4 Atomic fingerprint and Previous Work Comparison. 43
4. Conclusion 46
References 48
Appendix 51
A-0. Metric 51
A-1. Features 51
dc.language.isoen
dc.subject增量理論zh_TW
dc.subject熱力學性質zh_TW
dc.subject機器學習zh_TW
dc.subject可加成性zh_TW
dc.subject圖像卷積神經網絡(graph convolutional neural networkszh_TW
dc.subjectGCNNs)zh_TW
dc.subject泛化能力zh_TW
dc.subjectincrement theoryen
dc.subjectformation enthalpyen
dc.subjectgraph convolutional neural networks(GCNNs)en
dc.subjectadditivityen
dc.subjectmachine learningen
dc.subjectmolecular propertyen
dc.title利用分程深度特徵提取預測分子生成熱zh_TW
dc.titleRange-Separated Deep Learning Feature Extraction for Heat of Formation of Moleculesen
dc.typeThesis
dc.date.schoolyear109-1
dc.description.degree碩士
dc.contributor.oralexamcommittee林祥泰(Shiang-Tai Lin),謝依芸(I-Yun Hsieh),蔡秉均(Ping-Chun Tsai)
dc.subject.keyword增量理論,熱力學性質,機器學習,可加成性,圖像卷積神經網絡(graph convolutional neural networks, GCNNs),泛化能力,zh_TW
dc.subject.keywordincrement theory,molecular property,machine learning,additivity,graph convolutional neural networks(GCNNs),formation enthalpy,en
dc.relation.page52
dc.identifier.doi10.6342/NTU202100230
dc.rights.note有償授權
dc.date.accepted2021-01-29
dc.contributor.author-college工學院zh_TW
dc.contributor.author-dept化學工程學研究所zh_TW
dc.date.embargo-lift2026-01-29-
顯示於系所單位:化學工程學系

文件中的檔案:
檔案 大小格式 
U0001-2801202115285400.pdf
  未授權公開取用
1.98 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved