Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/80599
Full metadata record
???org.dspace.app.webui.jsptag.ItemTag.dcfield???ValueLanguage
dc.contributor.advisor林守德(Shou-De Lin)
dc.contributor.authorChih-Chun Yangen
dc.contributor.author楊之郡zh_TW
dc.date.accessioned2022-11-24T03:10:23Z-
dc.date.available2021-11-03
dc.date.available2022-11-24T03:10:23Z-
dc.date.copyright2021-11-03
dc.date.issued2021
dc.date.submitted2021-10-27
dc.identifier.citation[1] S. Buuren and C. Groothuis­Oudshoorn. Mice: Multivariate imputation by chained equations in r. Journal of Statistical Software, 45, 12 2011. [2] D. Dua and C. Graff. UCI machine learning repository, 2017. [3] I. Goodfellow, J. Pouget­Abadie, M. Mirza, B. Xu, D. Warde­Farley, S. Ozair, A. Courville, and Y. Bengio. Generative adversarial nets. In Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems, volume 27. Curran Associates, Inc., 2014. [4] O. Troyanskaya, M. Cantor, G. Sherlock, P. Brown, T. Hastie, R. Tibshirani, D. Bot­ stein, and R. B. Altman. Missing value estimation methods for DNA microarrays . Bioinformatics, 17(6):520–525, 06 2001. [5] A.Vaswani,N.Shazeer,N.Parmar,J.Uszkoreit,L.Jones,A.N.Gomez,L.u.Kaiser, and I. Polosukhin. Attention is all you need. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017. [6] J. Yoon, J. Jordon, and M. van der Schaar. GAIN: Missing data imputation using generative adversarial nets. In J. Dy and A. Krause, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pages 5689–5698. PMLR, 10–15 Jul 2018. [7] J. You, X. Ma, Y. Ding, M. J. Kochenderfer, and J. Leskovec. Handling missing data with graph representation learning. In H. Larochelle, M. Ranzato, R. Hadsell, M. F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 19075–19087. Curran Associates, Inc., 2020.
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/80599-
dc.description.abstract如何處理測試資料中的缺失值一直是一項備受關注的研究主題。對於學習在完整資料上且沒有做出預防措施的下游模型,我們必須先針對資料進行缺失值的填補才可進行預測。因此,如何學習一個良好的缺失值填補方法以及利用已知下游模型所提供的資訊將會是這項研究主題的關鍵。本論文旨在探討如何運用已知且不可更動的下游分類模型針對有缺失值的表格式資料進行預測。我們提出一項全新的架構,它包含了一個運用自注意力架構的缺失值填補模型以及一項迭代的下游任務標籤估計方法。架構中的缺失值填補模型可以替換成任何已知的模型。透過兩項自監督的學習任務以及從下游模型反饋的資訊,我們的缺失值填補模型在所有的競爭者中達到最佳的效果。我們透過大量的實驗來驗證我們提出的架構不只對於我們所提出的自注意力缺失值填補模型有所幫助,對於所有已知的缺失值填補方法也都可以提供進一步的效能提升。zh_TW
dc.description.provenanceMade available in DSpace on 2022-11-24T03:10:23Z (GMT). No. of bitstreams: 1
U0001-2210202118454100.pdf: 1290056 bytes, checksum: 99f8015c8cb0a5503f3f11a0f70415c0 (MD5)
Previous issue date: 2021
en
dc.description.tableofcontents口試委員審定書 P.i 摘要 P. iii Abstract P. iiii Contents P. vi List of Figures P. viii List of Tables P. xi 1 Introduction P. 1 2 Related Work P. 4 2.1 Missing Data Imputation P. 4 3 Methodology P. 6 3.1 Notations and Problem Formulation P. 6 3.2 Downstream­-Aware Imputation FrameworkP. 6 3.3 Learning Imputation Model with Self­Supervision P. 7 3.3.1 Masking and Prediction Task P. 9 3.3.2 Missing Adversarial Task P. 10 3.3.3 Learning Objectives of Self­-Supervised Tasks for Imputer P. 12 3.4 Exploiting Knowledge from the Prediction of Downstream Model P. 12 3.4.1 Iterative Downstream Label Estimation Algorithm P. 13 3.5 Full Optimization Pipeline of Our Framework P. 14 3.5.1 Full Learning Objectives for Imputer P. 15 3.5.2 Full Training Pipeline P. 15 4. Experiments P. 17 4.1 ExperimentSetup P. 17 4.1.1 Datasets P. 18 4.1.2 Baseline Methods P. 18 4.2 Comparison of Imputation Model P. 19 4.3 Analysis on Downstream Label Estimation Algorithm P. 20 4.4 Ablation Studies P. 21 5. Conclusion P. 23 Reference P. 24
dc.language.isozh-TW
dc.subject深度學習zh_TW
dc.subject表格式資料zh_TW
dc.subject預測資料之缺失值填補zh_TW
dc.subject自監督學習zh_TW
dc.subject自注意力機制zh_TW
dc.subjectSelf-Attentionen
dc.subjectSelf-Supervised Learningen
dc.subjectMissing Testing Data Imputationen
dc.subjectTabular Dataen
dc.subjectDeep Learningen
dc.title基於自監督及迭代標籤估計之預測時資料缺失值處理zh_TW
dc.titleHandling Missing Data during Prediction based onSelf­Supervision and Iterative Label Estimationen
dc.date.schoolyear109-2
dc.description.degree碩士
dc.contributor.oralexamcommittee李政德(Hsin-Tsai Liu),駱宏毅(Chih-Yang Tseng)
dc.subject.keyword深度學習,表格式資料,預測資料之缺失值填補,自監督學習,自注意力機制,zh_TW
dc.subject.keywordDeep Learning,Tabular Data,Missing Testing Data Imputation,Self-Supervised Learning,Self-Attention,en
dc.relation.page25
dc.identifier.doi10.6342/NTU202104052
dc.rights.note同意授權(限校園內公開)
dc.date.accepted2021-10-28
dc.contributor.author-college電機資訊學院zh_TW
dc.contributor.author-dept資訊工程學研究所zh_TW
Appears in Collections:資訊工程學系

Files in This Item:
File SizeFormat 
U0001-2210202118454100.pdf
Access limited in NTU ip range
1.26 MBAdobe PDF
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved