請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/59793
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 陳建錦(Chien-Chin Chen) | |
dc.contributor.author | Chih-Wei Yang | en |
dc.contributor.author | 楊智幃 | zh_TW |
dc.date.accessioned | 2021-06-16T09:38:17Z | - |
dc.date.available | 2020-02-16 | |
dc.date.copyright | 2017-02-16 | |
dc.date.issued | 2016 | |
dc.date.submitted | 2017-02-10 | |
dc.identifier.citation | 1. Zickuhr, K., Generations 2010. Pew internet & American life project. Pew
Research Center, Washington, DC, 2010. 2. Fox, S. and M. Duggan, Health online 2013. pew internet and american life project. Pew Research Center’s Internet and American Life Project, 2013. 3. Pradhan, S., et al., Evaluating the state of the art in disorder recognition and normalization of the clinical narrative. Journal of the American Medical Informatics Association, 2015. 22(1): p. 143-154. 4. Tang, B., et al. Recognizing and Encoding Discorder Concepts in Clinical Text using Machine Learning and Vector Space Model. in Workshop of ShARe/CLEF eHealth Evaluation Lab 2013. 2013. 5. Lafferty, J., A. McCallum, and F.C. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data. 2001. 6. Sang, E.F. and J. Veenstra. Representing text chunks. in Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics. 1999. Association for Computational Linguistics. 7. Cogley, J., N. Stokes, and J. Carthy. Medical disorder recognition with structural support vector machines. in Online Working Notes of the CLEF 2013 Evaluation Labs and Workshop. 2013. 8. Leaman, R. and G. Gonzalez. BANNER: an executable survey of advances in biomedical named entity recognition. in Pacific Symposium on Biocomputing. 2008. Citeseer. 9. Chowdhury, M. and M. Faisal. Disease mention recognition with specific features. in Proceedings of the 2010 workshop on biomedical natural language processing. 2010. Association for Computational Linguistics. 10. Li, D., K. Kipper-Schuler, and G. Savova. Conditional random fields and support vector machines for disorder named entity recognition in clinical texts. in Proceedings of the workshop on current trends in biomedical natural language processing. 2008. Association for Computational Linguistics. 11. Bodenreider, O., The unified medical language system (UMLS): integrating biomedical terminology. Nucleic acids research, 2004. 32(suppl 1): p. D267- D270. 12. Wang, Y. and J. Patrick, Cascading Classifiers for Named Entity Recognition in Clinical Notes. BIOMEDICAL INFORMATION EXTRACTION, 2009: p. 42. 13. Aronson, A.R. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. in Proceedings of the AMIA Symposium. 2001. American Medical Informatics Association. 14. Abacha, A.B. and P. Zweigenbaum. Medical entity recognition: A comparison of semantic and statistical methods. in Proceedings of BioNLP 2011 Workshop. 2011. Association for Computational Linguistics. 15. Tang, B., et al., Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features. BMC medical informatics and decision making, 2013. 13(Suppl 1): p. S1. 16. Tang, Y.Z.J.W.B., Y.W.M. Jiang, and Y.C.H. Xu, UTH_CCB: A Report for SemEval 2014–Task 7 Analysis of Clinical Text. SemEval 2014, 2014: p. 802. 17. Kaewphan, S., K. Hakaka, and F. Ginter, UTU: Disease Mention Recognition and Normalization with CRFs and Vector Space Representations. SemEval 2014, 2014: p. 807. 18. Levin, A. and Y. Weiss, Learning to Combine Bottom-Up and Top-Down Segmentation. 19. Bundschus, M., et al., Extraction of semantic biomedical relations from text using conditional random fields. BMC bioinformatics, 2008. 9(1): p. 207. 20. Ghiasvand, O., Disease Name Extraction from Clinical Text Using Conditional Random Fields. 2014. 21. Suárez-Paniagua, V., I. Segura-Bedmar, and P. Martınez, Word Embedding Clustering for Disease Named Entity Recognition. 22. Hacioglu, K. Semantic role labeling using dependency trees. in Proceedings of the 20th international conference on Computational Linguistics. 2004. Association for Computational Linguistics. 23. De Marneffe, M.-C. and C.D. Manning. The Stanford typed dependencies representation. in Coling 2008: Proceedings of the workshop on Cross- Framework and Cross-Domain Parser Evaluation. 2008. Association for Computational Linguistics. 24. De Marneffe, M.-C. and C.D. Manning, Stanford typed dependencies manual. 2008, Technical report, Stanford University. 25. Bodenreider, O. and A.T. McCray, Exploring semantic groups through visual approaches. Journal of biomedical informatics, 2003. 36(6): p. 414-432. 26. McCray, A.T., A. Burgun, and O. Bodenreider, Aggregating UMLS semantic types for reducing conceptual complexity. Studies in health technology and informatics, 2001. 84(0 1): p. 216. 27. Mikolov, T., et al. Distributed representations of words and phrases and their compositionality. in Advances in neural information processing systems. 2013. 28. Mikolov, T., et al., Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013. 29. Bengio, Y., et al., A neural probabilistic language model. journal of machine learning research, 2003. 3(Feb): p. 1137-1155. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/59793 | - |
dc.description.abstract | With the advance of technology and the prevalence of Internet access, more and more users attempt to search medical advice on the Internet, and various healthcare websites thus thrive. Users usually seek assistance from those who own similar experiences on healthcare websites. However, there is a great deal of unreliable information without professional endorsement, as the result, users tend to be misled and their conditions may further deteriorate. Even if there are authoritative practitioners involved, they have problem dealing with heavy demand in daily medical advice. Recently, a number of researches explore the intelligent disease inference system, and simply divide it into two parts: medical named entity recognition and disease normalization.
This research mainly focuses on symptom named entity recognition. We conduct the experiments using pre-annotated clinical reports released by International Workshop on Semantic Evaluation 2014 Task 7. For each word in the report, we extract features and categorize them into four groups including lexical/morphological, syntactic, semantic, and combinational features, and then utilize machine learning based approach – condition random fields (CRFs) to construct a model that identifies the span of symptom entities in clinical reports. The system performance is evaluated by precision, recall, and f-measure. Our method outperformed some participants in Workshop on Semantic Evaluation 2014 Task 7. Eventually, we analyze the feature influence and key to improve our system in the future. | en |
dc.description.provenance | Made available in DSpace on 2021-06-16T09:38:17Z (GMT). No. of bitstreams: 1 ntu-105-R03725052-1.pdf: 1784155 bytes, checksum: d604737e3a7e44e6ebfcd073c7cd0b0e (MD5) Previous issue date: 2016 | en |
dc.description.tableofcontents | 誌謝 i
中文摘要 ii ABSTRACT iii CONTENTS iv LIST OF FIGURES vi LIST OF TABLES vii Chapter 1 Introduction 1 Chapter 2 Related Work 5 2.1 Tagging Formats 5 2.2 Medical Named Entity Recognition 6 Chapter 3 Methodology 9 3.1 Problem Formation 9 3.2 Conditional Random Fields 10 3.2.1 Training 11 3.2.2 Inference 15 3.3 Feature Extraction 16 3.3.1 Group 1 - Lexical/morphological Features 17 3.3.2 Group 2 - Syntactic Features 18 3.3.3 Group 3 - Semantic Features 20 3.3.4 Group 4 - Combinational Features 25 Chapter 4 Experiments 27 4.1 Dataset and Evaluation Procedure 27 4.2 System Evaluation 29 Chapter 5 Conclusion and Future Work 33 REFERENCE 35 | |
dc.language.iso | en | |
dc.title | 使用隨機條件域之智慧型病症辨識方法 | zh_TW |
dc.title | An Intelligent Symptom Named Entity Recognition Method using Conditional Random Fields | en |
dc.type | Thesis | |
dc.date.schoolyear | 105-1 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 陳孟彰(Meng-Chang Chen),盧信銘(Hsin-Ming Lu),蔡銘峰(Ming-Feng Tsai) | |
dc.subject.keyword | 健康照護,專名識別,序列標記, | zh_TW |
dc.subject.keyword | healthcare,named entity recognition,sequence labeling, | en |
dc.relation.page | 36 | |
dc.identifier.doi | 10.6342/NTU201700264 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2017-02-10 | |
dc.contributor.author-college | 管理學院 | zh_TW |
dc.contributor.author-dept | 資訊管理學研究所 | zh_TW |
顯示於系所單位: | 資訊管理學系 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-105-1.pdf 目前未授權公開取用 | 1.74 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。