Neural Normalization of Diverse Entity Labels

I-Hsien Chen; 陳奕先

Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/55572

Title:	Neural Normalization of Diverse Entity Labels (利用類神經網路正則化相異實體名稱)
Author:	I-Hsien Chen (陳奕先)
Advisor:	Pu-Jen Cheng (鄭卜壬)
Keywords:	Natural Language Processing, Text Generation Tasks, Pointer-Generator Network, Multi-Task Learning, Normalization of Entity Labels, Weighted Loss Function
Year of publication:	2019
Degree:	Master's
Abstract:	Entity labels indicate the name or description of an entity, and their format usually follows no consistent specification. The diversity of labels can be roughly divided into differences in category and differences in style. Entities of different categories, such as schools and banks, usually have clearly different customary names, so their labels differ accordingly. Labels for entities of the same category may still vary in style because of different uses and sources, for example formal versus informal names for a school. The dataset used in this work consists of a large number of telephone numbers, which serve as the “entities” and span a wide range of categories, such as government agencies, restaurants, and companies. The owner of each telephone number may have several names from different sources, whose formats and usage vary widely and can be regarded as “labels” in different styles. The dataset therefore covers both kinds of diversity at once.

For these diverse entity labels, we aim to normalize them with a neural network so that each entity obtains a single representative label. We use a text summarization model as the basic framework and train it with a weighted loss function so that the objective better suits our task. We then introduce multi-task learning, using an auxiliary task to help the model learn.

In the experiments, we first propose a pre-processing method designed for our dataset. We then compare the performance of several models and training schemes, examine the outputs, and discuss and explain the models' behavior to demonstrate the effectiveness of the proposed methods. We also examine and discuss the error cases in more depth.
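As an illustration of the weighted loss mentioned in the abstract, the following is a minimal sketch of a per-token weighted negative log-likelihood for a generated label. The record does not state how the thesis chooses or applies its weights, so the function name `weighted_nll`, the per-step `weights` argument, and the normalization by the weight sum are all assumptions made for this example and should not be read as the author's formulation.

```python
import math


def weighted_nll(step_probs, target_ids, weights):
    """Weighted negative log-likelihood of one target token sequence.

    step_probs: one probability distribution (list of floats) per decoding step
    target_ids: index of the reference token at each step
    weights:    per-step weight (hypothetical; the weighting scheme used in
                the thesis is not described in this record)
    """
    if not (len(step_probs) == len(target_ids) == len(weights)):
        raise ValueError("step_probs, target_ids and weights must have equal length")
    total = 0.0
    for probs, target, weight in zip(step_probs, target_ids, weights):
        # Clamp the probability to avoid log(0) for tokens the model ruled out.
        total += -weight * math.log(max(probs[target], 1e-12))
    norm = sum(weights)
    # Normalize by the weight sum so that uniform weights give the plain mean NLL.
    return total / norm if norm > 0 else 0.0


# Two decoding steps over a two-token vocabulary.
probs = [[0.5, 0.5], [0.25, 0.75]]
uniform = weighted_nll(probs, [0, 1], [1.0, 1.0])
first_only = weighted_nll(probs, [0, 1], [2.0, 0.0])
```

With uniform weights the value reduces to the ordinary mean negative log-likelihood, while a non-uniform weighting shifts the objective toward the steps that carry more weight.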
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/55572
DOI:	10.6342/NTU202002051
Full-text license:	Paid license
Appears in department:	Department of Computer Science and Information Engineering

Files in this item:

File	Size	Format
U0001-2907202018142100.pdf (not currently authorized for public access)	1.95 MB	Adobe PDF


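Unless their copyright terms are specifically indicated otherwise, items in this system are protected by copyright, with all rights reserved.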
