從音訊到主題：用卷積神經網路學習語意

Siao-Yun Dai; 戴筱芸

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/67014

標題:	從音訊到主題：用卷積神經網路學習語意 From Audio to Topics: Learning Semantics with Convolutional Neural Network
作者:	Siao-Yun Dai 戴筱芸
指導教授:	鄭卜壬
關鍵字:	卷積神經網路,隱含狄利克雷分布,主題模型,音訊, Convolutional Neural Network,LDA,topic model,audio signal,
出版年 :	2017
學位:	碩士
摘要:	Nowadays, music has become an import part of our lives. As cloud-based streaming service becomes popular, people are more dependent on music. Music as a tool of expressing emotions, it is rich in semantics. In previous genre and mood classification tasks, some people already show that combining lyrics and audio features can improve the results. Their research indicates there are potential relationship between audio and lyrics. Lyrics directly describe a song’s topic, while audio can expand the emotions. Nevertheless, lyrics can be incomplete or missing. If we can learn the topics from audio, we can guess the possible topics for a song without using lyrics. We proposed an unsupervised two-stage method. First, we learn the latent topics in lyrics by topic model. Second, we transfer audio signal to topic distribution via a convolutional neural network. We show that this framework can indeed learns a semantical representation from audio and can be directly applied to song retrievals. We can not only search the songs with lyrics. For those songs without lyrics, i.e. classical songs, we can also provide a reasonable result.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/67014
DOI:	10.6342/NTU201702967
全文授權:	有償授權
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-106-1.pdf 目前未授權公開取用	6.43 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。