耳蝸物理模型為基礎的音高辨識方法

Ruei-Min Lin; 林睿敏

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/33338

標題:	耳蝸物理模型為基礎的音高辨識方法 Pitch Recognition Based on Cochlear Model
作者:	Ruei-Min Lin 林睿敏
指導教授:	鄭士康(Shyh-Kang Jeng)
關鍵字:	音高辨識,耳蝸,耳蝸模型,基底膜, pitch,pitch tracking,pitch recognition,pitch determination algorithm,PDA,cochlea,cochlear model,basement membrane,BM,
出版年 :	2006
學位:	碩士
摘要:	本論文提出了一個新的音高辨識方法：利用簡單的耳蝸物理模型，可以讓電腦模擬人耳聽覺的產生，來辨識音高。傳統音高辨識方法主要分為兩大類：第一類型方法利用簡單的自相關函式，將一段聲音波形輸入函式後，經由簡單的運算，可以迅速得到主要頻率音高；第二類型方法則是將一段時域資料的聲音波形，經過傅立葉轉換後，轉成頻域資料得到頻譜，再分析頻譜得到音高。本論文藉由一個簡單的耳蝸物理模型，可以直接利用時域資料去振動耳蝸中的基底膜，再藉由分析基底膜的振動情形抓出音高。不同於第一類方法只能抓到一個主要的頻率，我們的方法，因為整條基底膜的彈性並不一致，所以可以同時抓出各個頻率的組成大小。另外，少了頻譜轉換的步驟，因此我們的方法，運算速度比起第二類方法快速許多。 In this paper, an algorithm for pitch recognition is designed. This algorithm is based on a simplified cochlear model. The traditional methods are mainly divided into two categories: one is to utilize and analyze the amplitude of sound in time domain directly; the other is to transform the sound into the frequency domain first, and then do some analysis to recognize the pitch. The operation amount in time domain is relatively small, but mostly it can only detect a single frequency. The second type of methods needs to do the transform first, so the speed is relatively slow. After getting the frequency spectrum, we can apply some algorithm to do the pitch recognition. My algorithm, which is called CM (Cochlear Model), combines the advantages of above-mentioned two kinds of methods. CM utilizes the amplitude of sound directly. Through the simple cochlea physical model, the vibration situation of the BM(basement membrane) in the cochlea can tell the pitch. For the elasticity in the BM is not uniform, we can tell more than one single frequency at the same time.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/33338
全文授權:	有償授權
顯示於系所單位：	電機工程學系

文件中的檔案：

檔案	大小	格式
ntu-95-1.pdf 目前未授權公開取用	1.19 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。