Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 電機工程學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/33338
Title: 耳蝸物理模型為基礎的音高辨識方法
Pitch Recognition Based on Cochlear Model
Authors: Ruei-Min Lin
林睿敏
Advisor: 鄭士康(Shyh-Kang Jeng)
Keyword: 音高辨識,耳蝸,耳蝸模型,基底膜,
pitch,pitch tracking,pitch recognition,pitch determination algorithm,PDA,cochlea,cochlear model,basement membrane,BM,
Publication Year : 2006
Degree: 碩士
Abstract: 本論文提出了一個新的音高辨識方法:利用簡單的耳蝸物理模型,可以讓電腦模擬人耳聽覺的產生,來辨識音高。傳統音高辨識方法主要分為兩大類:第一類型方法利用簡單的自相關函式,將一段聲音波形輸入函式後,經由簡單的運算,可以迅速得到主要頻率音高;第二類型方法則是將一段時域資料的聲音波形,經過傅立葉轉換後,轉成頻域資料得到頻譜,再分析頻譜得到音高。本論文藉由一個簡單的耳蝸物理模型,可以直接利用時域資料去振動耳蝸中的基底膜,再藉由分析基底膜的振動情形抓出音高。不同於第一類方法只能抓到一個主要的頻率,我們的方法,因為整條基底膜的彈性並不一致,所以可以同時抓出各個頻率的組成大小。另外,少了頻譜轉換的步驟,因此我們的方法,運算速度比起第二類方法快速許多。
In this paper, an algorithm for pitch recognition is designed. This algorithm is based on a simplified cochlear model. The traditional methods are mainly divided into two categories: one is to utilize and analyze the amplitude of sound in time domain directly; the other is to transform the sound into the frequency domain first, and then do some analysis to recognize the pitch. The operation amount in time domain is relatively small, but mostly it can only detect a single frequency. The second type of methods needs to do the transform first, so the speed is relatively slow. After getting the frequency spectrum, we can apply some algorithm to do the pitch recognition. My algorithm, which is called CM (Cochlear Model), combines the advantages of above-mentioned two kinds of methods. CM utilizes the amplitude of sound directly. Through the simple cochlea physical model, the vibration situation of the BM(basement membrane) in the cochlea can tell the pitch. For the elasticity in the BM is not uniform, we can tell more than one single frequency at the same time.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/33338
Fulltext Rights: 有償授權
Appears in Collections:電機工程學系

Files in This Item:
File SizeFormat 
ntu-95-1.pdf
  Restricted Access
1.19 MBAdobe PDF
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved