Please use this Handle URI to cite this document: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40596
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | 王傑智 (Chieh-Chih Wang) | -
dc.contributor.author | Chi-Hao Lin | en
dc.contributor.author | 林祺豪 | zh_TW
dc.date.accessioned | 2021-06-14T16:52:38Z | -
dc.date.available | 2008-08-05 | -
dc.date.copyright | 2008-08-05 | -
dc.date.issued | 2008 | -
dc.date.submitted | 2008-07-29 | -
dc.identifier.citation | Birchfield, S. T. & Gangishetty, R. (2005). Acoustic localization by interaural level difference. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
Cui, W., Cao, Z., & Wei, J. (2006). Dual-microphone source location method in 2-D space. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
Hu, J.-S., Cheng, C.-C., & Liu, W.-H. (2006). Robust speaker's location detection in a vehicle environment using GMM models. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 36(2), 403–412.
Knapp, C. H. & Carter, G. C. (1976). The generalized correlation method for estimation of time delay. IEEE Trans. Acoust., Speech, Signal Processing, 24, 320–327.
Nakashima, H. & Mukai, T. (2005). 3D sound source localization system based on learning of binaural hearing. In IEEE International Conference on Systems, Man and Cybernetics.
Thrun, S. (2005). Affine structure from sound. In Proceedings of Conference on Neural Information Processing Systems (NIPS), Cambridge, MA. MIT Press.
Tomasi, C. & Kanade, T. (1992). Shape and motion from image streams under orthography: a factorization method. International Journal of Computer Vision, 9(2), 137–154.
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40596 | -
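dc.description.abstract | In robotics, auditory perception is a very important function. Microphone arrays are widely used in auditory perception applications, and in these applications the spatial coordinates of the microphones are usually known. The Structure from Sound algorithm provides a flexible way to calibrate microphone arrays of different configurations: it simultaneously localizes multiple microphones and multiple sound sources. However, the existing algorithms neither take measurement uncertainty into account nor provide uncertainty estimates for the Structure from Sound results. In this thesis, we propose a Probabilistic Structure from Sound algorithm. In addition, we propose a Probabilistic Sound Source Localization algorithm, which uses the results of the Probabilistic Structure from Sound algorithm to improve the accuracy of sound source localization. We use low-cost microphones. Extensive simulation and experimental results demonstrate the performance of the Probabilistic Structure from Sound and Probabilistic Sound Source Localization algorithms. | zh_TW (translated)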
dc.description.abstract | Auditory perception is one of the most important functions for robotics applications. Microphone arrays are widely used for auditory perception, and in these applications the spatial structure of the microphones is usually known. The thesis first describes the affine Structure from Sound (SFS) algorithm. Structure from sound is the problem of simultaneously localizing microphones and sound sources. However, the existing method does not take measurement uncertainty into account and does not provide uncertainty estimates of the SFS results. In this thesis, we propose a probabilistic structure from sound (PSFS) approach using the unscented transform. The PSFS algorithm not only localizes microphones and sound sources but also estimates the uncertainties of the SFS results. In addition, a probabilistic sound source localization (PSSL) approach using the PSFS results is provided to improve sound source localization accuracy. Extensive simulation results and experiments using low-cost, off-the-shelf microphones demonstrate the feasibility and performance of the proposed PSFS and PSSL approaches. | en
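The abstract names the unscented transform as the tool for propagating measurement uncertainty. The record does not give the thesis's formulation, so the following is only a minimal sketch of the generic textbook transform (symmetric sigma points with a scaling parameter `kappa`). The function names, the `kappa` default, and the range-measurement usage example are assumptions chosen for illustration, not the author's implementation.

```python
import math

def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low

def unscented_transform(mean, cov, f, kappa=1.0):
    """Propagate a Gaussian (mean, cov) through f using 2n+1 sigma points.

    Assumption: the classic symmetric sigma-point set with weights
    kappa/(n+kappa) for the centre and 1/(2(n+kappa)) for the others.
    """
    n = len(mean)
    scale = n + kappa
    root = cholesky([[scale * c for c in row] for row in cov])
    # Sigma points: the mean, plus/minus each column of the matrix square root.
    points = [list(mean)]
    for j in range(n):
        col = [root[i][j] for i in range(n)]
        points.append([m + c for m, c in zip(mean, col)])
        points.append([m - c for m, c in zip(mean, col)])
    weights = [kappa / scale] + [1.0 / (2.0 * scale)] * (2 * n)
    ys = [f(p) for p in points]
    dim = len(ys[0])
    y_mean = [sum(w * y[i] for w, y in zip(weights, ys)) for i in range(dim)]
    y_cov = [[sum(w * (y[i] - y_mean[i]) * (y[j] - y_mean[j])
                  for w, y in zip(weights, ys))
              for j in range(dim)] for i in range(dim)]
    return y_mean, y_cov

# Hypothetical usage: uncertainty of the distance from an uncertain 2-D
# microphone position to a sound source assumed to sit at (3.0, 4.0).
def range_to_source(p):
    return [math.hypot(p[0] - 3.0, p[1] - 4.0)]

r_mean, r_cov = unscented_transform([0.0, 0.0],
                                    [[0.01, 0.0], [0.0, 0.01]],
                                    range_to_source)
print(r_mean, r_cov)
```

For a linear function the transform reproduces the exact mean and covariance, which is a convenient sanity check before applying it to a nonlinear measurement model.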
dc.description.provenance | Made available in DSpace on 2021-06-14T16:52:38Z (GMT). No. of bitstreams: 1
ntu-97-R94922124-1.pdf: 4017230 bytes, checksum: 27e56cfbebd70979fa29db7573a14953 (MD5)
Previous issue date: 2008 | en
dc.description.tableofcontents | ABSTRACT (p. ii)
LIST OF FIGURES (p. v)
CHAPTER 1. Introduction (p. 1)
CHAPTER 2. Related Work (p. 3)
CHAPTER 3. Sound Source Localization (p. 5)
3.1. TDOA (p. 5)
3.2. GCC (p. 5)
3.3. Sound Source Localization (p. 6)
CHAPTER 4. Affine Structure from Sound (p. 9)
4.1. Problem Setup (p. 10)
4.2. Data Measurement (p. 10)
4.3. Least square solution (p. 11)
4.4. The Far Field Approximation (p. 11)
4.5. Affine Solution (p. 12)
4.6. Recovering the sound source locations (p. 12)
CHAPTER 5. Probabilistic Structure from Sound (p. 17)
5.1. The Unscented Transform (p. 17)
5.2. Dealing with Uncertainties (p. 18)
5.2.1. Gaussian Mean (p. 18)
5.2.2. Gaussian Covariance (p. 18)
5.2.3. Uncertainties Estimation using Unscented Transform (p. 19)
5.3. Dealing with Axis Inconsistency (p. 19)
CHAPTER 6. Probabilistic Sound Source Localization (p. 23)
6.1. Gaussian Mean (p. 23)
6.2. Gaussian Covariance (p. 24)
6.3. Probabilistic Sound Source Localization (p. 24)
CHAPTER 7. Experimental Results (p. 26)
7.1. TDOA Estimation (p. 26)
7.2. PSFS (p. 26)
7.2.1. The Near Field Condition (p. 27)
7.2.2. The Far Field Condition (p. 28)
7.2.3. The Echoes Condition (p. 28)
7.2.4. The Non Affine Model (p. 29)
7.3. The PSSL Results (p. 30)
CHAPTER 8. Conclusion and Future Work (p. 34)
BIBLIOGRAPHY (p. 35)
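dc.language.iso | en | -
dc.subject | sound source localization | zh_TW (translated)
dc.subject | microphone arrays | zh_TW (translated)
dc.subject | structure from sound | zh_TW (translated)
dc.subject | structure from sound | en
dc.subject | sound source localization | en
dc.subject | microphone arrays | en
dc.title | Probabilistic Microphone Array Calibration and Sound Source Localization System | zh_TW (translated)
dc.title | Probabilistic Structure from Sound and Sound Source Localization | en
dc.type | Thesis | -
dc.date.schoolyear | 96-2 | -
dc.description.degree | Master's | -
dc.contributor.oralexamcommittee | 郭振華 (Jen-Hwa Guo), 黃寶儀 (Polly Huang), 胡竹生 (Jwu-Sheng Hu) | -
dc.subject.keyword | microphone arrays, structure from sound, sound source localization | zh_TW (translated)
dc.subject.keyword | microphone arrays, structure from sound, sound source localization | en
dc.relation.page | 33 | -
dc.rights.note | Paid license | -
dc.date.accepted | 2008-07-31 | -
dc.contributor.author-college | College of Electrical Engineering and Computer Science | zh_TW (translated)
dc.contributor.author-dept | Graduate Institute of Computer Science and Information Engineering | zh_TW (translated)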
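Appears in collections: Department of Computer Science and Information Engineering

Files in this item:
File | Size | Format
ntu-97-1.pdf (not authorized for public access) | 3.92 MB | Adobe PDF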


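Items in this system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.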
