機率型麥克風陣列校正與聲源定位系統

Chi-Hao Lin; 林祺豪

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40596

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	王傑智(Chieh-Chih Wang)
dc.contributor.author	Chi-Hao Lin	en
dc.contributor.author	林祺豪	zh_TW
dc.date.accessioned	2021-06-14T16:52:38Z	-
dc.date.available	2008-08-05
dc.date.copyright	2008-08-05
dc.date.issued	2008
dc.date.submitted	2008-07-29
dc.identifier.citation	Birchfield, S. T. & Gangishetty, R. (2005). Acoustic localization by interaural level difference. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Cui, W., Cao, Z., & Wei, J. (2006). Dual-microphone source location method in 2-d space. In International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Hu, J.-S., Cheng, C.-C., & Liu,W.-H. (2006). Robust speaker’s location detection in a vehicle environment using gmmmodels. IEEE Transactions on Systems,Man and Cybernetics - PartB: Cybernetics, 36(2), 403–412. Knapp, C. H. & Carter, G. C. (1976). The generalized correlation method for estimation of time delay. IEEE Trans. Acoust., Speech, Signal Processing, 24, 320–327. Nakashima, H. & Mukai, T. (2005). 3d sound source localization system based on learning of binaural hearing. In IEEE International Conference on Systems, Man and Cybernetics. Thrun, S. (2005). Affine structure from sound. In Proceedings of Conference on Neural Information Processing Systems (NIPS), Cambridge, MA. MIT Press. Tomasi, C. & Kanade, T. (1992). Shape andmotion fromimage streams under orthography: a factorization method. International Journal of Computer Vision, 9(2), 137–154.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/40596	-
dc.description.abstract	在機器人學裡，聲音感知是一個非常重要的功能。麥克風陣列在聲音感知的應用中被廣泛的使用，而在這些應用裡，麥克風在空間的座標通常是已知的。聲音估計結構(Structure from Sound)演算法提供了一個富有彈性之方法來校正不同結構的麥克風陣列，它能同時地定位多個麥克風與定位多個聲源。然而，在現存的演算法裡並沒有將量測的不確定性納入考量，也沒有提供音聲估計結構演算法之結果的不確定性估測。在這篇論文裡，我們提出了一個機率型聲音估計結構演算法(Probabilistic Structure from Sound)。此外，我們提出了一個機率型聲源定位演算法(Probabilistic Sound Source Localization)，此演算法是使用機率型聲音估計結構演算法的結果來改進聲源定位的準確性。我們使用低成本的麥克風。大量的模擬與實驗結果成功的展示了機率型聲音估計結構演算法與機率型聲源定位演算法之成果。	zh_TW
dc.description.abstract	Auditory perception is one of the most important functions for robotics applications. Microphone arrays are widely used for auditory perception in which the spatial structure of microphones is usually known. The thesis first describes the affine Structure from Sound (SFS) algorithm. The structure from sound is a problem to simultaneously localize microphones and sound sources. However, the existing method does not take measurement uncertainty into account and does not provide uncertainty estimates of the SFS results. In this thesis, we propose a probabilistic structure from sound (PSFS) approach using the unscented transform. The PSFS algorithm not only localizes microphones and sound sources but also estimates the uncertainties of the SFS results. In addition, a probabilistic sound source localization (PSSL) approach using the PSFS results is provided to improve sound source localization accuracy. The ample results of simulation and experiments using low cost, off-the-shell microphones demonstrate the feasibility and performance of the proposed PSFS and PSSL approaches.	en
dc.description.provenance	Made available in DSpace on 2021-06-14T16:52:38Z (GMT). No. of bitstreams: 1 ntu-97-R94922124-1.pdf: 4017230 bytes, checksum: 27e56cfbebd70979fa29db7573a14953 (MD5) Previous issue date: 2008	en
dc.description.tableofcontents	ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v CHAPTER 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 CHAPTER 2. Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 CHAPTER 3. Sound Source Localization . . . . . . . . . . . . . . . . . . . . . . . . . 5 3.1. TDOA . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 3.2. GCC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 3.3. Sound Source Localization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 CHAPTER 4. Afﬁne Structure from Sound . . . . . . . . . . . . . . . . . . . . . . . . 9 4.1. Problem Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 4.2. Data Measurement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 4.3. Least square solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 4.4. The Far Field Approximation . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 4.5. Afﬁne Solution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 4.6. Recovering the sound source locations . . . . . . . . . . . . . . . . . . . . . . 12 CHAPTER 5. Probabilistic Structure from Sound . . . . . . . . . . . . . . . . . . . . 17 5.1. The Unscented Transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 5.2. Dealing with Uncertainties . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5.2.1. Gaussian Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5.2.2. Gaussian Covariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 5.2.3. Uncertainties Estimation using Unscented Transform . . . . . . . . . . . 19 5.3. Dealing with Axis Inconsistency . . . . . . . . . . . . . . . . . . . . . . . . . 19 CHAPTER 6. Probabilistic Sound Source Localization . . . . . . . . . . . . . . . . . 23 6.1. Gaussian Mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 6.2. Gaussian Covariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24 6.3. Probabilistic Sound Source Localization . . . . . . . . . . . . . . . . . . . . . 24 CHAPTER 7. Experimental Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 7.1. TDOA Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 7.2. PSFS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 7.2.1. The Near Field Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 7.2.2. The Far Field Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 7.2.3. The Echoes Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 7.2.4. The Non Afﬁne Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 7.3. The PSSL Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 CHAPTER 8. Conclusion and FutureWork . . . . . . . . . . . . . . . . . . . . . . . 34 BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
dc.language.iso	en
dc.subject	聲源定位	zh_TW
dc.subject	麥克風陣列	zh_TW
dc.subject	聲音估計結構	zh_TW
dc.subject	structure from sound	en
dc.subject	sound source localization	en
dc.subject	microphone arrays	en
dc.title	機率型麥克風陣列校正與聲源定位系統	zh_TW
dc.title	Probabilistic Structure from Sound and Sound Source Localization	en
dc.type	Thesis
dc.date.schoolyear	96-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	郭振華(Jen-Hwa Guo),黃寶儀(Polly Huang),胡竹生(Jwu-Sheng Hu)
dc.subject.keyword	麥克風陣列,聲音估計結構,聲源定位,	zh_TW
dc.subject.keyword	microphone arrays,structure from sound,sound source localization,	en
dc.relation.page	33
dc.rights.note	有償授權
dc.date.accepted	2008-07-31
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	資訊工程學研究所	zh_TW
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-97-1.pdf 未授權公開取用	3.92 MB	Adobe PDF

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。