Please use this Handle URI to cite this document:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/56872
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 貝蘇章(Soo-Chang Pei) | |
dc.contributor.author | An-Jan Lo | en |
dc.contributor.author | 羅安然 | zh_TW |
dc.date.accessioned | 2021-06-16T06:30:34Z | - |
dc.date.available | 2014-08-17 | |
dc.date.copyright | 2014-08-17 | |
dc.date.issued | 2014 | |
dc.date.submitted | 2014-08-07 | |
dc.identifier.citation | [1] Baris Bozkurt, “Zeros of the z-transform (ZZT) representation and chirp group delay processing for the analysis of source and filter characteristics of speech signals”, PhD thesis supervised by Prof. T. Dutoit, 2005.
[2] G. Fant, J. Liljencrants, and Q. Lin, “A four-parameter model of glottal flow”, STL-QPSR, vol. 26, no. 4, pp. 1-13, 1985.
[3] T. Drugman, T. Dutoit, “Glottal Closure and Opening Instant Detection from Speech Signals”, Interspeech, 2009.
[4] Dr. Peter Birkholz, Vocal Tract Lab, http://www.vocaltractlab.de/index.php?page=birkholz-contact
[5] T. Drugman, B. Bozkurt, T. Dutoit, “Complex Cepstrum-based Decomposition of Speech for Glottal Source Estimation”, Interspeech, 2009.
[6] T. Drugman, B. Bozkurt, T. Dutoit, “Causal–anticausal decomposition of speech using complex cepstrum for glottal source estimation”, Speech Communication, 2011.
[7] B. Yegnanarayana, “Design of recursive group-delay filters by autoregressive modeling”, Acoustics, Speech and Signal Processing, IEEE Trans., 1982.
[8] Paavo Alku and Jouni Pohjalainen, “Formant frequency estimation of high-pitched vowels using weighted linear prediction”, Interspeech, 2012.
[9] Gilles Degottex, “Glottal source and vocal-tract separation: estimation of glottal parameters, voice transformation and synthesis using a glottal model”, UPMC, 2010.
[10] B. Yegnanarayana, D. K. Saikia, and T. R. Krishnan, “Significance of Group Delay Functions in Signal Reconstruction from Spectral Magnitude or Phase”, Acoustics, Speech and Signal Processing, IEEE Trans., 1984.
[11] T. Drugman, B. Bozkurt, T. Dutoit, “Chirp Decomposition of Speech Signals for Glottal Source Estimation”, 2009.
[12] T. Drugman, B. Bozkurt, T. Dutoit, “A Comparative Study of Glottal Source Estimation Techniques”, Computer Speech & Language, 2012. | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/56872 | - |
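dc.description.abstract | Speech processing occupies an important place in signal processing research. Human speech production can be modeled as the convolution of two parts: the excitation signal emitted by the glottis and the filter formed by the physical structure of the vocal tract. Studying the details of this process and obtaining speech feature parameters has applications in many areas, such as speech synthesis and transformation. In this thesis, we discuss the zeros of the z-transform (ZZT) algorithm and its application to extracting the speech excitation signal. After the z-transform, the zeros of the speech signal reveal its mixed-phase property on the z-plane (zeros lie both inside and outside the circle of radius 1), which can be used to separate the speech excitation signal from the vocal tract filter. In addition, a study of ZZT plots shows that the phase information obtained from the group delay function can be greatly improved by the chirp group delay method, which in turn yields the formant peaks of the vocal tract filter. The results are compared with existing speech processing tools, and the improvement that the Attenuated Main Excitation (AME) method brings to formant peak extraction is tested. | zh_TW |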
dc.description.abstract | Speech processing is one of the major topics in signal processing research. Speech production can be modeled as the convolution of a glottal source excitation with a vocal tract filter. Studying the details of speech production and extracting its characteristic parameters has applications in fields such as speech synthesis and transformation. In this thesis, we discuss the zeros of the z-transform (ZZT) algorithm developed by Dr. Baris Bozkurt [1] and its application to extracting the excitation pulse in the source-tract model of human speech signals. After the z-transform, the zeros of the speech signal can be plotted on the z-plane, revealing its mixed-phase property, which is then used for source-tract separation. In addition, a study of the ZZT plot shows that the phase information obtained from the group delay spectrum can be substantially improved by using the chirp group delay. Finally, we present the formant-tracking capabilities of ZZT, compare its performance with that of other speech signal processing tools, and apply Attenuated Main Excitation (AME) for further improvement. | en |
dc.description.provenance | Made available in DSpace on 2021-06-16T06:30:34Z (GMT). No. of bitstreams: 1 ntu-103-R96942127-1.pdf: 5971675 bytes, checksum: 12cfadfa9d7dfed6db009a656bc36250 (MD5) Previous issue date: 2014 | en |
dc.description.tableofcontents | Chinese Abstract i
ABSTRACT ii
LIST OF FIGURES v
LIST OF TABLES ix
Chapter 1 Introduction 1
1.1 Motivation 1
1.2 Background 1
1.3 LF model of speech voice 4
1.4 ZZT representation of speech signals 5
Chapter 2 ZZT Algorithm 7
2.1 Definition 7
2.2 ZZT of Glottal Signal 7
2.3 Windowing effects on ZZT 10
2.4 Glottal Closure Instant (GCI) Detection 12
2.5 Conclusion 14
Chapter 3 The Source Excitation Extraction using ZZT 15
3.1 Introduction 15
3.2 ZZT Decomposition 15
3.3 Complex Cepstrum 20
3.4 Test with noise 24
3.5 Chirp decomposition 27
3.6 Conclusion 30
Chapter 4 Applications of ZZT Chirp Group Delay and Formant Tracking 31
4.1 Definition 31
4.2 Application in formant tracking 34
4.2.1 Spectrogram 35
4.2.2 Hilbert-Huang Transform 36
4.2.3 Chirp Group Delay of Zero-Phase Version Signal (CGDZP) 37
4.2.4 Disturbance of High-Pitched Frequency to Formant Tracking 42
4.2.5 Advanced Test for CGDZP and Praat 44
4.2.6 The Effect of Attenuated Main Excitation (AME) on CGDZP 52
4.3 Conclusion 58
Chapter 5 Conclusion and Future Works 59
References 61 | |
dc.language.iso | en | |
dc.title | ZZT演算法之應用於語音激發信號擷取 (Application of the ZZT Algorithm to Speech Excitation Signal Extraction) | zh_TW |
dc.title | The Source Excitation Extraction of Speech Signal Using ZZT Method | en |
dc.type | Thesis | |
dc.date.schoolyear | 102-2 | |
dc.description.degree | Master's | |
dc.contributor.oralexamcommittee | 李琳山(Lin-Shan Li),丁建均(Jian-Jiun Ding) | |
dc.subject.keyword | ZZT, speech excitation/vocal tract signal separation, group delay function, speech formant peak extraction | zh_TW |
dc.subject.keyword | ZZT, source-tract separation, group delay function, formant extraction | en |
dc.relation.page | 62 | |
dc.rights.note | Paid license | |
dc.date.accepted | 2014-08-08 | |
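dc.contributor.author-college | College of Electrical Engineering and Computer Science | zh_TW |
dc.contributor.author-dept | Graduate Institute of Communication Engineering | zh_TW |
Appears in collections: | Graduate Institute of Communication Engineering |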
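
The abstract describes separating a speech frame into source and filter parts according to whether its z-transform zeros lie inside or outside the unit circle. The following is a minimal sketch of that idea on a toy frame, not the thesis's implementation. It assumes NumPy, and the function names `zzt` and `split_by_unit_circle` are hypothetical. The windowing and glottal-closure-instant alignment that the table of contents lists as part of the method are omitted here.

```python
import numpy as np

def zzt(frame):
    """Zeros of the z-transform of a finite frame x[0..N-1] (hypothetical helper)."""
    x = np.asarray(frame, dtype=float)
    # X(z) = sum_n x[n] z^(-n). Multiplying by z^(N-1) gives a polynomial in z
    # whose coefficients, highest power first, are x[0], x[1], ..., x[N-1],
    # which is exactly the coefficient order numpy.roots expects.
    return np.roots(x)

def split_by_unit_circle(zeros):
    """Split zeros into those inside and those on/outside the unit circle."""
    zeros = np.asarray(zeros)
    radius = np.abs(zeros)
    inside = zeros[radius < 1.0]     # causal (minimum-phase) contribution
    # Assumption: zeros exactly on the circle are grouped with the outside set.
    outside = zeros[radius >= 1.0]   # anticausal (maximum-phase) contribution
    return inside, outside

# Toy mixed-phase frame built from known zeros: two inside, one outside.
known_zeros = [0.5, -0.8, 2.0]
frame = np.poly(known_zeros)         # coefficients of (z-0.5)(z+0.8)(z-2.0)

inside, outside = split_by_unit_circle(zzt(frame))

# Rebuild one short sequence from each group of zeros.
causal_part = np.real(np.poly(inside))
anticausal_part = np.real(np.poly(outside))

# Convolving the two parts recovers the frame (its leading sample is 1 here).
reconstructed = np.convolve(causal_part, anticausal_part)
```

In this toy case the two parts are the polynomials built from each group of zeros, and their convolution reproduces the original frame. For real speech frames of several hundred samples, root finding on a high-order polynomial is numerically demanding, so this should be read as an illustration of the principle only.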
Files in this item:
File | Size | Format | |
---|---|---|---|
ntu-103-1.pdf (not currently authorized for public access) | 5.83 MB | Adobe PDF |
Unless their copyright terms are specifically indicated, all items in the system are protected by copyright, with all rights reserved.