請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/88513
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 賴飛羆 | zh_TW |
dc.contributor.advisor | Fei-Pei Lai | en |
dc.contributor.author | 朱婉瑄 | zh_TW |
dc.contributor.author | Wan-Xuan Zhu | en |
dc.date.accessioned | 2023-08-15T16:38:11Z | - |
dc.date.available | 2023-11-09 | - |
dc.date.copyright | 2023-08-15 | - |
dc.date.issued | 2023 | - |
dc.date.submitted | 2023-07-26 | - |
dc.identifier.citation | [1] John Dubnowski, Ronald Schafer, and Lawrence Rabiner, "Real-time digital hardware pitch detector." IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, no. 1, 1976, pp. 2-8.
[2] Myron Ross, Harry Shaffer, Andrew Cohen, Richard Freud- berg, and Harold Manley, "Average magnitude difference function pitch extractor." IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 22, no. 5, 1974, pp. 353-362. [3] De Cheveigné, Alain, and Hideki Kawahara. "YIN, a fundamental frequency estimator for speech and music." The Journal of the Acoustical Society of America vol. 111, no. 4, 2002, pp. 1917-1930. [4] Mauch, Matthias, and Simon Dixon. "pYIN: A fundamental frequency estimator using probabilistic threshold distributions." 2014 ieee international conference on acoustics, speech and signal processing (icassp). IEEE, 2014. [5] D. Talkin, "A robust algorithm for pitch tracking (RAPT)" Speech Coding and Synthesis, 1995, pp. 495-518. [6] P. Boersma and P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," IFA Proceedings vol. 17, 1993, pp. 97--110. [7] Arturo Camacho and John G Harris, "A sawtooth waveform inspired pitch estimator for speech and music," The Journal of the Acoustical Society of America, vol. 124, no. 3, 2008, pp. 1638-1652. [8] P. Martin, “Comparison of pitch detection by cepstrum and spectral comb analysis,” in ICASSP, 1982, pp. 180–183. [9] T. Ramabadran, A. Sorin, M. McLaughlin, D. Chazan, D. Pearce, and R. Hoory, "The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction," in ICASSP, vol. 1. IEEE, 2004, pp. I-53-6. [10] Kawahara, Hideki, et al. "Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT." Ninth European Conference on Speech Communication and Technology, 2005. [11] Kim, Jong Wook, et al. "Crepe: A convolutional representation for pitch estimation." 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2018. [12] Gfeller, Beat, et al. "SPICE: Self-supervised pitch estimation." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2020, pp. 1118-1128. [13] McFee, Brian, et al. "librosa: Audio and music signal analysis in python." Proceedings of the 14th python in science conference. vol. 8. 2015. [14] Harris, Fredric J. "On the use of windows for harmonic analysis with the discrete Fourier transform." Proceedings of the IEEE vol. 66, no. 1, 1978, pp. 51-83. [15] D. Jouvet and Y. Laprie, “Performance Analysis of Several Pitch Detection Algorithms on Simulated and Real Noisy Speech Data,” in EUSIPCO, European Signal Processing Conference, 2017. [16] Strömbergsson, Sofia. "Today's Most Frequently Used F0 Estimation Methods, and Their Accuracy in Estimating Male and Female Pitch in Clean Speech." Interspeech, 2016. [17] Babacan, Onur, et al. "A comparative study of pitch extraction algorithms on a large variety of singing sounds." 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2013. [18] Von Dem Knesebeck, Adrian, and Udo Zölzer. "Comparison of pitch trackers for real-time guitar effects." Proc. of the 13th Int. Conference on Digital Audio Effects, 2010. [19] Laroche, Jean, and Mark Dolson. "Improved phase vocoder time-scale modification of audio." IEEE Transactions on Speech and Audio processing vol. 7, no. 3, 1999, pp. 323-332. | - |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/88513 | - |
dc.description.abstract | 本研究的目的是開發一個系統,幫助演奏家練習單音曲,同時減輕音樂老師的負擔。
我們提出了一個創新的解決方案,讓學生在不依賴音樂老師即時指導的情況下,隨時隨地進行練習。學生只需錄製自己的演奏並上傳到系統中,系統將自動分析並提供即時的音高評估和回饋。這樣一來,學生可以更靈活地安排練習時間,並在需要時獲得準確的指導。 在方法方面,我們使用了離散餘弦轉換和峰值檢測演算法。DCT用於將錄音數據從時域轉換為頻域表示,並且峰值檢測算法用於檢測頻譜中的主要音高。 研究結果顯示,系統具有出色的音高準確性和一致性。據我們的結果,以範例的Mazas No. 5練習曲與MIDI計算相異,系統的平均音高誤差小,並具有低標準差。 這個系統的開發不僅對學生有益,同時也能減輕音樂老師的負擔。音樂老師可以專注於更有價值的指導和支持方面,提供更專業和個性化的指導。 | zh_TW |
dc.description.abstract | The purpose of this study is to develop a system (ViolinPitch) that helps musicians practice solo pieces while reducing the burden on music teachers, allowing them to focus on providing more accurate and valuable guidance.
We propose an innovative solution that enables students to practice anytime, anywhere without relying on immediate guidance from a music teacher. Students simply record their performance and upload it to the system, which will automatically analyze and provide real-time pitch assessment and feedback. This allows students to have more flexibility in arranging practice time and receive accurate guidance when needed. In terms of methodology, we utilize the Discrete Cosine Transform (DCT) and the peak detection algorithm. The DCT is used to transform the recorded audio data from the time domain to the frequency domain representation, while the peak detection algorithm is employed to identify the main pitches in the spectrum. Based on our findings, the system exhibits a small average pitch error for the Mazas No. 5 etude compared to MIDI calculations, with a low standard deviation. The development of this system not only benefits students but also alleviates the burden on music teachers. It enables music teachers to focus on providing more valuable guidance and support, delivering more professional and personalized instruction, and better meeting the needs of their students. | en |
dc.description.provenance | Submitted by admin ntu (admin@lib.ntu.edu.tw) on 2023-08-15T16:38:11Z No. of bitstreams: 0 | en |
dc.description.provenance | Made available in DSpace on 2023-08-15T16:38:11Z (GMT). No. of bitstreams: 0 | en |
dc.description.tableofcontents | 致謝 iii
中文摘要 iv ABSTRACT v CONTENTS vii Chapter 1 Introduction 1 1.1 Background 1 1.2 Related Works 1 Chapter 2 Method 4 2.1 Recording 4 2.2 Preprocessing Data According to Music Theory 4 2.2.1 Generating a New Sheet Music Chart 4 2.2.2 Calculate the Number of Samples for One Sixteenth Note 5 2.2.3 Determining the Starting Position of a Music Performance Using a Metronome Signal 6 2.2.4 Selecting Stable Audio Signals 7 2.3 Discrete Cosine Transform 8 2.3.1 The Hanning Window 8 2.3.2 Zero-padding in Time Domain 10 2.3.3 Discrete Cosine Transform 10 2.3.4 Converting Amplitude to Decibels 12 2.4 Peak Detection Algorithm 13 2.4.1 Implementation of the Peak Detection Algorithm 13 2.4.2 Correspondence between the resulting signal after DCT and its frequency in Hz 14 Chapter 3 Result 16 3.1 Comparison of Detected Frequencies and MIDI Ground Truth for Mazas No. 5 Audio Recording 16 3.2 System Design and Functionality of the Pitch Detection System on iOS Devices 24 Chapter 4 Discussion 30 4.1 Principal Findings 30 4.2 Limitations 31 Chapter 5 Conclusion 35 REFERENCE 36 Development Environment and Tools 40 | - |
dc.language.iso | en | - |
dc.title | 基於離散餘弦轉換之音高偵測系統 | zh_TW |
dc.title | Pitch Detection System Based on Discrete Cosine Transform | en |
dc.type | Thesis | - |
dc.date.schoolyear | 111-2 | - |
dc.description.degree | 碩士 | - |
dc.contributor.coadvisor | 陳怡茹 | zh_TW |
dc.contributor.coadvisor | Yi-ju Chen | en |
dc.contributor.oralexamcommittee | 趙坤茂;許凱平;顏廷聿 | zh_TW |
dc.contributor.oralexamcommittee | Kun-Mao Chao;Kai-Ping Hsu;Ting-Yu Yen | en |
dc.subject.keyword | 小提琴,樂器,音高偵測,離散餘弦轉換,峰值檢測演算法,iOS 作業系統應用程式開發, | zh_TW |
dc.subject.keyword | Violin,Instrument,Pitch detection,Discrete Cosine Transform,DCT,Peak detection algorithm,iOS application development, | en |
dc.relation.page | 40 | - |
dc.identifier.doi | 10.6342/NTU202301694 | - |
dc.rights.note | 未授權 | - |
dc.date.accepted | 2023-07-27 | - |
dc.contributor.author-college | 電機資訊學院 | - |
dc.contributor.author-dept | 資訊網路與多媒體研究所 | - |
顯示於系所單位: | 資訊網路與多媒體研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-111-2.pdf 目前未授權公開取用 | 3.52 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。