Skip navigation

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料(如:文字、圖片、PDF)並使其易於取用。

點此認識 DSpace
DSpace logo
English
中文
  • 瀏覽論文
    • 校院系所
    • 出版年
    • 作者
    • 標題
    • 關鍵字
  • 搜尋 TDR
  • 授權 Q&A
  • 幫助
    • 我的頁面
    • 接受 E-mail 通知
    • 編輯個人資料
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊網路與多媒體研究所
請用此 Handle URI 來引用此文件: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/88513
完整後設資料紀錄
DC 欄位值語言
dc.contributor.advisor賴飛羆zh_TW
dc.contributor.advisorFei-Pei Laien
dc.contributor.author朱婉瑄zh_TW
dc.contributor.authorWan-Xuan Zhuen
dc.date.accessioned2023-08-15T16:38:11Z-
dc.date.available2023-11-09-
dc.date.copyright2023-08-15-
dc.date.issued2023-
dc.date.submitted2023-07-26-
dc.identifier.citation[1] John Dubnowski, Ronald Schafer, and Lawrence Rabiner, "Real-time digital hardware pitch detector." IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 24, no. 1, 1976, pp. 2-8.
[2] Myron Ross, Harry Shaffer, Andrew Cohen, Richard Freud- berg, and Harold Manley, "Average magnitude difference function pitch extractor." IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 22, no. 5, 1974, pp. 353-362.
[3] De Cheveigné, Alain, and Hideki Kawahara. "YIN, a fundamental frequency estimator for speech and music." The Journal of the Acoustical Society of America vol. 111, no. 4, 2002, pp. 1917-1930.
[4] Mauch, Matthias, and Simon Dixon. "pYIN: A fundamental frequency estimator using probabilistic threshold distributions." 2014 ieee international conference on acoustics, speech and signal processing (icassp). IEEE, 2014.
[5] D. Talkin, "A robust algorithm for pitch tracking (RAPT)" Speech Coding and Synthesis, 1995, pp. 495-518.
[6] P. Boersma and P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," IFA Proceedings vol. 17, 1993, pp. 97--110.
[7] Arturo Camacho and John G Harris, "A sawtooth waveform inspired pitch estimator for speech and music," The Journal of the Acoustical Society of America, vol. 124, no. 3, 2008, pp. 1638-1652.
[8] P. Martin, “Comparison of pitch detection by cepstrum and spectral comb analysis,” in ICASSP, 1982, pp. 180–183.
[9] T. Ramabadran, A. Sorin, M. McLaughlin, D. Chazan, D. Pearce, and R. Hoory, "The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction," in ICASSP, vol. 1. IEEE, 2004, pp. I-53-6.
[10] Kawahara, Hideki, et al. "Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT." Ninth European Conference on Speech Communication and Technology, 2005.
[11] Kim, Jong Wook, et al. "Crepe: A convolutional representation for pitch estimation." 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2018.
[12] Gfeller, Beat, et al. "SPICE: Self-supervised pitch estimation." IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 2020, pp. 1118-1128.
[13] McFee, Brian, et al. "librosa: Audio and music signal analysis in python." Proceedings of the 14th python in science conference. vol. 8. 2015.
[14] Harris, Fredric J. "On the use of windows for harmonic analysis with the discrete Fourier transform." Proceedings of the IEEE vol. 66, no. 1, 1978, pp. 51-83.
[15] D. Jouvet and Y. Laprie, “Performance Analysis of Several Pitch Detection Algorithms on Simulated and Real Noisy Speech Data,” in EUSIPCO, European Signal Processing Conference, 2017.
[16] Strömbergsson, Sofia. "Today's Most Frequently Used F0 Estimation Methods, and Their Accuracy in Estimating Male and Female Pitch in Clean Speech." Interspeech, 2016.
[17] Babacan, Onur, et al. "A comparative study of pitch extraction algorithms on a large variety of singing sounds." 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2013.
[18] Von Dem Knesebeck, Adrian, and Udo Zölzer. "Comparison of pitch trackers for real-time guitar effects." Proc. of the 13th Int. Conference on Digital Audio Effects, 2010.
[19] Laroche, Jean, and Mark Dolson. "Improved phase vocoder time-scale modification of audio." IEEE Transactions on Speech and Audio processing vol. 7, no. 3, 1999, pp. 323-332.
-
dc.identifier.urihttp://tdr.lib.ntu.edu.tw/jspui/handle/123456789/88513-
dc.description.abstract  本研究的目的是開發一個系統,幫助演奏家練習單音曲,同時減輕音樂老師的負擔。
  我們提出了一個創新的解決方案,讓學生在不依賴音樂老師即時指導的情況下,隨時隨地進行練習。學生只需錄製自己的演奏並上傳到系統中,系統將自動分析並提供即時的音高評估和回饋。這樣一來,學生可以更靈活地安排練習時間,並在需要時獲得準確的指導。
  在方法方面,我們使用了離散餘弦轉換和峰值檢測演算法。DCT用於將錄音數據從時域轉換為頻域表示,並且峰值檢測算法用於檢測頻譜中的主要音高。
  研究結果顯示,系統具有出色的音高準確性和一致性。據我們的結果,以範例的Mazas No. 5練習曲與MIDI計算相異,系統的平均音高誤差小,並具有低標準差。
  這個系統的開發不僅對學生有益,同時也能減輕音樂老師的負擔。音樂老師可以專注於更有價值的指導和支持方面,提供更專業和個性化的指導。
zh_TW
dc.description.abstractThe purpose of this study is to develop a system (ViolinPitch) that helps musicians practice solo pieces while reducing the burden on music teachers, allowing them to focus on providing more accurate and valuable guidance.
We propose an innovative solution that enables students to practice anytime, anywhere without relying on immediate guidance from a music teacher. Students simply record their performance and upload it to the system, which will automatically analyze and provide real-time pitch assessment and feedback. This allows students to have more flexibility in arranging practice time and receive accurate guidance when needed.
In terms of methodology, we utilize the Discrete Cosine Transform (DCT) and the peak detection algorithm. The DCT is used to transform the recorded audio data from the time domain to the frequency domain representation, while the peak detection algorithm is employed to identify the main pitches in the spectrum.
Based on our findings, the system exhibits a small average pitch error for the Mazas No. 5 etude compared to MIDI calculations, with a low standard deviation. The development of this system not only benefits students but also alleviates the burden on music teachers. It enables music teachers to focus on providing more valuable guidance and support, delivering more professional and personalized instruction, and better meeting the needs of their students.
en
dc.description.provenanceSubmitted by admin ntu (admin@lib.ntu.edu.tw) on 2023-08-15T16:38:11Z
No. of bitstreams: 0
en
dc.description.provenanceMade available in DSpace on 2023-08-15T16:38:11Z (GMT). No. of bitstreams: 0en
dc.description.tableofcontents致謝 iii
中文摘要 iv
ABSTRACT v
CONTENTS vii
Chapter 1 Introduction 1
1.1 Background 1
1.2 Related Works 1
Chapter 2 Method 4
2.1 Recording 4
2.2 Preprocessing Data According to Music Theory 4
2.2.1 Generating a New Sheet Music Chart 4
2.2.2 Calculate the Number of Samples for One Sixteenth Note 5
2.2.3 Determining the Starting Position of a Music Performance Using a Metronome Signal 6
2.2.4 Selecting Stable Audio Signals 7
2.3 Discrete Cosine Transform 8
2.3.1 The Hanning Window 8
2.3.2 Zero-padding in Time Domain 10
2.3.3 Discrete Cosine Transform 10
2.3.4 Converting Amplitude to Decibels 12
2.4 Peak Detection Algorithm 13
2.4.1 Implementation of the Peak Detection Algorithm 13
2.4.2 Correspondence between the resulting signal after DCT and its frequency in Hz 14
Chapter 3 Result 16
3.1 Comparison of Detected Frequencies and MIDI Ground Truth for Mazas No. 5 Audio Recording 16
3.2 System Design and Functionality of the Pitch Detection System on iOS Devices 24
Chapter 4 Discussion 30
4.1 Principal Findings 30
4.2 Limitations 31
Chapter 5 Conclusion 35
REFERENCE 36
Development Environment and Tools 40
-
dc.language.isoen-
dc.title基於離散餘弦轉換之音高偵測系統zh_TW
dc.titlePitch Detection System Based on Discrete Cosine Transformen
dc.typeThesis-
dc.date.schoolyear111-2-
dc.description.degree碩士-
dc.contributor.coadvisor陳怡茹zh_TW
dc.contributor.coadvisorYi-ju Chenen
dc.contributor.oralexamcommittee趙坤茂;許凱平;顏廷聿zh_TW
dc.contributor.oralexamcommitteeKun-Mao Chao;Kai-Ping Hsu;Ting-Yu Yenen
dc.subject.keyword小提琴,樂器,音高偵測,離散餘弦轉換,峰值檢測演算法,iOS 作業系統應用程式開發,zh_TW
dc.subject.keywordViolin,Instrument,Pitch detection,Discrete Cosine Transform,DCT,Peak detection algorithm,iOS application development,en
dc.relation.page40-
dc.identifier.doi10.6342/NTU202301694-
dc.rights.note未授權-
dc.date.accepted2023-07-27-
dc.contributor.author-college電機資訊學院-
dc.contributor.author-dept資訊網路與多媒體研究所-
顯示於系所單位:資訊網路與多媒體研究所

文件中的檔案:
檔案 大小格式 
ntu-111-2.pdf
  目前未授權公開取用
3.52 MBAdobe PDF
顯示文件簡單紀錄


系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved