Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/48330
Title: 利用稀疏表示法之自動鋼琴轉譜
Automatic Transcription of Piano Music by Sparse Representation
Authors: Cheng-Te Lee
李政德
Advisor: 李明穗(Ming-Sui Lee)
Co-Advisor: 陳宏銘(Homer H. Chen)
Keyword: 基頻偵測,音高偵測,稀疏表示法,自動轉譜,
automatic music transcription,F0 estimation,multiple pitch estimation,sparse representation,l1-regularized minimization,
Publication Year : 2011
Degree: 碩士
Abstract: 音高資訊和其他中階特徵被視為跨越語意隔閡的關鍵資訊,因其與人類感知十分密切相關。本篇論文提出了一個極為有效的解決方法來估計鋼琴音樂中的音高資訊,並將此資訊轉為樂譜。實驗結果證實,我們的系統其轉譜準確率目前居世界領先地位。我們將轉譜問題以頻譜的稀疏表示法來解決。我們所提出的系統首先找到有可能出現的音高,再透過解l1最小化問題及隱馬可夫模型,抽取出鋼琴音樂中每個音符的音高及持續時間,完成轉譜。
Like rhythm and timbre, pitch as a mid-level music feature holds the promise of bridging the well-known semantic gap between low-level features and high-level semantics of music. Pitch estimation is an important first step towards this ultimate goal. In this thesis, we target the extraction of multiple pitch contours from piano music signals. Specifically, the pitch estimation is formulated as a sparse representation problem, in which the feature vector of a piano music segment (or frame) is represented as a linear combination of the feature vectors of individual piano notes. The note candidates of the input piano music segment are determined according to the harmonic structure of piano sounds. Then, the sparse representation problem is solved by l1-regularized minimization. A post-processing method based on hidden Markov models (HMMs) is applied to the resulting sparse vector for accuracy refinement. The system performance is evaluated using l1 classical music recordings of a real piano. The results show that the proposed system outperforms three state-of-the-art systems.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/48330
Fulltext Rights: 有償授權
Appears in Collections:資訊工程學系

Files in This Item:
File SizeFormat 
ntu-100-1.pdf
  Restricted Access
1.31 MBAdobe PDF
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved