  1. NTU Theses and Dissertations Repository
  2. College of Electrical Engineering and Computer Science
  3. Graduate Institute of Communication Engineering
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/98383
Title: Vocal2Fail:演奏不佳程度可控的人聲至破音直笛風格轉換
Vocal2Fail: Controllable Timbre Transfer and Evaluation for Failed Recorder Style
Authors: 鍾乙綾
I-Ling Chung
Advisor: 吳沛遠
Pei-Yuan Wu
Keyword: failed music, timbre transfer, attribute vector, objective metrics, query by singing/humming
Publication Year: 2025
Degree: Master's
Abstract: Timbre transfer aims to change the timbre of an input audio signal while preserving its content. In this work, we investigate failed-music timbre transfer in depth and develop a vocal-to-failed-recorder timbre transfer system that uses an attribute vector to control the degree of poor performance. To address the lack of objective evaluation criteria in timbre transfer, particularly for failed music, we introduce a set of objective metrics: Harmonics-to-Noise Ratio (HNR) for capturing pathological sound traits, Dynamic Time Warping (DTW) distance for assessing pitch contour consistency, and Query by Singing/Humming (QbSH)-based metrics for quantifying how well melodic identity is preserved. Experiments show that these metrics align well with human perception and effectively reflect the controllable degree of poor performance. Our work offers a robust foundation for evaluating and controlling performance degradation in timbre transfer tasks.
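To make the pitch-contour metric concrete, here is a minimal sketch of a DTW distance between two pitch contours. It is an illustration only: the abstract does not specify the thesis's pitch tracker, local cost, step pattern, or normalization, so the function names `hz_to_semitones` and `dtw_distance`, the absolute-difference cost, and the 440 Hz reference are all assumptions. The sketch also assumes the contours contain voiced frames only (no zero-Hz values).

```python
import math


def hz_to_semitones(f0_hz, ref_hz=440.0):
    """Convert a voiced pitch contour from Hz to semitones relative to ref_hz.

    Assumption: every value in f0_hz is positive (unvoiced frames removed).
    """
    return [12.0 * math.log2(f / ref_hz) for f in f0_hz]


def dtw_distance(a, b):
    """Accumulated DTW cost between two 1-D sequences.

    Uses absolute difference as the local cost and the standard
    step pattern (insertion, deletion, match). This is a textbook
    formulation, not necessarily the one used in the thesis.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b holds
                cost[i][j - 1],      # b advances, a holds
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# Hypothetical contours in semitones: the same melody with different timing.
source = [0.0, 0.0, 2.0, 3.0]
converted = [0.0, 2.0, 2.0, 3.0]
print(dtw_distance(source, converted))
```

Because DTW allows frames to be stretched or compressed in time, two contours that trace the same notes with different timing get a low distance, while a contour whose pitch drifts away from the source gets a higher one. Working in semitones rather than Hz keeps the cost comparable across registers.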
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/98383
DOI: 10.6342/NTU202502619
Fulltext Rights: Authorized (open access worldwide)
Embargo Lift Date: 2025-08-06
Appears in Collections: Graduate Institute of Communication Engineering

Files in This Item:
File            Size      Format
ntu-113-2.pdf   4.21 MB   Adobe PDF