Skip navigation

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets

Learn More
DSpace logo
English
中文
  • Browse
    • Communities
      & Collections
    • Publication Year
    • Author
    • Title
    • Subject
    • Advisor
  • Search TDR
  • Rights Q&A
    • My Page
    • Receive email
      updates
    • Edit Profile
  1. NTU Theses and Dissertations Repository
  2. 電機資訊學院
  3. 資訊工程學系
Please use this identifier to cite or link to this item: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101162
Title: 透過聲紋事件偵測豬隻健康狀況
Detecting Pig Health Conditions through Acoustic Event Detection
Authors: 陳威宇
Wei-Yu Chen
Advisor: 張智星
Jyh-Shing Roger Jang
Keyword: 豬隻呼吸疾病檢測,聲學事件檢測原型網路k-平均演算法注意力機制機器學習深度學習
pig respiratory health,acoustic event detectionprototypical networksk-means clusteringattention mechanismsmachine learningdeep learning
Publication Year : 2025
Degree: 碩士
Abstract: 本研究提出一個基於聲音的豬隻健康檢測系統,利用聲學事件來評估呼吸狀況,並實現畜牧業中的早期疾病檢測。我們透過放置在豬隻背部的聽診器捕捉呼吸聲音,並使用梅爾頻譜圖 (Mel spectrogram) 分析結合先進機器學習技術處理音頻數據。本研究解決了畜牧業聲學監測中的挑戰,包括嘈雜的音頻環境、呼吸資料的不足與類別不平衡,以及有限標註數據的問題。為克服這些限制,我們提出結合 k-means clustering 和 query attention 機制的 prototypical network 方法,並嘗試傳統機器學習模型,包括高斯混合模型 (GMM)、隱馬可夫模型 (HMM) 和梯度提升 (XGBoost, LightGBM) 等演算法作為對比模型。我們的實驗框架評估了多種編碼器架構,從傳統的特徵基礎方法到複雜的深度學習模型,包括卷積神經網路 (convolutional neural network, CNN)、Transformers 和四種預訓練聲學模型。本研究所提出的方法雖遜色於 baseline, 但其中 query attention 搭配 k-means 原型網路依舊展現優異性能,達到類似基準模型的效果,展現出模型成功區分不同的呼吸模式,實現非侵入性即時健康監測。
This study presents a sound-based health monitoring system for pigs using acoustic event detection to assess respiratory conditions and enable early disease detection in livestock farming. The system captures breathing sounds through stethoscopes placed on pigs' backs and processes the audio data using Mel spectrogram analysis combined with advanced machine learning techniques. We propose novel methodological approaches incorporating prototypical networks with k-means clustering and attention mechanisms, alongside traditional machine learning models including Gaussian mixture models, hidden Markov models, and gradient boosting algorithms. Our experimental framework evaluates multiple encoder architectures, ranging from traditional feature-based approaches to sophisticated deep learning models including Convolutional Neural Networks, Transformers, and pretrained audio models. The proposed prototypical network approaches demonstrate poorer performance than the deep learning baseline, but the k-means prototypical network with query attention shows a more stable result than the k-means prototypical network without the query attention mechanism when K is greater than 2, showing the effectiveness of how query attention is able to stabilize the k prototypes, while the linear encoder baseline approach achieves the best overall classification results. The system successfully distinguishes between different respiratory patterns, enabling non-invasive real-time health monitoring that reduces animal stress while improving disease detection efficiency.
URI: http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/101162
DOI: 10.6342/NTU202502218
Fulltext Rights: 同意授權(全球公開)
metadata.dc.date.embargo-lift: 2026-01-01
Appears in Collections:資訊工程學系

Files in This Item:
File SizeFormat 
ntu-114-1.pdf2.6 MBAdobe PDFView/Open
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

社群連結
聯絡資訊
10617臺北市大安區羅斯福路四段1號
No.1 Sec.4, Roosevelt Rd., Taipei, Taiwan, R.O.C. 106
Tel: (02)33662353
Email: ntuetds@ntu.edu.tw
意見箱
相關連結
館藏目錄
國內圖書館整合查詢 MetaCat
臺大學術典藏 NTU Scholars
臺大圖書館數位典藏館
本站聲明
© NTU Library All Rights Reserved