請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/83797| 標題: | 基於強化學習和基底核-丘腦動態網路之帕金森氏症閉迴路深腦電刺激演算法 Closed-loop Deep Brain Stimulation Algorithm for Parkinson's Disease based on Reinforcement Learning and Basal Ganglia-Thalamus Network Dynamics |
| 作者: | Chia-Hung Cho 卓嘉虹 |
| 指導教授: | 林啟萬(Chii-Wann Lin) |
| 關鍵字: | 基底核-丘腦網路,閉迴路深腦電刺激,帕金森氏症,強化學習,獎勵塑造, basal ganglia-thalamic (BGT) network,closed-loop deep brain stimulation (cl-DBS),Parkinson’s disease (PD),reinforcement learning (RL),reward shaping, |
| 出版年 : | 2022 |
| 學位: | 碩士 |
| 摘要: | 帕金森氏症 (Parkinson’s disease, PD) 是一種影響中樞神經系統的慢性神經 退化性疾病,目前影響全球約一千萬人 [1]。深腦電刺激 (deep brain stimulation, DBS) 的技術在運動障礙和神經系統疾病中的應用,包括 PD、震顫、肌張力障礙、癲癇、強迫症等,已被證明是一種有效的治療方式 [2]。然而,廣泛使用的 開迴路系統仍然存在一些尚待修正的缺點,例如它們的個體依賴性、能量消耗 程度、頻繁回診和試錯性調整的特徵 [3]。閉迴路的策略採用具有判別性的訊 號或生物標誌物,從而使系統能夠透過算法自動調整 DBS 參數 [3]。我們設計強化學習 (reinforcement learning, RL) 與 Gym 框架,模擬基底神經節-丘腦 (basal ganglia-thalamic, BGT) 大腦網路作為訓練環境,並為任何輸入狀態找到適當的刺激參數(頻率和振幅)。特徵提取模塊則作為 BGT 大腦網路(動作電位訊號)與來自真實大腦的胞外訊號之間的映射工具,進而允許未來的動物實驗和臨床試 驗的測試。結果顯示,基於RL的DBS控制策略在能秏上較開迴路系統節省了 68.81% 的平均功率,並修正丘腦(thalamus, TH)中的錯誤響應(平均錯誤響應在正常情況為 0.0; 在PD下修正回 0.0258),同時為未來應用奠定了基礎。 Parkinson’s disease (PD) is a chronic neurodegenerative disease affecting the central nervous system and currently influencing about 10 million people worldwide [1]. The usage of deep brain stimulation (DBS) technology in movement disorders and neurological diseases, including PD, tremor, dystonia, epilepsy, obsessive-compulsive disorder (OCD), etc., has proven to be an effective treatment modality [2]. However, general open-loop systems pose several shortcomings that have yet to be revised, such as their subject dependency, energy-consuming, frequent-clinic visiting, and trial-and-error adjusting features [3]. The closed-loop strategy employs discriminative signals/biomarkers to enable the system to tune parameters automatically through the designed algorithms [3]. We designed reinforcement learning (RL) with the Gym framework that models the basal ganglia-thalamic (BGT) brain network as a training environment and finds appropriate stimulation parameters (frequency and amplitude) for different input states. The feature extraction module was a mapping tool between the BGT brain network (AP signals) and extracellular signals from real brains, permitting future animal experiments and clinical trials. Results showed that the RL-based DBS control strategy significantly outperforms open-loop systems in energy efficiency, i.e., conserving 68.81% of average power dissipation, and revises error responses in the thalamus (i.e., an average EI of 0.0 in normal and 0.0258 in PD states) while establishing a foundation for future application. |
| URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/83797 |
| DOI: | 10.6342/NTU202200118 |
| 全文授權: | 未授權 |
| 顯示於系所單位: | 醫學工程學研究所 |
文件中的檔案:
| 檔案 | 大小 | 格式 | |
|---|---|---|---|
| U0001-2001202216034000.pdf 未授權公開取用 | 24.3 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。
