仿射非線性系統之適應最佳追蹤控制器設計

Sheng-Fu Lai; 賴昇甫

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62131

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.advisor	林巍聳
dc.contributor.author	Sheng-Fu Lai	en
dc.contributor.author	賴昇甫	zh_TW
dc.date.accessioned	2021-06-16T13:29:27Z	-
dc.date.available	2016-07-30
dc.date.copyright	2013-07-30
dc.date.issued	2013
dc.date.submitted	2013-07-22
dc.identifier.citation	[1] X. Wang, P. Cui, and H. Cui, 'A note on Poisson stability and controllability,' Nonlinear Dynamics, vol. 66, pp. 789-793, 2011/12/01 2011. [2] A. Isidori and C. I. Byrnes, 'Output regulation of nonlinear systems,' Automatic Control, IEEE Transactions on, vol. 35, pp. 131-140, 1990. [3] S. Devasia, C. Degang, and B. Paden, 'Nonlinear inversion-based output tracking,' Automatic Control, IEEE Transactions on, vol. 41, pp. 930-942, 1996. [4] B. A. Francis and W. M. Wonham, 'The internal model principle of control theory,' Automatica, vol. 12, pp. 457-465, 9// 1976. [5] R. W. Bea, 'Successive Galerkin approximation algorithms for nonlinear optimal and robust control,' International Journal of Control, vol. 71, pp. 717-743, 1998/01/01 1998. [6] W. L. Garrard, D. F. Enns, and S. Antony Snell, 'Nonlinear feedback control of highly manoeuvrable aircraft,' International Journal of Control, vol. 56, pp. 799-812, 1992/10/01 1992. [7] H. T. Banks, B. M. Lewis, and H. T. Tran, 'Nonlinear feedback controllers and compensators: a state-dependent Riccati equation approach,' Computational Optimization and Applications, vol. 37, pp. 177-218, 2007/06/01 2007. [8] D. Wang, D. Liu, and Q. Wei, 'Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach,' Neurocomputing, vol. 78, pp. 14-22, 2/15/ 2012. [9] W.-S. Lin, 'Optimality and convergence of adaptive optimal control by reinforcement synthesis,' Automatica, vol. 47, pp. 1047-1052, 5// 2011. [10] P. J. Webros, 'A menu of designs for reinforcement learning over time,' in Neural networks for control, W. Thomas Miller, III, S. S. Richard, and J. W. Paul, Eds., ed: MIT Press, 1990, pp. 67-95. [11] F. L. Lewis, Optimal control. New York: Wiley, 1986. [12] D. V. Prokhorov and D. C. Wunsch, 'Adaptive critic designs,' Neural Networks, IEEE Transactions on, vol. 8, pp. 997-1007, 1997. [13] S. Haykin, Neural Networks: A Comprehensive Foundation: Prentice Hall PTR, 1998. [14] T. Poggio and F. Girosi, 'Networks for approximation and learning,' Proceedings of the IEEE, vol. 78, pp. 1481-1497, 1990. [15] D. Z. Ronald J. Williams, 'Experimental Analysis of the Real-time Recurrent Learning Algorithm,' Connection Science, vol. 1, pp. 87-111, 1989. [16] F. Moldoveanu, 'Trajectory Tracking Control of a Two-link Robot Manipulator Using Variable Structure System Theory,' Control Engineering and Applied Informatics (CEAI) Journal, vol. 7, 2005.
dc.identifier.uri	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/62131	-
dc.description.abstract	仿射非線性系統是無控制訊號非線性項的非線性系統，其軌跡追蹤控制問題的傳統解法是利用線性化求解，但是仿射非線性系統的線性化設計受到很多限制，而且線性化會降低控制精度也不適用於具有強非線性的系統。本論文採用適應型最佳控制法求解仿射非線性系統的軌跡追蹤最佳控制問題，適應型最佳控制法的特點是用成本函數表示優化的目標，然後把根據最小值原理導出的最佳控制條件式安排成一個強化學習架構，此架構可以透過學習程序建立評議函數，完成學習的評議函數會在強化學習架構之下引導循序優化控制器的程序，循序優化程序收斂之後會得到軌跡追蹤最佳控制器的近似解。本論文針對仿射非線性系統軌跡追蹤最佳控制問題建立適應型最佳控制法的求解步驟，透過許多個應用例的電腦模擬結果檢驗此求解方法的成效，並且與疊代試探動態規劃法的成效相比較，結果顯示適應型最佳控制法確實可以透過學習和循序優化程序，求得仿射非線性系統的軌跡追蹤最佳控制器的近似解，對照疊代試探動態規劃法的成效，適應型最佳控制法的收斂速率明顯的快很多，而且可以容忍比較大的系統模型誤差。	zh_TW
dc.description.provenance	Made available in DSpace on 2021-06-16T13:29:27Z (GMT). No. of bitstreams: 1 ntu-102-R00921015-1.pdf: 5798038 bytes, checksum: cbc0b7275befc8c22be413c9a6b96a3b (MD5) Previous issue date: 2013	en
dc.description.tableofcontents	口試委員會審定書 i 誌謝 iii 中文摘要 v ABSTRACT vi 目錄 vii 圖目錄 ix 表目錄 xi 第1章緒論 1 1.1 研究背景 1 1.2 研究動機 2 1.3 論文架構 3 第2章疊代試探動態規劃法 4 2.1 問題描述 4 2.2 近似動態規劃法 5 2.3 疊代試探動態規劃法 6 2.3.1 iterHDP演算流程 7 2.3.2 iterHDP訓練策略 9 2.3.3 模擬驗證 11 2.3.4 iterHDP總結 17 第3章適應最佳控制法 18 3.1 最佳控制之必要條件 18 3.2 AOC之訓練策略 20 3.3 模擬驗證 23 3.4 幅狀基底類神經網路 27 3.4.1 幅狀基底類神經網路架構 28 3.4.2 幅狀基底類神經網路中心點選取 30 3.4.3 誤差倒傳遞演算法 30 3.4.4 遞迴式類神經網路 32 3.4.5 即時回饋學習演算法 33 第4章模型偏差對AOC和iterHDP的影響 36 4.1 模型偏差對AOC和iterHDP的影響 36 4.1.1 使用類神經網路建模型 36 4.1.2 使用數學模型設計模型偏差 42 4.1.3 使用AOC求解前饋控制 49 4.2 AOC在平面機器人追蹤控制之應用 53 4.2.1 簡介 53 4.2.2 平面機器人之系統模型 53 4.2.3 平面機器人軌跡追蹤控制之模擬驗證 55 4.2.4 模型偏差的影響 58 第5章結論與未來展望 63 參考文獻 65
dc.language.iso	zh-TW
dc.subject	最佳追蹤控制	zh_TW
dc.subject	仿射非線性系統	zh_TW
dc.subject	適應型最佳控制	zh_TW
dc.subject	試探動態規劃法	zh_TW
dc.subject	optimal tracking control	en
dc.subject	affine nonlinear system	en
dc.subject	Adaptive optimal control	en
dc.subject	Heuristic Dynamic Programming	en
dc.title	仿射非線性系統之適應最佳追蹤控制器設計	zh_TW
dc.title	Design of Adaptive Optimal Tracking Controller of Affine Nonlinear Systems	en
dc.type	Thesis
dc.date.schoolyear	101-2
dc.description.degree	碩士
dc.contributor.oralexamcommittee	張志永,蔡超人,鍾鴻源
dc.subject.keyword	仿射非線性系統,適應型最佳控制,試探動態規劃法,最佳追蹤控制,	zh_TW
dc.subject.keyword	affine nonlinear system,Adaptive optimal control,Heuristic Dynamic Programming,optimal tracking control,	en
dc.relation.page	66
dc.rights.note	有償授權
dc.date.accepted	2013-07-22
dc.contributor.author-college	電機資訊學院	zh_TW
dc.contributor.author-dept	電機工程學研究所	zh_TW
顯示於系所單位：	電機工程學系

文件中的檔案：

檔案	大小	格式
ntu-102-1.pdf 未授權公開取用	5.66 MB	Adobe PDF

顯示文件簡單紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。