以機器學習預測城市用水需求之研究

Wei-Chin Liao; 廖偉欽

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/21300

標題:	以機器學習預測城市用水需求之研究 Urban Water Demand Forecasting Using Machine Learning
作者:	Wei-Chin Liao 廖偉欽
指導教授:	張智星(Jyh-Shing Roger Jang)
關鍵字:	機器學習,城市用水需求,lasso regression,ridge regression,random forest,XGBoost,neural network,LSTM,特徵選取, machine learning,urban water demand,lasso regression,ridge regression,random forest,XGBoost,neural network,LSTM,feature selection,
出版年 :	2019
學位:	碩士
摘要:	水資源是國家追求永續發展的關鍵要素，了解未來水資源需求的變化為重要課題，需水量的預測為達此目的的有效方法。本研究為月售水量的預測，屬於短期預測，針對重點為系統操作、供水管理、最佳化供水的決策問題。本研究使用機器學習中neural network、LSTM (long short term memory) 、lasso regression、ridge regression、random forest及XGBoost演算法作為售水量預測方法。以預測基隆市的月售水量為例，結果顯示所實現機器學習演算法都對售水量預測之MAPE (mean absolute percentage error) 皆於3.04%以下，顯示其對售水量能做出不錯的預測。本研究各機器學習方法比較了未經特徵選取和經特徵選取後的模型成效，其中XGBoost在未經特徵選取中的資料表現較好，而random forest則是在經特徵選取後的資料表現較好。綜合而言，對於時間性的資料預測，機器學習的演算法普遍來說能充分運用資料，並儘量抑制overfitting的發生，以達到較高的預測準確度。 Water supply is a key element in a country's pursuit of sustainable development. Analyzing future changes in water demand is essential in optimizing water supply, and algorithmic prediction of water demand is an effective way to achieve this goal. This study aims to forecast water demand on a short-term (monthly) basis. These prediction statistics may allow for advanced water supply management technology by assisting a system's decision making process and allowing for more efficient resource management. This study uses the neural network, LSTM (long short-term memory), lasso regression, ridge regression, random forest, and XGBoost, each of which generate unique water demand forecasting statistics. Taking the forecast of the monthly water demand in Keelung as an example, results show that these selected machine learning algorithms may reach an MAPE (mean absolute percentage error) index of below 3.04%, proving that it is an accurate prediction of water demand. In this study, the machine learning algorithms implemented compare the effects of the model with feature selection versus without feature selection. Among the chosen algorithms, XGBoost performs better without feature selection, while random forest performs optimally by using feature selection. The factor of overfitting must be taken into account. For time-based data prediction, the machine learning algorithms implemented are generally ideal in making full use of the data by suppressing the occurrence of overfitting to achieve better accuracy.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/21300
DOI:	10.6342/NTU201901433
全文授權:	未授權
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-108-1.pdf 未授權公開取用	2.34 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。