請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/60721
完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.advisor | 蘇雅韻(Ya-Yunn Su) | |
dc.contributor.author | Chen-Hsin Ding | en |
dc.contributor.author | 丁振新 | zh_TW |
dc.date.accessioned | 2021-06-16T10:27:26Z | - |
dc.date.available | 2013-08-17 | |
dc.date.copyright | 2013-08-17 | |
dc.date.issued | 2013 | |
dc.date.submitted | 2013-08-15 | |
dc.identifier.citation | [1] Brining animation to life through cloud computing. In http://softwareinsight.intel.com/visual/visual-feature.php.
[2] Matlab k-means function help page. In http://www.mathworks.com/help/stats/kmeans.html. [3] Support vector machine wiki page. In https://en.wikipedia.org/wiki/Support_vector_machine. [4] Workstaion hardwares in ntu csie. In http://wslab.csie.ntu.edu.tw/hardware/. [5] Adamic, L. Zipf, power-laws and pareto - a ranking tutorial. [6] Basak, D., Pal, S., and Patranabis, D. C. Support vector regression. In Neural Information Processing (October 2007). [7] Chang, C.-C., and Lin, C.-J. LIBSVM: A library for support vector machines. In ACM Transactions on Intelligent Systems and Technology (2011). Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm. [8] Chen, Y., Ganapathi, A., Griffith, R., and Katz, R. The case for evaluating mapre- duce performance using workload suites. In Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), IEEE (July 2011). [9] Chen, Y., Ganapathi, A. S., Griffith, R., and Katz, R. H. Analysis and lessons from a publicly available google cluster trace. In UC Berkeley Technical Report (June 2010). [10] Dinda, P. Online prediction of the running time of tasks. In Cluster Computing (2001). [11] Dinda, P. A., and O’Hallaron, D. R. Host load prediction using linear models. In Cluster Computing (2000). [12] Drucker,H.,Burges,C.J.C.,Kaufman,L.,Smola,A.,andVapnik,V.Supportvector regression machines. In Neural Information Processing Systems (1997). [13] Ferguson, A. D., Bodik, P., Kandula, S., Boutin, E., and Fonseca, R. Jockey: Guar- anteed job latency in data parallel clusters. In Proceedings of the 7th ACM european conference on Computer Systems (EuroSys) (2012). [14] Iverson, M. A., Ozguner, F., and Potter, L. C. Statistical prediction of task execution times through analytic benchmarking for scheduling in a heterogeneous environ- ment. In IEEE Heterogeneous Computing Workshop (1999). [15] Kapadia, N. H., Fortes, J. A. B., and Brodley, C. E. Predictive application- performance modeling in a computational grid environment. In High Performance Distributed Computing (ISHPDC), IEEE (August 1999). [16] Kavulya, S., Tan, J., Gandhi, R., and Narasimhan, P. An analysis of traces from a production mapreduce cluster december. In Cluster, Cloud and Grid Computing (CCGrid), IEEE (May 2010). [17] Lama, P., and Zhou, X. Aroma: Automated resource allocation and configuration of mapreduce environment in the cloud. In Proceedings of the 9th international conference on Autonomic computing (ICAC) (2012). [18] Leung, A. W., Pasupathy, S., Goodson, G., and Miller, E. L. Measurement and analysis of large-scale network file system workloads. In USENIX Annual Technical Conference on Annual Technical Conference (ATC) (2008). [19] Mishra, A. K., Hellerstein, J. L., Cirne, W., and Das, C. R. Towards characterizing cloud backend workloads: Insight from google compute clusters. In ACM SIGMET- RICS Performance Evaluation Review (March 2010). 54 [20] Newman, M. Power laws, pareto distributions and zipf’s law. In Contemporary Physics (2005). [21] Reiss, C., Tumanov, A., Ganger, G. R., Katz, R. H., and Kozuch, M. A. Heterogene- ity and dynamicity of clouds at scale: Google trace analysis. In Proceedings of the Third ACM Symposium on Cloud Computing Article No. 7 (2012). [22] Smith,W.Predictionservicesfordistributedcomputing.InParallelandDistributed Processing Symposium (IPDPS), IEEE (March 2007). [23] Smith, W., Foster, I., and Taylor, V. Predicting application run times using historical information. In Lecture Notes on Computer Science (1998). [24] Smola, A. J., and Scholkopf, B. A tutorial on support vector regression. In Statistics and Computing (August 2004). [25] Vaarandi, R. Simple logfile clustering tool. In http://ristov.users.sourceforge.net/slct/. [26] Vaarandi, R. A data clustering algorithm for mining patterns from event logs. In IEEE workshop on IP Operations and Management (2003). [27] Verma, A., Cherkasova, L., and Campbell, R. H. Aria: Automatic resource inference and allocation for mapreduce. In Proceedings of the 9th international conference on Autonomic computing (ICAC) (2011). | |
dc.identifier.uri | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/60721 | - |
dc.description.abstract | 雲端運算可以在本地機器計算資源不足時提供計算資源。為了幫助 系統排程器決定一個程序該在本地端執行或者在雲端上執行,本論文 提供一個程序執行時間預測方法,使得排程器可根據預估的執行時間 決定要在本地或雲端執行此程序。本論文提出一個機器學習 (支持向量 回歸) 的方法,使用歷史資訊來建立預測模型。我們收集了程序執行的 歷史檔、系統負載資訊、檔案系統資訊,來預測重覆執行的高計算用 量程序執行時間。此方法實驗評估可達 20% 相對平均執行時間的平均 誤差,預測高計算用量開放原始碼程式的執行時間俱可行性。 | zh_TW |
dc.description.abstract | Cloud computing is widely used in on-demand computing in recent years. A local computing cluster cannot always provide sufficient resources for ev- ery user. A computing cluster with cloud resources assisted could let users obtain computing resource faster while local machine are fully loaded. We collected traces from workstations in our department to understand how users use machines for computing and try to improve the scheduler. To make a sys- tem scheduler dispatch an incoming job to a local machine or to the cloud, we provide a task run time advisor for the scheduler making decisions accu- ractely. The run time advisor is a support vector regression model which is constructed by historical information. The prediction error is less than 20% error of mean run time in predicting CPU-bound open source projects. Our evaluation experiment is a data driven approach that the trace is collected from workstations in NTU CSIE. | en |
dc.description.provenance | Made available in DSpace on 2021-06-16T10:27:26Z (GMT). No. of bitstreams: 1 ntu-102-R00944031-1.pdf: 1119823 bytes, checksum: f8e517bb3a0c56001d6b6d1927b95f34 (MD5) Previous issue date: 2013 | en |
dc.description.tableofcontents | 誌謝 iii
摘要 v Abstract vii 1 Introduction 1 2 Problem Statements 3 2.1 Motivation.................................. 3 2.2 Understandingworkloadcharacteristics .................. 4 2.3 Improvingschedulerdecisions ....................... 4 2.4 Goals .................................... 5 3 Trace Collection 7 3.1 Processstatustrace ............................. 7 3.2 Hostloadtrace ............................... 8 3.3 Filesystemtrace .............................. 8 4 Workload Characteristics 13 4.1 Methodology ................................ 13 4.2 Workloadclustering............................. 16 4.3 Statisticalanalysis.............................. 16 4.4 Insights from task classification and statistical analysis . . . . . . . . . . 23 5 Task Run Time Prediction 25 5.1 Amachinelearningtechniqueapproach .................. 25 5.2 Methodology ................................ 25 6 Evaluations 31 6.1 Adatadrivenapproachexperiment..................... 31 6.2 Initialevaluation .............................. 32 6.3 Onemonthlongtraceevaluation...................... 34 6.4 Filteringunpredictableprocesses...................... 36 7 Discussions 41 7.1 Factorsofunpredictability ......................... 41 7.2 Whywhitelistworks ............................ 44 7.3 Futurework................................. 46 8 Related Works 47 9 Conclusions 51 Bibliography 53 | |
dc.language.iso | en | |
dc.title | 使用機器學習方法預測程序執行時間 | zh_TW |
dc.title | A task run time predictor using machine learning techniques | en |
dc.type | Thesis | |
dc.date.schoolyear | 101-2 | |
dc.description.degree | 碩士 | |
dc.contributor.oralexamcommittee | 薛智文(Chih-Wen Hsueh),林守德(Shou-De Lin) | |
dc.subject.keyword | 雲端運算,排程器,機器學習,預測,資料探勘, | zh_TW |
dc.subject.keyword | cloud computing,scheduler,machine learning,prediction,data mining, | en |
dc.relation.page | 55 | |
dc.rights.note | 有償授權 | |
dc.date.accepted | 2013-08-15 | |
dc.contributor.author-college | 電機資訊學院 | zh_TW |
dc.contributor.author-dept | 資訊網路與多媒體研究所 | zh_TW |
顯示於系所單位: | 資訊網路與多媒體研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-102-1.pdf 目前未授權公開取用 | 1.09 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。