大規模線性排序支持向量機在分散式環境下之分析實作

Wei-Lun Huang; 黃煒倫

請用此 Handle URI 來引用此文件： http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/51324

標題:	大規模線性排序支持向量機在分散式環境下之分析實作 Analysis and Implementation of Large-scale Linear RankSVM in Distributed Environments
作者:	Wei-Lun Huang 黃煒倫
指導教授:	林智仁(Chih-Jen Lin)
關鍵字:	大規模學習,排序支持向量機,分散式牛頓法, Learning to rank,Ranking support vector machines,Large-scale learning,Linear model,Distributed Newton method,
出版年 :	2016
學位:	碩士
摘要:	在排序學習中，要快速地得到一個基準模型作為比較，線性排序支持向量機是一個有用的方法。雖然它的平行機制已經被探討且實作在圖形處理器上面，但此實作有可能無法處理大規模的數據集。在本論文中，我們提出兩種平行架構，用分散式牛頓法訓練L2損失函數之線性排序支持向量機。我們小心的探討降低溝通成本以及加速運算的技術，並且在稠密和稀疏的數據集上比較兩種平行機制的優劣。實驗顯示本文提出的方法在兩種數據集上會遠比單機運算快，分別為資料量遠大於特徵數以及特徵數遠大於資料量的數據集。 Linear rankSVM is a useful method to quickly produce a baseline model for learning to rank. Although its parallelization has been investigated and implemented on GPU, it may not handle large-scale data sets. In this thesis, we propose a distributed trust region Newton method for training L2-loss linear rankSVM with two kinds of parallelizations. We carefully discuss the techniques for reducing the communication cost and speeding up the computation, and compare both kinds of parallelizations on dense and sparse data sets. Experiments show that our distributed methods are much faster than the single machine method on two kinds of data sets: one with its number of instances much larger than its number of features, and the other is the opposite.
URI:	http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/51324
全文授權:	有償授權
顯示於系所單位：	資訊工程學系

文件中的檔案：

檔案	大小	格式
ntu-105-1.pdf 目前未授權公開取用	6.47 MB	Adobe PDF

顯示文件完整紀錄

系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

DSpace

機構典藏 DSpace 系統致力於保存各式數位資料（如：文字、圖片、PDF）並使其易於取用。