請用此 Handle URI 來引用此文件:
http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/59284
標題: | 零售電商之消費者回購預測:機械學習應用之實證研究 The Customer Repurchase Prediction of E-commerce Brand: An Empirical Study of Machine Learning Technique Implication |
作者: | Yueh-Hsuan Tsai 蔡曜亘 |
指導教授: | 黃俊堯(Chun-Yao Huang) |
關鍵字: | 機械學習,消費行為預測,回購預測,決策樹,支持向量,管理經驗法, Machine Learning,Customer Base Analysis,Repurchase Forecast,Decision Tree,Support Vector Machine,Managerial Heuristic, |
出版年 : | 2017 |
學位: | 碩士 |
摘要: | 近年分析模型與機械學習演算法的蓬勃發展,各式各樣演算法應用於不同領域,但零售業甚至電子商務零售業的數據分析應用卻不是非常普遍,價值創造也不如預期,接著又面對開源數據分析工具的發展,我認為應該能夠找出有效、也易於應用的演算法來建構回購預測模型,並解決近代電子商務常見的過度行銷問題,以及協助完成完整的CRM系統,因此產生了兩個研究目的:
(1)支持向量機與梯度增強決策樹兩種分類演算法,在跨產業的回購預測效果比較上,何者較優。 (2) 用機械學習方法(支持向量機與梯度增強決策樹)做回購預測,相較於過去常用的管理經驗法則建構的模型,是否能顯著提升回購預測的效果。 為了證明上述,本研究利用跨三個電商產業(美妝、女裝、食品)之實證研究,去建構以下三個模型(1)XGBOOST模型 (2)支持向量機模型 (3)管理經驗法模型,並比較每個模型預測出的F1score,去驗證機械學習的分類方法(支持向量機、梯度增強決策樹)應用在消費者回購行為的預測時,是不是能提供有效的預測結果。 實證結果證實以下發現: 支持向量機優於XGBOOST:由第五章第一節的結果可以明確表示,支持向量機在電商零售情境下的表現優於XGBOOST。 支持向量機方法整體占優:在不同產業、不同品牌情境下,支持向量機預測的效果皆優於管理經驗法模型的預測效果。 預測工具使用應適時適所:雖然在整理預測效果上(F1 score),支持向量機有較優秀的表現,但細看至精確率(Precision Rate)與抓取率(Recall Rate)下,我們可以發現支持向量機有較好的精確率,而管理經驗模型的抓取率卻普遍優秀,所以管理者在使用工具時,應該要挑選適合搭配該檔行銷活動的工作座使用。 研究限制:整個實證研究過程中,發現了許多研究限制,如1.演算法極限 2.特徵變數有限 3.跨產業限制,大品牌小品牌差異 4.預測區間N天,這幾個限制都會影響實證研究成果,若在未來有辦法突破某些研究限制,相信預測結果會大大提升。 With the rising of data analytic model and Machine Learning algorithm, all kinds of algorithm have been applied on different area. Even so, the implication of data analysis in the e-commerce industry in Taiwan is still little, the value created by data analysis is not as much as expected. While the open source analytic tools being hot nowadays , there might be powerful and easy way to implement data analysis to build customer forecast model and solve the over marketing problems which is rising in modern e-commerce industry. Due to all above, there are two research problems: (1)Which performs better on customer forecast in different industry? Support Vector Machine(SVM) or Gradient Boosting Decision Tree? (2)Can Machine Learning techniques perform better than managerial heuristic method on customer forecast in different industry? This research use transaction data from three different e-commerce and retail industry(cosmetics, women clothes, food product) to build three models as below. (1)XGBOOSTt model (2)Support Vector Machine Model (3) managerial heuristic model. This research will compare the performance of three different model base on F1 score。 The result as following: Support Vector Machine performs better than XGBOOST. Support Vector Machine performs better than managerial heuristic model in three different industry. Different forecasting tools can be useful in different situation. Though base on F1 score, SVM always performs better, the Precision rate and Recall rate can tell different story. SVM always performs better on Precision rate while managerial heuristic method performs better on Recall rate. Marketers and managers should pick right tools base on different marketing situation. Through the research, it found out several research restrictions that are algorithm limit, limited factor, industrial variation, scale impact and predict N days. These factors can cause severe impact on the model, yet the model efficiency can be improve by reducing these restrictions. |
URI: | http://tdr.lib.ntu.edu.tw/jspui/handle/123456789/59284 |
DOI: | 10.6342/NTU201701328 |
全文授權: | 有償授權 |
顯示於系所單位: | 商學研究所 |
文件中的檔案:
檔案 | 大小 | 格式 | |
---|---|---|---|
ntu-106-1.pdf 目前未授權公開取用 | 1.43 MB | Adobe PDF |
系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。