跳到主要內容

簡易檢索 / 詳目顯示

研究生: 陳冠廷
Chen, Kuan-Ting
論文名稱: 隨機梯度下降法對於順序迴歸模型估計之收斂研究及推薦系統應用
Convergence of Stochastic Gradient Descent for Ordinal Regression Model and Applications for Recommender Systems
指導教授: 翁久幸
Weng, Chiu-Hsing
口試委員: 黃子銘
Huang, Tzee-Ming
蔡恆修
Tsai, Heng-Hsiu
學位類別: 碩士
Master
系所名稱: 商學院 - 統計學系
Department of Statistics
論文出版年: 2020
畢業學年度: 108
語文別: 英文
論文頁數: 47
中文關鍵詞: 矩陣分解順序迴歸隨機梯度下降法批次隨機梯度下降法平均估計
外文關鍵詞: Matrix Factorization, Ordinal Regression, Stochastic Gradient Descent, Mini-Batch Stochastic Gradient Descent, Average Estimate
DOI URL: http://doi.org/10.6814/NCCU202000780
相關次數: 點閱:109下載:19
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 矩陣分解是一種普及的協同過濾方法,Koren和Sill在2011年提出了基於順序迴歸的矩陣分解方法。相較於傳統的矩陣分解方法,由於基於順序迴歸的矩陣分解方法能夠輸出用戶對物品各項評分的出現機率,因此在應用方面上具有優勢。雖然他們的實驗在準確性上表現優異,但目前尚沒有開源的程式能夠使用。此次論文我們便應用隨機梯度下降法來實現此矩陣分解模型,並討論遭遇到的數值問題,由於此模型涉及順序迴歸模型,我們也研究了順序迴歸模型在隨機梯度下降法下,其參數估計的收斂。


    Matrix factorization is a popular Collaborating Filtering (CF) method. Koren and Sill (2011) proposed an ordinal regression model with a matrix factorization CF method. This approach is advantageous over traditional matrix factorization methods by its ability to output a full probability distribution of the user-item ratings. Though their experiments showed superior results in its accuracy, there is no publicly available software. In this thesis, we implement the algorithms by Stochastic Gradient Descent (SGD) and discuss the numerical issues encountered. As this approach involves ordinal regression models, we will study the convergence of SGD for ordinal regression models as well.

    1 INTRODUCTION 5
    2 REVIEW 7
    2.1 Matrix Factorization Model 7
    2.2 Ordinal Regression Model 10
    2.3 OrdRec Model 11
    2.4 Learning Algorithms 13
    3 ORDINAL REGRESSION IMPLEMENTATION 16
    3.1 Numerical Problems in Simulation 16
    3.2 ASGD for Ordinal Regression Model 20
    4 RECOMMENDER SYSTEM IMPLEMENTATION 22
    4.1 Optimization Details 22
    4.2 Recommender System Update Rule 24
    4.3 Simulation for OrdRec Model 27
    5 REAL APPLICATION 31
    5.1 Experiment Dataset 31
    5.2 Evaluation Metrics 33
    5.3 Performance Comparison 34
    5.4 Check for Convergence 36
    6 CONCLUSION 39
    A Optimization Detail of Ordinal Regression Models 40
    B Assumption Verification 44
    References 47

    [1] Léon Bottou, Frank E Curtis, and Jorge Nocedal. Optimization Methods for Large-Scale Machine Learning.Siam Review, 60(2):223–311, 2018.
    [2] Yixin Fang, Jinfeng Xu, and Lei Yang. Online Bootstrap Confidence Intervals for the Stochastic Gradient Descent Estimator.The Journal of Machine Learning Research, 19(1):3053–3073, 2018.
    [3] Simon Funk. Netflix Update: Try This at Home, 2006.
    [4] F Maxwell Harper and Joseph A Konstan. The Movielens Datasets: History and Context.ACM Transactions on Interactive Intelligent Systems (TIIS), 5(4):1–19,2015.
    [5] Jack Kiefer and Jacob Wolfowitz. Stochastic Estimation of The Maximum of ARegression Function.The Annals of Mathematical Statistics, 23(3):462–466, 1952.
    [6] Yehuda Koren. Factorization Meets the Neighborhood: A MultifacetedCollaborative Filtering Model. In Proceedings of the 14th ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining, pages 426–434, 2008.
    [7] Yehuda Koren, Robert Bell, and Chris Volinsky. Matrix Factorization Techniques for Recommender Systems.Computer, 42(8):30–37, 2009.
    [8] Yehuda Koren and Joe Sill. Ordrec: An Ordinal Model for Predicting PersonalizedItem Rating Distributions. In Proceedings of the 5th ACM Conference on Recommender Systems, pages 117–124, 2011.
    [9] Peter McCullagh. Regression Models for Ordinal Data.Journal of the RoyalStatistical Society: Series B (Methodological), 42(2):109–127, 1980.
    [10] Boris T Polyak and Anatoli B Juditsky. Acceleration of Stochastic Approximation by Averaging.SIAM Journal on Control and Optimization, 30(4):838–855, 1992.
    [11] Herbert Robbins and Sutton Monro. A Stochastic Approximation Method.TheAnnals of Mathematical Statistics, pages 400–407, 1951.
    [12] David Ruppert. Efficient Estimations from A Slowly Convergent Robbins-MonroProcess. Technical report, Cornell University Operations Research and IndustrialEngineering, 1988.

    QR CODE
    :::