:::

詳目顯示

回上一頁
題名:基於文字熱門程度之銷售預測和點擊率預測資料探勘方法
作者:歐 漢 尼
作者(外文):Hani Omar
校院名稱:國立交通大學
系所名稱:資訊管理研究所
指導教授:劉敦仁
學位類別:博士
出版日期:2015
主題關鍵詞:ForecastingPredictionAutoregressive Integrated Moving Average (ARIMA)Back Propagation Neural Network (BPNN)Google SearchPopularityLatent Dirichlet Analysis (LDA)ForecastingPredictionAutoregressive Integrated Moving Average (ARIMA)Back Propagation Neural Network (BPNN)Google SearchPopularityLatent Dirichlet Analysis (LDA)
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:15
Internet technology has become a part of everyday life for retrieving data, contacting, entertainments, shopping, marketing, and some in the emerging business and developing world. Due to thousands of pages and services on the web, search engines are designed to search for information on the World Wide Web. The words of query are the main part in the retrieving results by search engines; and hence the word popularity is important to improve the correlated business for service providers.
In this study, we first proposed a hybrid ARIMA and Back Propagation Neural Network for sales forecasting based on the popularity of article titles to enhance sales and operations planning. Publishing industries usually pick attractive titles and headlines for their stories to increase sales, since popular article titles and headlines can attract readers to buy or subscribe to magazines. The popularity of article titles are analyzed by using the search indexes obtained from Google search engine. We proposed a novel hybrid neural network model for sales forecasting based on the popularity of article titles, historical sales data, and the prediction result of Autoregressive Integrated Moving Average (ARIMA) forecasting method. Our proposed forecasting model is experimentally evaluated and the result shows that our proposed forecasting method outperforms conventional techniques which do not consider the popularity of title words.
Second, we use the power of words of online advertisements, which impressed by search engines (where users add their queries for searching), to predict the users’ click-through rate (CTR) of advertisements. We use the important words in the queries which correlated to the advertisements and to boost the prediction performance. Also, we use the popularity of words to cope the cold-start problem when new users insert their query without having any knowledge about them using just their queries. Our proposed prediction model is evaluated and the result of the experiments shows that CTR prediction using word popularity outperform the prediction models without word popularity, and the same for cold start problem.
Internet technology has become a part of everyday life for retrieving data, contacting, entertainments, shopping, marketing, and some in the emerging business and developing world. Due to thousands of pages and services on the web, search engines are designed to search for information on the World Wide Web. The words of query are the main part in the retrieving results by search engines; and hence the word popularity is important to improve the correlated business for service providers.
In this study, we first proposed a hybrid ARIMA and Back Propagation Neural Network for sales forecasting based on the popularity of article titles to enhance sales and operations planning. Publishing industries usually pick attractive titles and headlines for their stories to increase sales, since popular article titles and headlines can attract readers to buy or subscribe to magazines. The popularity of article titles are analyzed by using the search indexes obtained from Google search engine. We proposed a novel hybrid neural network model for sales forecasting based on the popularity of article titles, historical sales data, and the prediction result of Autoregressive Integrated Moving Average (ARIMA) forecasting method. Our proposed forecasting model is experimentally evaluated and the result shows that our proposed forecasting method outperforms conventional techniques which do not consider the popularity of title words.
Second, we use the power of words of online advertisements, which impressed by search engines (where users add their queries for searching), to predict the users’ click-through rate (CTR) of advertisements. We use the important words in the queries which correlated to the advertisements and to boost the prediction performance. Also, we use the popularity of words to cope the cold-start problem when new users insert their query without having any knowledge about them using just their queries. Our proposed prediction model is evaluated and the result of the experiments shows that CTR prediction using word popularity outperform the prediction models without word popularity, and the same for cold start problem.
[1] Ahmadzade, F., Model for forecasting passenger of airport, in International Conference on Industrial Engineering and Operations Management Dhaka, Bangladesh, 2010.new window
[2] Alexander Dietzel, M., Braun, N., and Schäfers, W., Sentiment-based commercial real estate forecasting with Google search volume data, Journal of Property Investment &; Finance, vol. 32, no. 6, pp. 540-569, 2014.
[3] Archak, N., Mirrokni, V. S., and Muthukrishnan, S., Budget Optimization for Online Campaigns with Positive Carryover Effects, in WINE, 2012, pp. 86-99.
[4] Askitas, N., and Zimmermann, K. F., Google econometrics and unemployment forecasting, German Council for Social and Economic Data (RatSWD) Research Notes, no. 41, 2009.
[5] Blei, D. M., Ng, A. Y., and Jordan, M. I., Latent dirichlet allocation, the Journal of machine Learning research, pp. 993-1022, 2003.
[6] Bos, C. S., Franses, P. H., and Ooms, M., Inflation, forecast intervals and long memory regression models, International Journal of Forecasting, vol. 18, no. 2, pp. 243-264, 2002.
[7] Brown, R. G., Smoothing, forecasting and prediction of discrete time series: Courier Corporation, 2004.
[8] Chapelle, O., Manavoglu, E., and Rosales, R., Simple and scalable response prediction for display advertising, ACM Transactions on Intelligent Systems and Technology (TIST), 2014.
[9] Chen, T., and Wang, M.-J. J., Forecasting methods using fuzzy concepts, Fuzzy sets and systems, vol. 105, no. 3, pp. 339-352, 1999.
[10] Chiang, Y.-H., Peng, T.-C., and Chang, C.-O., The nonlinear effect of convenience stores on residential property prices: A case study of Taipei, Taiwan, Habitat International, vol. 46, pp. 82-90, 2015.
[11] Choi, H., and Varian, H., Predicting the present with google trends, Economic Record, vol. 88, no. s1, pp. 2-9, 2012.
[12] Cramton, P., and de Castro, L. I., Prediction markets to forecast electricity demand, in Fiftieth Annual Allerton Conference, Monticello, IL, 2012, pp. 1097 - 1104.
[13] D’Amuri, F., and Marcucci, J., 'Google it!'Forecasting the US unemployment rate with a Google job search index, Social Science Research Network, 2010.
[14] Delaware, ANNUAL REPORT PURSUANT TO SECTION 13 OR 15(d) OF THE SECURITIES EXCHANGE ACT OF 1934 For the fiscal year ended December 31, 2014, vol. UNITED STATES SECURITIES AND EXCHANGE COMMISSION Washington, D.C. 20549, Google-Inc, 2014.
[15] El Rhalibi, A., and Merabti, M., A hybrid fuzzy ANN system for agent adaptation in a first person shooter, International Journal of Computer Games Technology, 2007.
[16] Fiordaliso, A., A nonlinear forecasts combination method based on Takagi–Sugeno fuzzy systems, International Journal of Forecasting, vol. 14, no. 3, pp. 367-379, 1998.
[17] Friedman, N., Geiger, D., and Goldszmidt, M., Bayesian network classifiers, Machine learning, vol. 29, no. 2-3, pp. 131-163, 1997.
[18] Fu, W. J., Penalized regressions: the bridge versus the lasso, Journal of computational and graphical statistics, vol. 7, no. 3, pp. 397-416, 1998.
[19] George E. P. Box, G. M. J., Gregory C. Reinsel, Time Series Analysis: Forecasting and Control, 4th ed.: John Wiley &; Sons, 2011.
[20] Goel, S., Hofman, J. M., Lahaie, S., Pennock, D. M., and Watts, D. J., Predicting consumer behavior with Web search, Proceedings of the National Academy of Sciences, vol. 107, no. 41, pp. 17486-17490, 2010.
[21] Goel, S., and Goldstein, D. G., Predicting individual behavior with social networks, Marketing Science, vol. 33, no. 1, pp. 82-93, 2013.new window
[22] Gollapudi, S., Panigrahy, R., and Goldszmidt, M., Inferring clickthrough rates on ads from click behavior on search results, in Fourth International Conference on Web Search and Web Data Mining, WSDM, 2011.
[23] Gould, J. H., FOREX prediction using an artificial intelligence system, Research Thesis, Oklahoma State University, 2004.
[24] Graepel, T., Candela, J. Q., Borchert, T., and Herbrich, R., Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft's bing search engine, in Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010, pp. 13-20.
[25] Granger, C. W., Long memory relationships and the aggregation of dynamic models, Journal of econometrics, vol. 14, no. 2, pp. 227-238, 1980.
[26] Gunn, S. R., Support vector machines for classification and regression, ISIS technical report, vol. 14, 1998.
[27] Guzman, G., Internet search behavior as an economic forecasting tool: The case of inflation expectations, Journal of economic and social measurement, vol. 36, no. 3, pp. 119-167, 2011.
[28] Han, J., Kamber, M., and Pei, J., Data mining, southeast asia edition: Concepts and techniques: Morgan kaufmann, 2006.
[29] Hand, C., and Judge, G., Searching for the picture: forecasting UK cinema admissions using Google Trends data, Applied Economics Letters, vol. 19, no. 11, pp. 1051-1055, 2012.
[30] Hargittai, E., Do you" google"? Understanding search engine use beyond the hype, First Monday, 2004.
[31] Hastie, T., Tibshirani, R., Friedman, J., Hastie, T., Friedman, J., and Tibshirani, R., The elements of statistical learning: Springer, 2009.
[32] Holt, C. C., Author's retrospective on ‘Forecasting seasonals and trends by exponentially weighted moving averages’, International Journal of Forecasting, vol. 20, no. 1, pp. 11-13, 2004.new window
[33] Hummel, P., and McAfee, R. P., Loss functions for predicted click-through rates in auctions for online advertising, Preprint, Google Inc, 2013.
[34] Inc, e. M. Google Will Take 55% of Search Ad Dollars Globally in 2015, March 31, 2015; http://www.emarketer.com/Article/Google-Will-Take-55-of-Search-Ad-Dollars-Globally-2015/1012294.
[35] Inc, G., Marketing and Advertising Using Google™ Targeting Your Advertising to the Right Audience: Cengage Learning, 2007.
[36] Jain, A., and Kumar, A. M., Hybrid neural network models for hydrologic time series forecasting, Applied Soft Computing, vol. 7, no. 2, pp. 585-592, 2007.
[37] Jansen, B. J., Booth, D. L., and Spink, A., Determining the user intent of web search engine queries, in Proceedings of the 16th international conference on World Wide Web, 2007, pp. 1149-1150.
[38] Ji, L., and Peters, A. J., Forecasting vegetation greenness with satellite and climate data, Geoscience and Remote Sensing Letters, IEEE, vol. 1, no. 1, pp. 3-6, 2004.new window
[39] Khalaf, G., and Iguernane, M., Ridge Regression and Ill-Conditioning, Journal of Modern Applied Statistical Methods, 2014.
[40] Kongcharoen, C., and Kruangpradit, T., Autoregressive Integrated Moving Average with Explanatory Variable (ARIMAX) Model for Thailand Export, in This paper is presented at the 33rd International Symposium on Forecasting, South Korea, 2013.
[41] Li, J., Zhang, P., Cao, Y., Liu, P., and Guo, L., Efficient Behavior Targeting Using SVM Ensemble Indexing, in Proceedings of the 2012 IEEE 12th International Conference on Data Mining, 2012, pp. 409-418.
[42] Loughlin, C., and Harnisch, E., The viability of StockTwits and Google Trends to predict the stock market, Short Report, Spring, 2014.
[43] Mastorocostas, P. A., Theocharis, J. B., and Petridis, V. S., A constrained orthogonal least-squares method for generating TSK fuzzy models: Application to short-term load forecasting, Fuzzy Sets and Systems, vol. 118, no. 2, pp. 215-233, 2001.
[44] McLaren, N., and Shanbhogue, R., Using internet search data as economic indicators, Bank of England Quarterly Bulletin, 2011.
[45] McMahan, H. B., Holt, G., Sculley, D., Young, M., Ebner, D., Grady, J., Nie, L., Phillips, T., Davydov, E., and Golovin, D., Ad click prediction: a view from the trenches, in Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, 2013, pp. 1222-1230.
[46] Mitrea, C., Lee, C., and Wu, Z., A comparison between neural networks and traditional forecasting methods: A case study, International Journal of Engineering Business Management, vol. 1, no. 2, pp. 19-24, 2009.new window
[47] Olson, D., and Mossman, C., Neural network forecasts of Canadian stock returns using accounting ratios, International Journal of Forecasting, vol. 19, no. 3, pp. 453-465, 2003.
[48] Papalexopoulos, A. D., and Hesterberg, T. C., A regression-based approach to short-term system load forecasting, Power Systems, IEEE Transactions on, vol. 5, no. 4, pp. 1535-1547, 1990.
[49] Park, J., Park, Y., and Lee, K., Composite modeling for adaptive short-term load forecasting, Power Systems, IEEE Transactions on, vol. 6, no. 2, pp. 450-457, 1991.
[50] Richardson, M., Dominowska, E., and Ragno, R., Predicting clicks: estimating the click-through rate for new ads, in Proceedings of the 16th international conference on World Wide Web, 2007, pp. 521-530.
[51] Sallehuddin, R., and Hj. Shamsuddin, S. M., Hybrid grey relational artificial neural network and auto regressive integrated moving average model for forecasting time-series data, Applied Artificial Intelligence, vol. 23, no. 5, pp. 443-486, 2009.
[52] Sculley, D., Malkin, R. G., Basu, S., and Bayardo, R. J., Predicting bounce rates in sponsored search advertisements, in Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, 2009, pp. 1325-1334.
[53] Sculley, D., Combined regression and ranking, in Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, 2010, pp. 979-988.
[54] Shafer Jr, E. L., Moeller, G., and Upper Darby, P., Predicting quantitative and qualitative values of recreation participation, in Recreation Symp. Proc. NE For. Expt. Sta. USDA, Upper Darby, Pa, 1971, pp. 5-22.
[55] Snyder, R. D., Koehler, A. B., Hyndman, R. J., and Ord, J. K., Exponential smoothing models: Means and variances for lead-time demand, European Journal of Operational Research, vol. 158, no. 2, pp. 444-455, 2004.
[56] Tagami, Y., Ono, S., Yamamoto, K., Tsukamoto, K., and Tajima, A., Ctr prediction for contextual advertising: learning-to-rank approach, in Proceedings of the Seventh International Workshop on Data Mining for Online Advertising, 2013, pp. 4.
[57] Terui, N., and Van Dijk, H. K., Combined forecasts from linear and nonlinear time series models, International Journal of Forecasting, vol. 18, no. 3, pp. 421-438, 2002.
[58] Tewari, A., and Bartlett, P. L., On the consistency of multiclass classification methods, The Journal of Machine Learning Research, vol. 8, pp. 1007-1025, 2007.
[59] Thomassey, S., and Happiette, M., A neural clustering and classification system for sales forecasting of new apparel items, Applied Soft Computing, vol. 7, no. 4, pp. 1177-1187, 2007.
[60] Tian, Y., Liu, Y., Xu, D., Yao, T., Zhang, M., and Ma, S., Incorporating Seasonal Time Series Analysis with Search Behavior Information in Sales Forecasting, in Proceedings of the 21st international conference companion on World Wide Web, 2012, pp. 615-616.
[61] Vapnik, V., The nature of statistical learning theory: Springer Science &; Business Media, 2000.
[62] Vapnik, V. N., An overview of statistical learning theory, Neural Networks, IEEE Transactions on, vol. 10, no. 5, pp. 988-999, 1999.
[63] Voronin, S., and Partanen, J., A hybrid electricity price forecasting model for the Finnish electricity spot market, in The 32st Annual International Symposium on Forecasting, Boston, 2012.
[64] Wang, C.-J., and Chen, H.-H., Learning user behaviors for advertisements click prediction, in Proceedings of the 34rd international ACM SIGIR conference on research and development in information retrieval Workshop on Internet Advertising, 2011, pp. 1-6.
[65] Wang, X., Li, W., Cui, Y., Zhang, R., and Mao, J., Click-through rate estimation for rare events in online advertising, Online Multimedia Advertising: Techniques and Technologies, pp. 1-12, 2010.
[66] White, R. W., and Dumais, S. T., Characterizing and predicting search engine switching behavior, in Proceedings of the 18th ACM conference on Information and knowledge management, 2009, pp. 87-96.
[67] Wu, K.-W., Ferng, C.-S., Ho, C.-H., Liang, A.-C., Huang, C.-H., Shen, W.-Y., Jiang, J.-Y., Yang, M.-H., Lin, T.-W., and Lee, C.-P., A two-stage ensemble of diverse models for advertisement ranking in KDD Cup 2012, KDDCup, 2012.
[68] Wu, L., and Shahidehpour, M., A hybrid model for day-ahead price forecasting, Power Systems, IEEE Transactions on, vol. 25, no. 3, pp. 1519-1530, 2010.
[69] Wu, L., and Brynjolfsson, E., The future of prediction: How Google searches foreshadow housing prices and sales: University of Chicago Press, 2014.
[70] Xing, B., and Lin, Z., The impact of search engine optimization on online advertising market, in Proceedings of the 8th international conference on Electronic commerce, 2006, pp. 519-529.
[71] Xiong, C., Wang, T., Ding, W., Shen, Y., and Liu, T.-Y., Relational click prediction for sponsored search, in Proceedings of the fifth ACM international conference on Web search and data mining, 2012, pp. 493-502.
[72] Yan, L., Li, W.-j., Xue, G.-R., and Han, D., Coupled Group Lasso for Web-Scale CTR Prediction in Display Advertising, in Proceedings of the 31st International Conference on Machine Learning (ICML-14), 2014, pp. 802-810.
[73] Yoo, H., and Pimmely, R., Short term load forecasting using a self-supervised adaptive neural network, Power Systems, IEEE Transactions on, vol. 14, no. 2, pp. 779-784, 1999.
[74] Zhang, G. P., Time series forecasting using a hybrid ARIMA and neural network model, Neurocomputing, vol. 50, pp. 159-175, 2003.
[75] Zhang, G. P., and Qi, M., Neural network forecasting for seasonal and trend time series, European journal of operational research, vol. 160, no. 2, pp. 501-514, 2005.
[76] Zhang, X., Liu, Y., Yang, M., Zhang, T., Young, A. A., and Li, X., Comparative study of four time series methods in forecasting typhoid fever incidence in China, PLOS ONE, vol. Volume 8, no. Issue 5, 2013.
[77] Zhang, Y., Jansen, B. J., and Spink, A., Identification of factors predicting clickthrough in Web searching using neural network analysis, Journal of the American Society for Information Science and Technology, vol. 60, no. 3, pp. 557-570, 2009.

 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top