Scaling-Up Item-Based Collaborative Filtering Recommendation Algorithm Based on Hadoop

Publisher:
IEEE
Publication Type:
Conference Proceeding
Citation:
2011 IEEE World Congress on Services (SERVICES 2011), 2011, pp. 490 - 497
Issue Date:
2011-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2013003627OK.pdf1.74 MB
Adobe PDF
Collaborative filtering (CF) techniques have achieved widespread success in E-commerce nowadays. The tremendous growth of the number of customers and products in recent years poses some key challenges for recommender systems in which high quality recommendations are required and more recommendations per second for millions of customers and products need to be performed. Thus, the improvement of scalability and efficiency of collaborative filtering (CF) algorithms become increasingly important and difficult. In this paper, we developed and implemented a scaling-up item-based collaborative filtering algorithm on MapReduce, by splitting the three most costly computations in the proposed algorithm into four Map-Reduce phases, each of which can be independently executed on different nodes in parallel. We also proposed efficient partition strategies not only to enable the parallel computation in each Map-Reduce phase but also to maximize data locality to minimize the communication cost. Experimental results effectively showed the good performance in scalability and efficiency of the item-based CF algorithm on a Hadoop cluster.
Please use this identifier to cite or link to this item: