Parallel Proximal Support Vector Machine for High-dimensional Pattern Classification

Publisher:
ACM
Publication Type:
Conference Proceeding
Citation:
Proc. Of The 21st ACM Conference on Information and Knowledge Management (CIKM-12), 2012, pp. 2351 - 2354
Issue Date:
2012-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2011008006OK.pdf1.21 MB
Adobe PDF
Proximal support vector machine (PSVM) is a simple but effective classifier, especially for solving large-scale data classification problems. An inherent deficiency of PSVM lies on its inefficiency for dealing with high-dimensional data. In this paper, we propose a parallel version of PSVM (PPSVM). Based on random dimensionality partitioning, PPSVM can obtain partitioned local model parameters in parallel, with combined parameters to form the final global solution. In fact, PPSVM enjoys two properties: 1) It can calculate model parameters in parallel and is therefore a fast learning method with theoretically proved convergence; and 2) It can avoid the inversion of large matrix, which makes it suitable for high-dimensional data. In the paper, we also propose a random PPSVM with randomly partitioned data in each iteration to improve the performance of PSVM. Experimental results on real-world data demonstrate that the proposed methods can obtain similar or even better prediction accuracy than PSVM with much better runtime efficiency.
Please use this identifier to cite or link to this item: