A free stale synchronous parallel strategy for distributed machine learning

ACM Press
Publication Type:
Conference Proceeding
ACM International Conference Proceeding Series, 2019, pp. 23-29
Issue Date:
Filename Description Size
3341620.3341625.pdfPublished version656.56 kB
Adobe PDF
Full metadata record
© 2019 Association for Computing Machinery. With the machine learning applications processing larger and more complex data, people tend to use multiple computing nodes to execute the machine learning tasks in distributed way. However, in real world, people always encounter a problem that a few nodes in system exhibit poor performance and drag down the efficiency of the whole system. In existing parallel strategies such as bulk synchronous parallel and stale synchronous parallel, these nodes with poor performance may not be monitored and found out in time. To address this problem, we proposed a free stale synchronous parallel (FSSP) strategy to free the system from the negative impact of those nodes. Our experimental results on some classical machine leaning algorithms and datasets demonstrated that FSSP strategy outperformed other existing parallel computing strategy.
Please use this identifier to cite or link to this item: