Optimal Online Data Partitioning for Geo-Distributed Machine Learning in Edge of Wireless Networks

Publisher:
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Publication Type:
Journal Article
Citation:
IEEE Journal on Selected Areas in Communications, 2019, 37, (10), pp. 2393-2406
Issue Date:
2019-10-01
Filename Description Size
08793221.pdfPublished version1.87 MB
Adobe PDF
Full metadata record
© 1983-2012 IEEE. To enable machine learning at the edge of wireless networks (such as edge cloud), close to mobile users, is critical for future wireless networks, but challenging since the lower layers in edge cloud are substantially different from existing machine learning configurations in the cloud. In such geo-distributed computing environment, streaming data need to be evenly and cost-efficiently partitioned for different workers to produce an unbiased learning model with reduced parameter synchronization frequency. This paper presents a new online approach to optimally partitioning streaming data under time-varying network conditions. A new measure is proposed to quantify the evenness of data partitioning and restrain the optimization of data admission, partitioning, and processing. Stochastic gradient descent is applied to learn the optimal decisions online and asymptotically maximize the time-average utility of data partitioning. A new protocol is designed to further reduce the measurements of link costs, while preserving the asymptotic optimality, data evenness, and stability of the platform. Simulation results show that the proposed approach is superior to the state of the art in terms of throughput and cost efficiency, while only 24% of the links need to be measured to achieve the asymptotic optimality.
Please use this identifier to cite or link to this item: