Learning Private Neural Language Modeling with Attentive Aggregation

Ji, S; Pan, S; Long, G; Li, X; Jiang, J; Huang, Z

Publisher: IEEE
Publication Type: Conference Proceeding
Citation: arXiv preprint arXiv:1812.07108, 2018, 2019-July
Issue Date: 2018

Closed Access

Filename	Description	Size
IJCNN 2019 - Shaoxiong - paper.pdf	Published version	435.91 kB

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Ji, S
dc.contributor.author	Pan, S https://orcid.org/0000-0003-0794-527X
dc.contributor.author	Long, G https://orcid.org/0000-0003-3740-9515
dc.contributor.author	Li, X
dc.contributor.author	Jiang, J https://orcid.org/0000-0001-5301-7779
dc.contributor.author	Huang, Z
dc.date	2019-07-14
dc.date.accessioned	2021-05-11T01:17:11Z
dc.date.available	2021-05-11T01:17:11Z
dc.date.issued	2018
dc.identifier.citation	arXiv preprint arXiv:1812.07108, 2018, 2019-July
dc.identifier.isbn	9781728119854
dc.identifier.issn	2161-4393
dc.identifier.uri	http://hdl.handle.net/10453/148830
dc.description.abstract	Mobile keyboard suggestion is typically regarded as a word-level language modeling problem. Centralized machine learning techniques require the collection of massive user data for training purposes, which may raise privacy concerns in relation to users' sensitive data. Federated learning (FL) provides a promising approach to learning private language modeling for intelligent personalized keyboard suggestions by training models on distributed clients rather than training them on a central server. To obtain a global model for prediction, existing FL algorithms simply average the client models and ignore the importance of each client during model aggregation. Furthermore, there is no optimization for learning a well-generalized global model on the central server. To solve these problems, we propose a novel model aggregation with an attention mechanism considering the contribution of client models to the global model, together with an optimization technique during server aggregation. Our proposed attentive aggregation method minimizes the weighted distance between the server model and client models by iteratively updating parameters while attending to the distance between the server model and client models. Experiments on two popular language modeling datasets and a social media dataset show that our proposed method outperforms its counterparts in terms of perplexity and communication cost in most settings of comparison.
dc.language	en
dc.publisher	IEEE
dc.relation	http://purl.org/au-research/grants/arc/LP150100671
dc.relation.ispartof	arXiv preprint arXiv:1812.07108
dc.relation.ispartof	International Joint Conference on Neural Networks (IJCNN)
dc.relation.ispartofseries	IEEE International Joint Conference on Neural Networks (IJCNN)
dc.relation.isbasedon	10.1109/IJCNN.2019.8852464
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	Learning Private Neural Language Modeling with Attentive Aggregation
dc.type	Conference Proceeding
utslib.citation.volume	2019-July
utslib.location.activity	Budapest, HUNGARY
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	closed_access	*
dc.date.updated	2021-05-11T01:17:10Z
pubs.finish-date	2019-07-19
pubs.publication-status	Published
pubs.start-date	2019-07-14
pubs.volume	2019-July

Abstract:

Mobile keyboard suggestion is typically regarded as a word-level language modeling problem. Centralized machine learning techniques require the collection of massive user data for training purposes, which may raise privacy concerns in relation to users' sensitive data. Federated learning (FL) provides a promising approach to learning private language modeling for intelligent personalized keyboard suggestions by training models on distributed clients rather than training them on a central server. To obtain a global model for prediction, existing FL algorithms simply average the client models and ignore the importance of each client during model aggregation. Furthermore, there is no optimization for learning a well-generalized global model on the central server. To solve these problems, we propose a novel model aggregation with an attention mechanism considering the contribution of client models to the global model, together with an optimization technique during server aggregation. Our proposed attentive aggregation method minimizes the weighted distance between the server model and client models by iteratively updating parameters while attending to the distance between the server model and client models. Experiments on two popular language modeling datasets and a social media dataset show that our proposed method outperforms its counterparts in terms of perplexity and communication cost in most settings of comparison.
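
The aggregation step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact scoring rule or update formula, so everything specific here is an assumption made for illustration: the function name `attentive_aggregate`, the use of a per-layer Euclidean distance, a softmax over clients to turn distances into attention weights, and the `step_size` parameter. Models are represented as plain dictionaries mapping a layer name to a list of floats.

```python
import math


def attentive_aggregate(server, clients, step_size=1.0):
    """One server-side attentive aggregation step (illustrative sketch).

    server:  dict mapping layer name -> list of floats
    clients: list of dicts with the same layout as `server`
    """
    updated = {}
    for name, w in server.items():
        # Per-client difference and Euclidean distance for this layer.
        diffs = [[wi - ci for wi, ci in zip(w, c[name])] for c in clients]
        dists = [math.sqrt(sum(d * d for d in diff)) for diff in diffs]

        # Assumed scoring rule: softmax over clients turns distances into
        # attention weights (subtracting the max keeps exp() stable).
        m = max(dists)
        exps = [math.exp(d - m) for d in dists]
        total = sum(exps)
        attn = [e / total for e in exps]

        # Move the server layer toward the clients along the
        # attention-weighted sum of differences.
        updated[name] = [
            wi - step_size * sum(a * diff[i] for a, diff in zip(attn, diffs))
            for i, wi in enumerate(w)
        ]
    return updated


if __name__ == "__main__":
    server = {"embed": [0.0, 0.0]}
    clients = [{"embed": [1.0, 0.0]}, {"embed": [3.0, 0.0]}]
    print(attentive_aggregate(server, clients, step_size=0.5))
```

Because the attention weights sum to one, a `step_size` of 1.0 replaces each server layer with an attention-weighted combination of the client layers, while smaller values take a partial step. Plain averaging, which the abstract contrasts against, corresponds to fixing every weight at one over the number of clients instead of computing them from the distances.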

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/148830