Neural Topic Model Training with the REBAR Gradient Estimator

Kumar, A; Esmaili, N; Piccardi, M

Neural Topic Model Training with the REBAR Gradient Estimator

Kumar, A Esmaili, N Piccardi, M

Permalink

Publisher:: Association for Computing Machinery (ACM)
Publication Type:: Journal Article
Citation:: ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, 21, (5), pp. 1-18
Issue Date:: 2022-11-15

Closed Access

	Filename	Description	Size
	3517336.pdf	Published version	450.63 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Kumar, A
dc.contributor.author	Esmaili, N
dc.contributor.author	Piccardi, M https://orcid.org/0000-0001-9250-6604
dc.date.accessioned	2023-05-25T01:44:17Z
dc.date.available	2023-05-25T01:44:17Z
dc.date.issued	2022-11-15
dc.identifier.citation	ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, 21, (5), pp. 1-18
dc.identifier.issn	2375-4699
dc.identifier.issn	2375-4702
dc.identifier.uri	http://hdl.handle.net/10453/170440
dc.description.abstract	Topic modelling is an important approach of unsupervised machine learning that allows automatically extracting the main "topics"from large collections of documents. In addition, topic modelling is able to identify the topic proportions of each individual document, which can be helpful for organizing the collections. Many topic modelling algorithms have been proposed to date, including several that leverage advanced techniques such as variational inference and deep autoencoders. However, to date topic modelling has made limited use of reinforcement learning, a framework that has obtained vast success in many other unsupervised learning tasks. For this reason, in this article we propose training a neural topic model using a reinforcement learning objective and minimizing the objective with the recently-proposed REBAR gradient estimator. Experiments performed over two probing datasets have shown that the proposed model has achieved improvements over all the compared models in terms of both model perplexity and topic coherence, and produced topics that appear qualitatively informative and consistent.
dc.language	en
dc.publisher	Association for Computing Machinery (ACM)
dc.relation.ispartof	ACM Transactions on Asian and Low-Resource Language Information Processing
dc.relation.isbasedon	10.1145/3517336
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	Neural Topic Model Training with the REBAR Gradient Estimator
dc.type	Journal Article
utslib.citation.volume	21
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	closed_access	*
dc.date.updated	2023-05-25T01:44:15Z
pubs.issue	5
pubs.publication-status	Published
pubs.volume	21
utslib.citation.issue	5

Abstract:

Topic modelling is an important approach of unsupervised machine learning that allows automatically extracting the main "topics"from large collections of documents. In addition, topic modelling is able to identify the topic proportions of each individual document, which can be helpful for organizing the collections. Many topic modelling algorithms have been proposed to date, including several that leverage advanced techniques such as variational inference and deep autoencoders. However, to date topic modelling has made limited use of reinforcement learning, a framework that has obtained vast success in many other unsupervised learning tasks. For this reason, in this article we propose training a neural topic model using a reinforcement learning objective and minimizing the objective with the recently-proposed REBAR gradient estimator. Experiments performed over two probing datasets have shown that the proposed model has achieved improvements over all the compared models in terms of both model perplexity and topic coherence, and produced topics that appear qualitatively informative and consistent.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/170440