Deep Reinforcement Learning with Transformers for Text Adventure Games

Publisher:
IEEE
Publication Type:
Conference Proceeding
Citation:
IEEE Conference on Computational Intelligence and Games, CIG, 2020, 2020-August, pp. 65-72
Issue Date:
2020-08-01
Filename:
09231622.pdf (Published version, Adobe PDF, 3.47 MB)
Abstract:
In this paper, we study transformers for text-based games. As a promising replacement for recurrent modules in Natural Language Processing (NLP) tasks, the transformer architecture can serve as a powerful state-representation generator for reinforcement learning. However, the vanilla transformer, with its huge number of weight parameters, is neither effective nor efficient to train in this setting. Unlike existing research that encodes states using LSTMs or GRUs, we develop a novel lightweight transformer-based representation generator featuring reordered layer normalization, weight sharing, and block-wise aggregation. The experimental results show that our proposed model not only solves single games with far fewer interactions, but also generalizes better to a set of unseen games. Furthermore, our model outperforms state-of-the-art agents on a variety of man-made games.
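
Illustration:
As a rough illustration of the three ideas named in the abstract, the sketch below wires up a pre-layer-normalized transformer block, reuses its weights across several passes (weight sharing), and averages the per-pass pooled outputs (block-wise aggregation) into a single state representation. The class names, hyperparameters, and pooling choice are assumptions made for illustration, not the paper's implementation; PyTorch is used purely for convenience.

# Minimal sketch (assumed names/hyperparameters, not the authors' code) of
# reordered (pre-) layer normalization, weight sharing across transformer
# blocks, and block-wise aggregation of each block's output.
import torch
import torch.nn as nn


class PreLNBlock(nn.Module):
    """Transformer encoder block with layer normalization applied before each sub-layer."""

    def __init__(self, d_model=128, n_heads=4, d_ff=256, dropout=0.1):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, dropout=dropout, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.ffn = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model)
        )

    def forward(self, x):
        # Reordered layer norm: normalize, then attend, then add the residual.
        h = self.ln1(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + attn_out
        # Same pre-LN ordering for the feed-forward sub-layer.
        x = x + self.ffn(self.ln2(x))
        return x


class SharedBlockEncoder(nn.Module):
    """Applies one weight-shared block several times and aggregates the per-block outputs."""

    def __init__(self, d_model=128, n_layers=4):
        super().__init__()
        self.block = PreLNBlock(d_model)  # one set of weights reused n_layers times
        self.n_layers = n_layers

    def forward(self, x):
        block_outputs = []
        for _ in range(self.n_layers):
            x = self.block(x)                    # weight sharing: same block on every pass
            block_outputs.append(x.mean(dim=1))  # pool tokens within this block's output
        # Block-wise aggregation: average the pooled outputs of all passes to
        # form a single state representation for downstream RL policy/value heads.
        return torch.stack(block_outputs, dim=0).mean(dim=0)


if __name__ == "__main__":
    tokens = torch.randn(2, 20, 128)  # (batch, sequence length, embedding dim)
    state = SharedBlockEncoder()(tokens)
    print(state.shape)                # torch.Size([2, 128])

Because the same block is reused on every pass, the parameter count stays roughly that of a single layer rather than growing with depth, which is one plausible reading of "lightweight" in the abstract.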