A Comprehensive Survey on Word Representation Models: From Classical to State-of-the-Art Word Representation Language Models

Naseem, U; Razzak, I; Khan, SK; Prasad, M

Publisher: Association for Computing Machinery (ACM)
Publication Type: Journal Article
Citation: ACM Transactions on Asian and Low-Resource Language Information Processing, 2021, 20, (5), pp. 1-35
Issue Date: 2021-09-01

Closed Access

Filename	Description	Size	Format
2010.15036.pdf	Published version	1.36 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Naseem, U
dc.contributor.author	Razzak, I
dc.contributor.author	Khan, SK
dc.contributor.author	Prasad, M https://orcid.org/0000-0002-7745-9667
dc.date.accessioned	2022-05-15T21:27:15Z
dc.date.available	2022-05-15T21:27:15Z
dc.date.issued	2021-09-01
dc.identifier.citation	ACM Transactions on Asian and Low-Resource Language Information Processing, 2021, 20, (5), pp. 1-35
dc.identifier.issn	2375-4699
dc.identifier.issn	2375-4702
dc.identifier.uri	http://hdl.handle.net/10453/157373
dc.description.abstract	Word representation has always been an important research area in the history of natural language processing (NLP). Understanding such complex text data is imperative, given that it is rich in information and can be used widely across various applications. In this survey, we explore different word representation models and their power of expression, from the classical to modern-day state-of-the-art (SOTA) word representation language models (LMs). We describe a variety of text representation methods and model designs that have blossomed in the context of NLP, including SOTA LMs. These models can transform large volumes of text into effective vector representations that capture the same semantic information. Further, such representations can be utilized by various machine learning (ML) algorithms for a variety of NLP-related tasks. Finally, this survey briefly discusses the commonly used ML- and deep learning (DL)-based classifiers, evaluation metrics, and the applications of these word embeddings in different NLP tasks.
dc.language	en
dc.publisher	Association for Computing Machinery (ACM)
dc.relation.ispartof	ACM Transactions on Asian and Low-Resource Language Information Processing
dc.relation.isbasedon	10.1145/3434237
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	A Comprehensive Survey on Word Representation Models: From Classical to State-of-the-Art Word Representation Language Models
dc.type	Journal Article
utslib.citation.volume	20
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	closed_access	*
dc.date.updated	2022-05-15T21:27:13Z
pubs.issue	5
pubs.publication-status	Published
pubs.volume	20
utslib.citation.issue	5

Abstract:

Word representation has always been an important research area in the history of natural language processing (NLP). Understanding such complex text data is imperative, given that it is rich in information and can be used widely across various applications. In this survey, we explore different word representation models and their power of expression, from the classical to modern-day state-of-the-art (SOTA) word representation language models (LMs). We describe a variety of text representation methods and model designs that have blossomed in the context of NLP, including SOTA LMs. These models can transform large volumes of text into effective vector representations that capture the same semantic information. Further, such representations can be utilized by various machine learning (ML) algorithms for a variety of NLP-related tasks. Finally, this survey briefly discusses the commonly used ML- and deep learning (DL)-based classifiers, evaluation metrics, and the applications of these word embeddings in different NLP tasks.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/157373