Bi-level Masked Multi-scale CNN-RNN Networks for Short Text Representation

Li, Q; Wu, Q; Zhu, C; Zhang, J

Publication Type: Conference Proceeding
Citation: 2019
Issue Date: 2019-09-20

This item is open access.

The embargo period expires on 20 Sep 2021

Full metadata record

Field	Value	Language
dc.contributor.author	Li, Q https://orcid.org/0000-0002-6149-5081	en_US
dc.contributor.author	Wu, Q https://orcid.org/0000-0001-5641-2483	en_US
dc.contributor.author	Zhu, C	en_US
dc.contributor.author	Zhang, J	en_US
dc.date	2019-09-20	en_US
dc.date.issued	2019-09-20	en_US
dc.identifier.citation	2019	en_US
dc.identifier.uri	http://hdl.handle.net/10453/133900
dc.description.abstract	Representing short text is becoming increasingly important for a wide range of applications. It is also challenging, because a short text contains many informal words and typos (the noise problem) but only a few words (the sparsity problem). Most existing work on short text representation relies on noise recognition and sparsity expansion. However, noise in short text takes various forms and changes quickly, so most current methods may fail to recognize it adaptively. It is also hard to explicitly expand a sparse text into a high-quality dense text. In this paper, we tackle the noise and sparsity problems in short text representation by learning multi-grain noise-tolerant patterns and then embedding the most significant patterns in a text as its representation. To achieve this goal, we propose a bi-level multi-scale masked CNN-RNN network that embeds the most significant multi-grain noise-tolerant relations among words and characters in a text into a dense vector space. Comprehensive experiments on five large real-world data sets demonstrate that our method significantly outperforms state-of-the-art competitors.	en_US
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.title	Bi-level Masked Multi-scale CNN-RNN Networks for Short Text Representation	en_US
dc.type	Conference Proceeding
utslib.location.activity	Sydney, Australia	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Strength - INEXT - Innovation in IT Services and Applications
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	open_access	*
pubs.consider-herdc	false	en_US
utslib.copyright.embargo	2021-09-20T00:00:00+1000
pubs.finish-date	2019-09-25	en_US
pubs.start-date	2019-09-20	en_US

Abstract:

Representing short text is becoming increasingly important for a wide range of applications. It is also challenging, because a short text contains many informal words and typos (the noise problem) but only a few words (the sparsity problem). Most existing work on short text representation relies on noise recognition and sparsity expansion. However, noise in short text takes various forms and changes quickly, so most current methods may fail to recognize it adaptively. It is also hard to explicitly expand a sparse text into a high-quality dense text. In this paper, we tackle the noise and sparsity problems in short text representation by learning multi-grain noise-tolerant patterns and then embedding the most significant patterns in a text as its representation. To achieve this goal, we propose a bi-level multi-scale masked CNN-RNN network that embeds the most significant multi-grain noise-tolerant relations among words and characters in a text into a dense vector space. Comprehensive experiments on five large real-world data sets demonstrate that our method significantly outperforms state-of-the-art competitors.
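
The abstract gives no layer-level details, so the following is only an illustrative sketch of the general idea it names, not the authors' network. It assumes that "bi-level" means processing a text at both the character and the word level, that "multi-scale" means convolution windows of several widths, that "most significant patterns" can be approximated by max-over-time pooling, and that a "mask" lets a filter ignore one position in its window. The RNN part is omitted entirely. All function names, the random stand-in embeddings and filters, and the default sizes are hypothetical.

```python
import random


def embed_tokens(tokens, dim, seed=0):
    """Give each distinct token a fixed pseudo-random vector (stand-in for learned embeddings)."""
    vectors = []
    for token in tokens:
        rng = random.Random(f"{seed}:{token}")  # same token -> same vector
        vectors.append([rng.uniform(-1.0, 1.0) for _ in range(dim)])
    return vectors


def make_filters(width, dim, count, seed):
    """Create `count` random filters, each spanning `width` positions of `dim` values."""
    rng = random.Random(seed)
    return [[[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(width)]
            for _ in range(count)]


def conv_max_pool(vectors, width, filters, dim, mask=None):
    """Slide each filter over the sequence and keep its strongest response."""
    # Zero-pad short inputs so that at least one full window exists.
    padded = vectors + [[0.0] * dim] * max(0, width - len(vectors))
    pooled = []
    for filt in filters:
        best = float("-inf")
        for start in range(len(padded) - width + 1):
            response = 0.0
            for offset in range(width):
                if mask is not None and not mask[offset]:
                    continue  # masked window position contributes nothing (assumed meaning of "mask")
                response += sum(w * x for w, x in zip(filt[offset], padded[start + offset]))
            best = max(best, response)
        pooled.append(best)
    return pooled


def represent(text, widths=(2, 3), dim=4, count=3):
    """Concatenate pooled responses from the character level and the word level."""
    levels = [list(text.replace(" ", "")), text.split()]  # assumed reading of "bi-level"
    representation = []
    for level_index, tokens in enumerate(levels):
        vectors = embed_tokens(tokens, dim)
        for width in widths:  # several window widths: assumed reading of "multi-scale"
            filters = make_filters(width, dim, count, seed=100 * level_index + width)
            representation.extend(conv_max_pool(vectors, width, filters, dim))
    return representation


vector = represent("gr8 movie luv it")
print(len(vector))  # 2 levels * 2 widths * 3 filters = 12
```

Under these assumptions, every text maps to a vector of the same length regardless of how short it is, and a masked filter gives the same response when only the ignored position differs, which is one simple way to picture tolerance to a typo.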

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/133900