Domain-specific meta-embedding with latent semantic structures

Liu, Q; Lu, J; Zhang, G; Shen, T; Zhang, Z; Huang, H

Publisher: Elsevier
Publication Type: Journal Article
Citation: Information Sciences, 2021, 555, pp. 410-423
Issue Date: 2021

Closed Access

	Filename	Description	Size	Format
	1-s2.0-S002002552031029X-main.pdf	Published version	631.33 kB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Liu, Q
dc.contributor.author	Lu, J https://orcid.org/0000-0003-0690-4732
dc.contributor.author	Zhang, G https://orcid.org/0000-0003-3960-0583
dc.contributor.author	Shen, T
dc.contributor.author	Zhang, Z
dc.contributor.author	Huang, H
dc.date.accessioned	2022-04-18T02:08:59Z
dc.date.available	2022-04-18T02:08:59Z
dc.date.issued	2021
dc.identifier.citation	Information Sciences, 2021, 555, pp. 410-423
dc.identifier.issn	0020-0255
dc.identifier.issn	1872-6291
dc.identifier.uri	http://hdl.handle.net/10453/156320
dc.description.abstract	Meta-embedding aims at assembling pre-trained embeddings from various sources and producing more expressively powerful word representations. Many natural language processing (NLP) tasks in a specific domain benefit from meta-embedding, especially when the task suffers from low resources. This paper proposes an unsupervised meta-embedding method that jointly models background knowledge from the source embeddings and domain-specific knowledge from the task domain. Specifically, embeddings from multiple sources for a word are dynamically aggregated to a single meta-embedding by a differentiable attention module. The embeddings derived from pre-training on a large-scale corpus provide complete background knowledge of word usage. Then, the meta-embedding is further enriched by exploring domain-specific knowledge from each task domain in two ways. First, contextual information in the raw corpus is considered to capture the semantics of words. Second, a graph representing domain-specific semantic structures is extracted from the raw corpus to highlight the relationships between salient words, then the graph is modeled by a powerful graph convolution network to effectively capture rich semantic structures among words in the task domain. Experiments conducted on two tasks, i.e., text classification and relation extraction, show that our model outputs more accurate word meta-embeddings for the task domain, compared to other state-of-the-art competitors.
dc.language	en
dc.publisher	Elsevier
dc.relation	http://purl.org/au-research/grants/arc/DP170101632
dc.relation.ispartof	Information Sciences
dc.relation.isbasedon	10.1016/j.ins.2020.10.030
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	01 Mathematical Sciences, 08 Information and Computing Sciences, 09 Engineering
dc.subject.classification	Artificial Intelligence & Image Processing
dc.title	Domain-specific meta-embedding with latent semantic structures
dc.type	Journal Article
utslib.citation.volume	555
utslib.for	01 Mathematical Sciences
utslib.for	08 Information and Computing Sciences
utslib.for	09 Engineering
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2022-04-18T02:08:58Z
pubs.publication-status	Published
pubs.volume	555

Abstract:

Meta-embedding aims at assembling pre-trained embeddings from various sources and producing more expressively powerful word representations. Many natural language processing (NLP) tasks in a specific domain benefit from meta-embedding, especially when the task suffers from low resources. This paper proposes an unsupervised meta-embedding method that jointly models background knowledge from the source embeddings and domain-specific knowledge from the task domain. Specifically, embeddings from multiple sources for a word are dynamically aggregated to a single meta-embedding by a differentiable attention module. The embeddings derived from pre-training on a large-scale corpus provide complete background knowledge of word usage. Then, the meta-embedding is further enriched by exploring domain-specific knowledge from each task domain in two ways. First, contextual information in the raw corpus is considered to capture the semantics of words. Second, a graph representing domain-specific semantic structures is extracted from the raw corpus to highlight the relationships between salient words, then the graph is modeled by a powerful graph convolution network to effectively capture rich semantic structures among words in the task domain. Experiments conducted on two tasks, i.e., text classification and relation extraction, show that our model outputs more accurate word meta-embeddings for the task domain, compared to other state-of-the-art competitors.
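
The aggregation step mentioned in the abstract, where several source embeddings of one word are combined into a single meta-embedding by attention, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the dot-product scoring against a `query` vector, and the assumption that all source embeddings have already been projected to a common dimension are hypothetical choices made only for illustration. In a trained model the query (or an equivalent scoring function) would be learned by gradient descent, which is possible because every operation here is differentiable.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate(source_vectors, query):
    """Combine one word's source embeddings into a single vector.

    source_vectors: list of equal-length vectors, one per embedding source
                    (assumed already projected to a common dimension).
    query:          hypothetical attention parameter of the same length.
    Returns the attention-weighted sum and the attention weights.
    """
    # Score each source by its dot product with the query.
    scores = [sum(q * x for q, x in zip(query, vec)) for vec in source_vectors]
    weights = softmax(scores)
    dim = len(query)
    # Weighted sum of the source vectors, dimension by dimension.
    meta = [
        sum(w * vec[i] for w, vec in zip(weights, source_vectors))
        for i in range(dim)
    ]
    return meta, weights


# Toy example: two sources, two dimensions, an uninformative (zero) query.
sources = [[1.0, 0.0], [0.0, 1.0]]
meta, weights = aggregate(sources, [0.0, 0.0])
```

With a zero query every source receives the same score, so the weights are uniform and the result is the plain average of the sources. A non-zero query shifts weight toward the sources it aligns with, which is the sense in which the aggregation is dynamic per word.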

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/156320