Graph-based Chinese word sense disambiguation with multi-knowledge integration

Lu, W; Meng, F; Wang, S; Zhang, G; Zhang, X; Ouyang, A

Graph-based Chinese word sense disambiguation with multi-knowledge integration

Lu, W Meng, F Wang, S

Zhang, G

Zhang, X Ouyang, A

Permalink

Publication Type:: Journal Article
Citation:: Computers, Materials and Continua, 2019, 61 (1), pp. 197 - 212
Issue Date:: 2019-01-01

Closed Access

	Filename	Description	Size
	Graph-Based Chinese Word Sense Disambiguation with Multi-Knowledge Integration.pdf	Published Version	449.44 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Lu, W	en_US
dc.contributor.author	Meng, F	en_US
dc.contributor.author	Wang, S https://orcid.org/0000-0003-1133-9379	en_US
dc.contributor.author	Zhang, G https://orcid.org/0000-0003-4521-542X	en_US
dc.contributor.author	Zhang, X	en_US
dc.contributor.author	Ouyang, A	en_US
dc.date.accessioned	2020-03-30T04:30:06Z
dc.date.available	2020-03-30T04:30:06Z
dc.date.issued	2019-01-01	en_US
dc.identifier.citation	Computers, Materials and Continua, 2019, 61 (1), pp. 197 - 212	en_US
dc.identifier.issn	1546-2218	en_US
dc.identifier.uri	http://hdl.handle.net/10453/139614
dc.description.abstract	© 2019 Tech Science Press. All rights reserved. Word sense disambiguation (WSD) is a fundamental but significant task in natural language processing, which directly affects the performance of upper applications. However, WSD is very challenging due to the problem of knowledge bottleneck, i.e., it is hard to acquire abundant disambiguation knowledge, especially in Chinese. To solve this problem, this paper proposes a graph-based Chinese WSD method with multi-knowledge integration. Particularly, a graph model combining various Chinese and English knowledge resources by word sense mapping is designed. Firstly, the content words in a Chinese ambiguous sentence are extracted and mapped to English words with BabelNet. Then, English word similarity is computed based on English word embeddings and knowledge base. Chinese word similarity is evaluated with Chinese word embedding and HowNet, respectively. The weights of the three kinds of word similarity are optimized with simulated annealing algorithm so as to obtain their overall similarities, which are utilized to construct a disambiguation graph. The graph scoring algorithm evaluates the importance of each word sense node and judge the right senses of the ambiguous words. Extensive experimental results on SemEval dataset show that our proposed WSD method significantly outperforms the baselines.	en_US
dc.relation.ispartof	Computers, Materials and Continua	en_US
dc.relation.isbasedon	10.32604/cmc.2019.06068	en_US
dc.subject.classification	Applied Mathematics	en_US
dc.title	Graph-based Chinese word sense disambiguation with multi-knowledge integration	en_US
dc.type	Journal Article
utslib.citation.volume	1	en_US
utslib.citation.volume	61	en_US
utslib.for	0103 Numerical and Computational Mathematics	en_US
utslib.for	0912 Materials Engineering	en_US
utslib.for	0915 Interdisciplinary Engineering	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.issue	1	en_US
pubs.publication-status	Published	en_US
pubs.volume	61	en_US

Abstract:

© 2019 Tech Science Press. All rights reserved. Word sense disambiguation (WSD) is a fundamental but significant task in natural language processing, which directly affects the performance of upper applications. However, WSD is very challenging due to the problem of knowledge bottleneck, i.e., it is hard to acquire abundant disambiguation knowledge, especially in Chinese. To solve this problem, this paper proposes a graph-based Chinese WSD method with multi-knowledge integration. Particularly, a graph model combining various Chinese and English knowledge resources by word sense mapping is designed. Firstly, the content words in a Chinese ambiguous sentence are extracted and mapped to English words with BabelNet. Then, English word similarity is computed based on English word embeddings and knowledge base. Chinese word similarity is evaluated with Chinese word embedding and HowNet, respectively. The weights of the three kinds of word similarity are optimized with simulated annealing algorithm so as to obtain their overall similarities, which are utilized to construct a disambiguation graph. The graph scoring algorithm evaluates the importance of each word sense node and judge the right senses of the ambiguous words. Extensive experimental results on SemEval dataset show that our proposed WSD method significantly outperforms the baselines.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/139614