Chinese Sentence Semantic Matching Based on Multi-Granularity Fusion Model

Publication Type:
Conference Proceeding
Advances in Knowledge Discovery and Data Mining, 2020, 12085, pp. 246-257
Issue Date:
Filename Description Size
Zhang2020_Chapter_ChineseSentenceSemanticMatchin.pdf783.48 kB
Adobe PDF
Full metadata record
Sentence semantic matching is the cornerstone of many natural language processing tasks, including Chinese language processing. It is well known that Chinese sentences with different polysemous words or word order may have totally different semantic meanings. Thus, to represent and match the sentence semantic meaning accurately, one challenge that must be solved is how to capture the semantic features from the multi-granularity perspective, e.g., characters and words. To address the above challenge, we propose a novel sentence semantic matching model which is based on the fusion of semantic features from character-granularity and word-granularity, respectively. Particularly, the multi-granularity fusion intends to extract more semantic features to better optimize the downstream sentence semantic matching. In addition, we propose the equilibrium cross-entropy, a novel loss function, by setting mean square error (MSE) as an equilibrium factor of cross-entropy. The experimental results conducted on Chinese open data set demonstrate that our proposed model combined with binary equilibrium cross-entropy loss function is superior to the existing state-of-the-art sentence semantic matching models.
Please use this identifier to cite or link to this item: