DuCL: Dual-stage contrastive learning framework for Chinese semantic textual matching

Publisher:
Elsevier
Publication Type:
Journal Article
Citation:
Computers and Electrical Engineering, 2023, 106, pp. 108574
Issue Date:
2023-03-01
Chinese semantic textual matching is a fundamental yet challenging task in natural language processing (NLP). Its core problem is how to accurately capture both the features within a single piece of text and the interactive features between pieces of text. Although pretrained language models (PLMs) and contrastive learning (CL) have been applied to this problem with some success, existing works usually use contrastive learning to fine-tune PLMs from only a single perspective, such as the sentence level or the pair level, and thus fail to capture semantic features from the other perspective, leading to inefficient learning and suboptimal performance. To tackle this problem, we propose a novel dual-stage contrastive learning framework (DuCL) for Chinese semantic textual matching. Specifically, DuCL consists of two sequential stages, i.e., sentence-level CL followed by pair-level CL, each of which fine-tunes the PLM from the corresponding perspective. In addition, DuCL introduces a block-enhanced interaction module that integrates token-level and block-level interactive features to generate a semantic matching representation for the two pieces of text. Extensive experimental results on two real-world public datasets demonstrate that our method outperforms representative and state-of-the-art methods.
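To make the two perspectives concrete, the sketch below illustrates the general shape of such dual-perspective contrastive objectives: an in-batch InfoNCE loss over sentence embeddings (sentence level) and a supervised contrastive loss over text-pair representations (pair level). This is a minimal illustration, not the authors' released code; the encoder, temperature, batch layout, and function names are assumptions.

```python
# Illustrative sketch of a sentence-level and a pair-level contrastive loss
# (assumed formulation; not DuCL's official implementation).
import torch
import torch.nn.functional as F


def sentence_level_infonce(z1: torch.Tensor, z2: torch.Tensor, tau: float = 0.05) -> torch.Tensor:
    """Stage 1 (sentence level): contrast two views (z1[i], z2[i]) of the same
    sentence against the other sentences in the batch. z1, z2: (batch, dim)."""
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / tau                      # (batch, batch) similarity matrix
    labels = torch.arange(z1.size(0), device=z1.device)
    return F.cross_entropy(logits, labels)


def pair_level_contrastive(pair_repr: torch.Tensor, labels: torch.Tensor, tau: float = 0.05) -> torch.Tensor:
    """Stage 2 (pair level): pull representations of pairs with the same matching
    label together and push apart the rest (a supervised contrastive loss).
    pair_repr: (batch, dim) joint pair representations; labels: (batch,) in {0, 1}."""
    pair_repr = F.normalize(pair_repr, dim=-1)
    sim = pair_repr @ pair_repr.t() / tau
    eye = torch.eye(labels.size(0), dtype=torch.bool, device=labels.device)
    sim = sim.masked_fill(eye, float("-inf"))       # exclude self-similarity
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~eye
    log_prob = sim - sim.logsumexp(dim=1, keepdim=True)
    # Average log-probability over each anchor's positives (skip anchors with none).
    masked_log_prob = torch.where(pos_mask, log_prob, torch.zeros_like(log_prob))
    pos_counts = pos_mask.sum(dim=1).clamp(min=1)
    per_anchor = -masked_log_prob.sum(dim=1) / pos_counts
    return per_anchor[pos_mask.any(dim=1)].mean()
```

In a dual-stage setup of this kind, the sentence-level loss would be used to fine-tune the PLM encoder first, and the pair-level loss would then be applied to the joint pair representations produced by an interaction module such as the block-enhanced one described above.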