Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision

Li, Y; Long, G; Shen, T; Jiang, J

Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision

Li, Y Long, G

Shen, T Jiang, J

Permalink

Publication Type:: Conference Proceeding
Citation:: Findings of the Association for Computational Linguistics: NAACL 2022 - Findings, 2022, pp. 316-326
Issue Date:: 2022-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Submitted versionAdobe PDF (700.58 kB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Li, Y
dc.contributor.author	Long, G https://orcid.org/0000-0003-3740-9515
dc.contributor.author	Shen, T
dc.contributor.author	Jiang, J https://orcid.org/0000-0001-5301-7779
dc.date.accessioned	2023-03-14T23:32:27Z
dc.date.available	2023-03-14T23:32:27Z
dc.date.issued	2022-01-01
dc.identifier.citation	Findings of the Association for Computational Linguistics: NAACL 2022 - Findings, 2022, pp. 316-326
dc.identifier.isbn	9781955917766
dc.identifier.uri	http://hdl.handle.net/10453/167337
dc.description.abstract	Distant supervision uses triple facts in knowledge graphs to label a corpus for relation extraction, leading to wrong labeling and longtail problems. Some works use the hierarchy of relations for knowledge transfer to longtail relations. However, a coarse-grained relation often implies only an attribute (e.g., domain or topic) of the distant fact, making it hard to discriminate relations based solely on sentence semantics. One solution is resorting to entity types, but open questions remain about how to fully leverage the information of entity types and how to align multi-granular entity types with sentences. In this work, we propose a novel model to enrich distantlysupervised sentences with entity types. It consists of (1) a pairwise type-enriched sentence encoding module injecting both context-free and -related backgrounds to alleviate sentencelevel wrong labeling, and (2) a hierarchical type-sentence alignment module enriching a sentence with the triple fact's basic attributes to support long-tail relations. Our model achieves new state-of-the-art results in overall and long-tail performance on benchmarks.
dc.language	en
dc.relation.ispartof	Findings of the Association for Computational Linguistics: NAACL 2022 - Findings
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision
dc.type	Conference Proceeding
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	open_access	*
dc.date.updated	2023-03-14T23:32:26Z
pubs.publication-status	Published

Abstract:

Distant supervision uses triple facts in knowledge graphs to label a corpus for relation extraction, leading to wrong labeling and longtail problems. Some works use the hierarchy of relations for knowledge transfer to longtail relations. However, a coarse-grained relation often implies only an attribute (e.g., domain or topic) of the distant fact, making it hard to discriminate relations based solely on sentence semantics. One solution is resorting to entity types, but open questions remain about how to fully leverage the information of entity types and how to align multi-granular entity types with sentences. In this work, we propose a novel model to enrich distantlysupervised sentences with entity types. It consists of (1) a pairwise type-enriched sentence encoding module injecting both context-free and -related backgrounds to alleviate sentencelevel wrong labeling, and (2) a hierarchical type-sentence alignment module enriching a sentence with the triple fact's basic attributes to support long-tail relations. Our model achieves new state-of-the-art results in overall and long-tail performance on benchmarks.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/167337