Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision

Publication Type:
Conference Proceeding
Citation:
Findings of the Association for Computational Linguistics: NAACL 2022 - Findings, 2022, pp. 316-326
Issue Date:
2022-01-01
Distant supervision uses triple facts in knowledge graphs to label a corpus for relation extraction, leading to wrong labeling and long-tail problems. Some works use the hierarchy of relations for knowledge transfer to long-tail relations. However, a coarse-grained relation often implies only an attribute (e.g., domain or topic) of the distant fact, making it hard to discriminate relations based solely on sentence semantics. One solution is resorting to entity types, but open questions remain about how to fully leverage the information of entity types and how to align multi-granular entity types with sentences. In this work, we propose a novel model to enrich distantly-supervised sentences with entity types. It consists of (1) a pairwise type-enriched sentence encoding module injecting both context-free and -related backgrounds to alleviate sentence-level wrong labeling, and (2) a hierarchical type-sentence alignment module enriching a sentence with the triple fact's basic attributes to support long-tail relations. Our model achieves new state-of-the-art results in overall and long-tail performance on benchmarks.