OpenWGL: Open-World Graph Learning

Wu, M; Pan, S; Zhu, X

Publisher: IEEE
Publication Type: Conference Proceeding
Citation: 2020 IEEE International Conference on Data Mining (ICDM), 2021, 2020-November, pp. 681-690
Issue Date: 2021-02-09

Closed Access

	Filename	Description	Size	Format
	09338284.pdf	Published version	786.45 kB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wu, M
dc.contributor.author	Pan, S https://orcid.org/0000-0003-0794-527X
dc.contributor.author	Zhu, X
dc.date	2020-11-17
dc.date.accessioned	2021-04-14T19:55:11Z
dc.date.available	2021-04-14T19:55:11Z
dc.date.issued	2021-02-09
dc.identifier.citation	2020 IEEE International Conference on Data Mining (ICDM), 2021, 2020-November, pp. 681-690
dc.identifier.isbn	978-1-7281-8316-9
dc.identifier.issn	1550-4786
dc.identifier.issn	2374-8486
dc.identifier.uri	http://hdl.handle.net/10453/148107
dc.description.abstract	In traditional graph learning tasks, such as node classification, learning is carried out in a closed-world setting where the number of classes and their training samples are provided to help train models, and the learning goal is to correctly classify unlabeled nodes into classes already known. In reality, due to limited labeling capability and the dynamic evolution of networks, some nodes in the networks may not belong to any existing/seen classes, and therefore cannot be correctly classified by closed-world learning algorithms. In this paper, we propose a new open-world graph learning paradigm, where the learning goal is not only to classify nodes belonging to seen classes into the correct groups, but also to classify nodes not belonging to existing classes into an unseen class. The essential challenges of open-world graph learning are that (1) the unseen class has no labeled samples, and may exist in an arbitrary form different from existing seen classes; and (2) both graph feature learning and prediction should differentiate whether a node may belong to an existing/seen class or an unseen class. To tackle these challenges, we propose an uncertain node representation learning approach, using constrained variational graph autoencoder networks, where the label loss and class uncertainty loss constraints are used to ensure that the node representation learning is sensitive to the unseen class. As a result, node embedding features are denoted by distributions, instead of deterministic feature vectors. By using a sampling process to generate multiple versions of feature vectors, we are able to test the certainty of a node belonging to seen classes, and automatically determine a threshold to reject nodes not belonging to seen classes as unseen class nodes. Experiments on real-world networks demonstrate the algorithm's performance compared to baselines. Case studies and ablation analysis also show the rationale of our design for open-world graph learning.
dc.language	en
dc.publisher	IEEE
dc.relation.ispartof	2020 IEEE International Conference on Data Mining (ICDM)
dc.relation.ispartof	IEEE International Conference on Data Mining
dc.relation.isbasedon	10.1109/icdm50108.2020.00077
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	OpenWGL: Open-World Graph Learning
dc.type	Conference Proceeding
utslib.citation.volume	2020-November
utslib.location.activity	Sorrento, Italy
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2021-04-14T19:55:10Z
pubs.finish-date	2020-11-20
pubs.place-of-publication	Piscataway, USA
pubs.publication-status	Published
pubs.start-date	2020-11-17
pubs.volume	2020-November
dc.location	Piscataway, USA

Abstract:

In traditional graph learning tasks, such as node classification, learning is carried out in a closed-world setting where the number of classes and their training samples are provided to help train models, and the learning goal is to correctly classify unlabeled nodes into classes already known. In reality, due to limited labeling capability and the dynamic evolution of networks, some nodes in the networks may not belong to any existing/seen classes, and therefore cannot be correctly classified by closed-world learning algorithms. In this paper, we propose a new open-world graph learning paradigm, where the learning goal is not only to classify nodes belonging to seen classes into the correct groups, but also to classify nodes not belonging to existing classes into an unseen class. The essential challenges of open-world graph learning are that (1) the unseen class has no labeled samples, and may exist in an arbitrary form different from existing seen classes; and (2) both graph feature learning and prediction should differentiate whether a node may belong to an existing/seen class or an unseen class. To tackle these challenges, we propose an uncertain node representation learning approach, using constrained variational graph autoencoder networks, where the label loss and class uncertainty loss constraints are used to ensure that the node representation learning is sensitive to the unseen class. As a result, node embedding features are denoted by distributions, instead of deterministic feature vectors. By using a sampling process to generate multiple versions of feature vectors, we are able to test the certainty of a node belonging to seen classes, and automatically determine a threshold to reject nodes not belonging to seen classes as unseen class nodes. Experiments on real-world networks demonstrate the algorithm's performance compared to baselines. Case studies and ablation analysis also show the rationale of our design for open-world graph learning.
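
The sampling-and-rejection step described in the abstract can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes each node embedding is a diagonal Gaussian given by `mu` and `sigma`, that a linear softmax classifier (`W`, `b`) over the seen classes is already trained, and that the rejection `threshold` is supplied by hand, whereas the paper determines it automatically. The function name `open_world_predict` and the label `-1` for the unseen class are hypothetical.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def open_world_predict(mu, sigma, W, b, threshold, n_samples=50, seed=0):
    """Classify nodes into seen classes, or reject them as unseen (-1).

    mu, sigma : (n_nodes, dim) mean and std of each node's embedding
                distribution (assumed diagonal Gaussian).
    W, b      : hypothetical linear classifier over the seen classes.
    threshold : hand-picked here; an assumption, not the paper's rule.
    """
    rng = np.random.default_rng(seed)
    mean_probs = np.zeros((mu.shape[0], W.shape[1]))
    for _ in range(n_samples):
        # Draw one version of the feature vectors from the distributions.
        z = mu + sigma * rng.standard_normal(mu.shape)
        mean_probs += softmax(z @ W + b)
    mean_probs /= n_samples

    confidence = mean_probs.max(axis=1)   # certainty for the best seen class
    labels = mean_probs.argmax(axis=1)
    labels[confidence < threshold] = -1   # reject low-certainty nodes as unseen
    return labels, confidence


# Toy data: node 0 sits clearly in class 0, node 1 lies between both classes.
mu = np.array([[3.0, 0.0], [0.0, 0.0]])
sigma = np.full((2, 2), 0.1)
W = 5.0 * np.eye(2)
b = np.zeros(2)
labels, confidence = open_world_predict(mu, sigma, W, b, threshold=0.8)
print(labels, confidence)
```

In this toy setup the first node is assigned to a seen class with high averaged probability, while the second node's averaged probabilities stay close to uniform, so it falls below the threshold and is rejected as unseen.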

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/148107