Multi-graph learning with positive and unlabeled bags

Wu, J; Hong, Z; Pan, S; Zhu, X; Zhang, C; Cai, Z

Multi-graph learning with positive and unlabeled bags

Wu, J Hong, Z Pan, S

Zhu, X Zhang, C

Cai, Z

Permalink

Publication Type:: Conference Proceeding
Citation:: SIAM International Conference on Data Mining 2014, SDM 2014, 2014, 1 pp. 217 - 225
Issue Date:: 2014-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (4.24 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Wu, J	en_US
dc.contributor.author	Hong, Z	en_US
dc.contributor.author	Pan, S https://orcid.org/0000-0003-0794-527X	en_US
dc.contributor.author	Zhu, X	en_US
dc.contributor.author	Zhang, C https://orcid.org/0000-0001-5715-7154	en_US
dc.contributor.author	Cai, Z	en_US
dc.date.issued	2014-01-01	en_US
dc.identifier.citation	SIAM International Conference on Data Mining 2014, SDM 2014, 2014, 1 pp. 217 - 225	en_US
dc.identifier.isbn	9781510811515	en_US
dc.identifier.uri	http://hdl.handle.net/10453/33940
dc.description.abstract	© SIAM. In this paper, we formulate a new multi-graph learning task with only positive and unlabeled bags, where labels are only available for bags but not for individual graphs inside the bag. This problem setting raises significant challenges because bag-of-graph setting does not have features to directly represent graph data, and no negative bags exits for deriving discriminative classification models. To solve the challenge, we propose a puMGL learning framework which relies on two iteratively combined processes for multigraph learning: (1) deriving features to represent graphs for learning; and (2) deriving discriminative models with only positive and unlabeled graph bags. For the former, we derive a subgraph scoring criterion to select a set of informative subgraphs to convert each graph into a feature space. To handle unlabeled bags, we assign a weight value to each bag and use the adjusted weight values to select most promising unlabeled bags as negative bags. A margin graph pool (MGP), which contains some representative graphs from positive bags and identified negative bags, is used for selecting subgraphs and training graph classifiers. The iterative subgraph scoring, bag weight updating, and MGP based graph classification forms a closed loop to find optimal subgraphs and most suitable unlabeled bags for multi-graph learning. Experiments and comparisons on real-world multigraph data demonstrate the algorithm performance. Copyright	en_US
dc.relation.ispartof	SIAM International Conference on Data Mining 2014, SDM 2014	en_US
dc.relation.isbasedon	10.1137/1.9781611973440.25	en_US
dc.title	Multi-graph learning with positive and unlabeled bags	en_US
dc.type	Conference Proceeding
utslib.citation.volume	1	en_US
utslib.for	080109 Pattern Recognition and Data Mining	en_US
dc.location.activity	Philadelphia, Pennsylvania, USA
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/DVC (International)
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
pubs.organisational-group	/University of Technology Sydney/Strength - ACRI - Australia China Relations Institute
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	open_access
pubs.publication-status	Published	en_US
pubs.volume	1	en_US

Abstract:

© SIAM. In this paper, we formulate a new multi-graph learning task with only positive and unlabeled bags, where labels are only available for bags but not for individual graphs inside the bag. This problem setting raises significant challenges because bag-of-graph setting does not have features to directly represent graph data, and no negative bags exits for deriving discriminative classification models. To solve the challenge, we propose a puMGL learning framework which relies on two iteratively combined processes for multigraph learning: (1) deriving features to represent graphs for learning; and (2) deriving discriminative models with only positive and unlabeled graph bags. For the former, we derive a subgraph scoring criterion to select a set of informative subgraphs to convert each graph into a feature space. To handle unlabeled bags, we assign a weight value to each bag and use the adjusted weight values to select most promising unlabeled bags as negative bags. A margin graph pool (MGP), which contains some representative graphs from positive bags and identified negative bags, is used for selecting subgraphs and training graph classifiers. The iterative subgraph scoring, bag weight updating, and MGP based graph classification forms a closed loop to find optimal subgraphs and most suitable unlabeled bags for multi-graph learning. Experiments and comparisons on real-world multigraph data demonstrate the algorithm performance. Copyright

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/33940