Supervised sampling for networked data

Fang, M; Yin, J; Zhu, X

Supervised sampling for networked data

Fang, M Yin, J Zhu, X

Permalink

Publication Type:: Journal Article
Citation:: Signal Processing, 2016, 124 pp. 93 - 102
Issue Date:: 2016-07-01

Closed Access

	Filename	Description	Size
	Supervised.pdf	Published Version	703.79 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Fang, M	en_US
dc.contributor.author	Yin, J	en_US
dc.contributor.author	Zhu, X	en_US
dc.date.issued	2016-07-01	en_US
dc.identifier.citation	Signal Processing, 2016, 124 pp. 93 - 102	en_US
dc.identifier.issn	0165-1684	en_US
dc.identifier.uri	http://hdl.handle.net/10453/95800
dc.description.abstract	© 2015 Elsevier B.V. All rights reserved. Traditional graph sampling methods reduce the size of a large network via uniform sampling of nodes from the original network. The sampled network can be used to estimate the topological properties of the original network. However, in some application domains (e.g., disease surveillance), the goal of sampling is also to help identify a specified category of nodes (e.g., affected individuals) in a large network. This work therefore aims to, given a large information network, sample a subgraph under a specific goal of acquiring as many nodes with a particular category as possible. We refer to this problem as supervised sampling, where we sample a large network for a specific category of nodes. To this end, we model a network as a Markov chain and derive supervised random walks to learn stationary distributions of the sampled network. The learned stationary distribution can help identify the best node to be sampled in the next iteration. The iterative sampling process ensures that with new sampled nodes being acquired, supervised sampling can be strengthened in turn. Experiments on synthetic as well as real-world networks show that our supervised sampling algorithm outperforms existing methods in obtaining target nodes in the sampled networks.	en_US
dc.relation.ispartof	Signal Processing	en_US
dc.relation.isbasedon	10.1016/j.sigpro.2015.09.040	en_US
dc.subject.classification	Networking & Telecommunications	en_US
dc.title	Supervised sampling for networked data	en_US
dc.type	Journal Article
utslib.citation.volume	124	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	09 Engineering	en_US
utslib.for	10 Technology	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	124	en_US

Abstract:

© 2015 Elsevier B.V. All rights reserved. Traditional graph sampling methods reduce the size of a large network via uniform sampling of nodes from the original network. The sampled network can be used to estimate the topological properties of the original network. However, in some application domains (e.g., disease surveillance), the goal of sampling is also to help identify a specified category of nodes (e.g., affected individuals) in a large network. This work therefore aims to, given a large information network, sample a subgraph under a specific goal of acquiring as many nodes with a particular category as possible. We refer to this problem as supervised sampling, where we sample a large network for a specific category of nodes. To this end, we model a network as a Markov chain and derive supervised random walks to learn stationary distributions of the sampled network. The learned stationary distribution can help identify the best node to be sampled in the next iteration. The iterative sampling process ensures that with new sampled nodes being acquired, supervised sampling can be strengthened in turn. Experiments on synthetic as well as real-world networks show that our supervised sampling algorithm outperforms existing methods in obtaining target nodes in the sampled networks.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/95800