Cross-domain learning for network representations

Xue, Shan

Cross-domain learning for network representations

Xue, Shan

Permalink

Publication Type:: Thesis
Issue Date:: 2019

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download contents and abstractAdobe PDF (2.37 MB)

Adobe PDF

Download thesisAdobe PDF (5.65 MB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Xue, Shan
dc.date.accessioned	2019-05-09T04:26:49Z
dc.date.available	2020-05-25T19:17:47Z
dc.date.issued	2019
dc.identifier.uri	http://hdl.handle.net/10453/133268
dc.description	University of Technology Sydney. Faculty of Engineering and Information Technology.	en_AU
dc.description.abstract	Network representation aims to learn a latent feature space so that artificial intelligent algorithms can be applied based on the latent features. The set of latent features is obtained from the information hidden behind network structures, which is learned to provide knowledge for traditional machine learning tasks, such as node classification, recommendation and data visualization. Networks, which are a kind of structured data, limit the representation performance in the structure searching process. Therefore, a good node sampling strategy plays an important role in network representation. Recent research has driven significant progress in network representation by employing random walk as the network sampling strategy. However, real-world large-scale information networks naturally have structural sparsity. The existing approaches to random walk-based network representations are in the domain-specific view to represent the nodes in a vector format, which cannot guarantee a good representation by one network knowledge learning. To address these gaps, this research proposes a framework and develops two algorithms to adapt useful information across relational large-scale information networks and allows the information of the network structure to be transferred from one network to another network to improve the performance of network representation. First, a novel framework of transferring structures across large-scale information networks (FTLSIN) is proposed. FTLSIN consists of a two-layer random walk to measure the relations between two networks and predict the links across them. Second, a cross-domain network representation algorithm (CDNR) is proposed to demonstrate the knowledge which transfers across domains. CDNR learns the structural information from dense networks to sparse networks and further defines the two-layer random walk in unsupervised feature learning with a cross-domain node mapping procedure and a cross-domain walk mapping procedure. Thirdly, a cross-domain similarity learning algorithm (CDSL) is proposed to acquire the most relevant knowledge from the external network. CDSL is nested in the biased random walk-based node sampling and targets the minimum cost of searching the neighborhood in the biased random walk that considers the first-order and second-order walking; and the neighborhood is described by a dual centrality indicator which consists of closeness centrality and betweenness centrality. The developed framework and the two algorithms are very innovative and significantly contribute to both fields of transfer learning and network representation.	en_AU
dc.format	Thesis (PhD)
dc.language.iso	en_AU	en_AU
dc.relation	https://opus.lib.uts.edu.au/bitstream/10453/133268/2/02whole.pdf
dc.rights	The author owns the copyright in this thesis including all reproduction and reuse rights for the work. The work may not be altered without the permission of the copyright owner. Attribution is essential when quoting or paraphrasing from this thesis.
dc.rights	au.edu.uts.lib/ppc
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Cross-domain learning for network representations	en_AU
dc.type	Thesis	en_AU
utslib.copyright.status	open_access

Abstract:

Network representation aims to learn a latent feature space so that artificial intelligent algorithms can be applied based on the latent features. The set of latent features is obtained from the information hidden behind network structures, which is learned to provide knowledge for traditional machine learning tasks, such as node classification, recommendation and data visualization. Networks, which are a kind of structured data, limit the representation performance in the structure searching process. Therefore, a good node sampling strategy plays an important role in network representation. Recent research has driven significant progress in network representation by employing random walk as the network sampling strategy. However, real-world large-scale information networks naturally have structural sparsity. The existing approaches to random walk-based network representations are in the domain-specific view to represent the nodes in a vector format, which cannot guarantee a good representation by one network knowledge learning. To address these gaps, this research proposes a framework and develops two algorithms to adapt useful information across relational large-scale information networks and allows the information of the network structure to be transferred from one network to another network to improve the performance of network representation. First, a novel framework of transferring structures across large-scale information networks (FTLSIN) is proposed. FTLSIN consists of a two-layer random walk to measure the relations between two networks and predict the links across them. Second, a cross-domain network representation algorithm (CDNR) is proposed to demonstrate the knowledge which transfers across domains. CDNR learns the structural information from dense networks to sparse networks and further defines the two-layer random walk in unsupervised feature learning with a cross-domain node mapping procedure and a cross-domain walk mapping procedure. Thirdly, a cross-domain similarity learning algorithm (CDSL) is proposed to acquire the most relevant knowledge from the external network. CDSL is nested in the biased random walk-based node sampling and targets the minimum cost of searching the neighborhood in the biased random walk that considers the first-order and second-order walking; and the neighborhood is described by a dual centrality indicator which consists of closeness centrality and betweenness centrality. The developed framework and the two algorithms are very innovative and significantly contribute to both fields of transfer learning and network representation.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/133268