Learning word dependencies in text by means of a deep recurrent belief network

Chaturvedi, I; Ong, YS; Tsang, IW; Welsch, RE; Cambria, E

Publication Type: Journal Article
Citation: Knowledge-Based Systems, 2016, 108, pp. 144-154
Issue Date: 2016-09-15

Closed Access

	Filename	Description	Size	Format
	1-s2.0-S0950705116302349-main.pdf	Published Version	719.3 kB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Chaturvedi, I	en_US
dc.contributor.author	Ong, YS	en_US
dc.contributor.author	Tsang, IW https://orcid.org/0000-0001-8095-4637	en_US
dc.contributor.author	Welsch, RE	en_US
dc.contributor.author	Cambria, E	en_US
dc.date.issued	2016-09-15	en_US
dc.identifier.citation	Knowledge-Based Systems, 2016, 108 pp. 144 - 154	en_US
dc.identifier.issn	0950-7051	en_US
dc.identifier.uri	http://hdl.handle.net/10453/122816
dc.description.abstract	© 2016 We propose a deep recurrent belief network with distributed time delays for learning multivariate Gaussians. Learning long time delays in deep belief networks is difficult because gradients vanish or explode as the delay increases. To mitigate this problem and improve the transparency of learning time delays, we introduce the use of Gaussian networks with time delays to initialize the weights of each hidden neuron. From our knowledge of time delays, it is possible to learn the long delays from short delays in a hierarchical manner. In contrast to previous works, here dynamic Gaussian Bayesian networks over training samples are evolved using Markov Chain Monte Carlo to determine the initial weights of each hidden layer of neurons. In this way, the time-delayed network motifs of increasing Markov order across layers can be modeled hierarchically using a deep model. To validate the proposed Variable-order Belief Network (VBN) framework, it is applied to modeling word dependencies in text. To explore the generality of VBN, it is further applied to a real-world scenario in which the dynamic movements of basketball players are modeled. Experimental results showed that the proposed VBN achieved over 30% improvement in accuracy on real-world scenarios compared to state-of-the-art baselines.	en_US
dc.relation.ispartof	Knowledge-Based Systems	en_US
dc.relation.isbasedon	10.1016/j.knosys.2016.07.019	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Learning word dependencies in text by means of a deep recurrent belief network	en_US
dc.type	Journal Article
utslib.citation.volume	108	en_US
utslib.for	080101 Adaptive Agents and Intelligent Robotics	en_US
utslib.for	080109 Pattern Recognition and Data Mining	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	15 Commerce, Management, Tourism and Services	en_US
utslib.for	17 Psychology and Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	108	en_US

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/122816