Decoupling Sparsity and Smoothness in Dirichlet Belief Networks

Publisher:
Springer International Publishing
Publication Type:
Chapter
Citation:
Machine Learning and Knowledge Discovery in Databases. Research Track, 2021, 12976 LNAI, pp. 148-163
Issue Date:
2021-01-01
The Dirichlet Belief Network (DirBN) has been proposed as a promising deep generative model that uses Dirichlet distributions to form layer-wise connections and thereby construct a multi-stochastic-layer deep architecture. However, the DirBN cannot simultaneously achieve both sparsity, whereby the generated latent distributions place weight on only a subset of components, and smoothness, which requires that the posterior distribution not be dominated by the data. To address this limitation we introduce the sparse and smooth Dirichlet Belief Network (ssDirBN), which achieves sparsity and smoothness simultaneously and thereby increases modelling flexibility over the DirBN. This gain is achieved by introducing binary variables that indicate whether each entity's latent distribution at each layer uses a particular component. As a result, each latent distribution may use only a subset of components in each layer, and smoothness is enforced on this subset. Additional modifications to the model are made to address the issues caused by introducing these binary variables. Extensive experimental results on real-world data show significant performance improvements of ssDirBN over state-of-the-art models, in terms of both enhanced model predictions and reduced model complexity.
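The core mechanism described above, binary indicators selecting a subset of components, with a Dirichlet draw smoothing the weights over that subset, can be sketched as follows. This is a minimal illustrative sketch of the idea, not the authors' exact construction; the function name, the Bernoulli inclusion probabilities `pi`, and the fallback for an all-zero mask are assumptions made for the example.

```python
import numpy as np

def sparse_dirichlet_draw(alpha, pi, rng):
    """Draw a latent distribution supported on a subset of components.

    alpha : (K,) Dirichlet concentration parameters
    pi    : (K,) per-component inclusion probabilities (assumed prior)

    Binary variables b_k ~ Bernoulli(pi_k) decide which components are
    active; a Dirichlet draw over the active subset then smooths the
    weights, so sparsity and smoothness are decoupled.
    """
    K = len(alpha)
    b = rng.random(K) < pi              # binary inclusion indicators
    if not b.any():                     # guard: keep at least one component
        b[rng.integers(K)] = True
    theta = np.zeros(K)
    theta[b] = rng.dirichlet(alpha[b])  # smooth weights on the active subset
    return theta, b

rng = np.random.default_rng(0)
theta, b = sparse_dirichlet_draw(np.full(8, 0.5), np.full(8, 0.3), rng)
# theta sums to 1 and is exactly zero outside the active set b
```

In a layered architecture such as the DirBN, a draw like `theta` at one layer would parameterise the Dirichlet concentrations of the next, with the binary masks controlling which components each entity can propagate downward.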