Maximum penalized likelihood kernel regression for fast adaptation

Mak, BKW; Lai, TC; Tsang, IW; Kwok, JTY


Publication Type: Journal Article
Citation: IEEE Transactions on Audio, Speech and Language Processing, 2009, 17 (7), pp. 1372-1381
Issue Date: 2009-09-01

Closed Access

	Filename	Description	Size	Format
	2013004145OK.pdf		578.96 kB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Mak, BKW	en_US
dc.contributor.author	Lai, TC	en_US
dc.contributor.author	Tsang, IW https://orcid.org/0000-0001-8095-4637	en_US
dc.contributor.author	Kwok, JTY	en_US
dc.date.issued	2009-09-01	en_US
dc.identifier.citation	IEEE Transactions on Audio, Speech and Language Processing, 2009, 17 (7), pp. 1372 - 1381	en_US
dc.identifier.issn	1558-7916	en_US
dc.identifier.uri	http://hdl.handle.net/10453/29745
dc.description.abstract	This paper proposes a nonlinear generalization of the popular maximum-likelihood linear regression (MLLR) adaptation algorithm using kernel methods. The proposed method, called maximum penalized likelihood kernel regression adaptation (MPLKR), applies kernel regression with appropriate regularization to determine the affine model transform in a kernel-induced high-dimensional feature space. Although this is not the first attempt to apply kernel methods to conventional linear adaptation algorithms, unlike most other kernelized adaptation methods, such as kernel eigenvoice or kernel eigen-MLLR, MPLKR has the advantage that it is a convex optimization problem whose solution is guaranteed to be globally optimal. In fact, the adapted Gaussian means can be obtained analytically by simply solving a system of linear equations. From the Bayesian perspective, MPLKR can also be regarded as the kernel version of maximum a posteriori linear regression (MAPLR) adaptation. Supervised and unsupervised speaker adaptation using MPLKR were evaluated on the Resource Management and Wall Street Journal 5K tasks, respectively, achieving word error rate reductions of 23.6% and 15.5% over the speaker-independent model. © 2006 IEEE.	en_US
dc.relation.ispartof	IEEE Transactions on Audio, Speech and Language Processing	en_US
dc.relation.isbasedon	10.1109/TASL.2009.2019920	en_US
dc.subject.classification	Speech-Language Pathology & Audiology	en_US
dc.title	Maximum penalized likelihood Kernel regression for fast adaptation	en_US
dc.type	Journal Article
utslib.citation.volume	7	en_US
utslib.citation.volume	17	en_US
utslib.for	0906 Electrical and Electronic Engineering	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	closed_access
pubs.issue	7	en_US
pubs.publication-status	Published	en_US
pubs.volume	17	en_US

Abstract:

This paper proposes a nonlinear generalization of the popular maximum-likelihood linear regression (MLLR) adaptation algorithm using kernel methods. The proposed method, called maximum penalized likelihood kernel regression adaptation (MPLKR), applies kernel regression with appropriate regularization to determine the affine model transform in a kernel-induced high-dimensional feature space. Although this is not the first attempt to apply kernel methods to conventional linear adaptation algorithms, unlike most other kernelized adaptation methods, such as kernel eigenvoice or kernel eigen-MLLR, MPLKR has the advantage that it is a convex optimization problem whose solution is guaranteed to be globally optimal. In fact, the adapted Gaussian means can be obtained analytically by simply solving a system of linear equations. From the Bayesian perspective, MPLKR can also be regarded as the kernel version of maximum a posteriori linear regression (MAPLR) adaptation. Supervised and unsupervised speaker adaptation using MPLKR were evaluated on the Resource Management and Wall Street Journal 5K tasks, respectively, achieving word error rate reductions of 23.6% and 15.5% over the speaker-independent model. © 2006 IEEE.
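
The abstract's remark that a regularized kernel regression is convex and reduces to a system of linear equations can be illustrated with generic kernel ridge regression. The sketch below is not the MPLKR formulation from the paper, which is not reproduced in this record; the function names, the Gaussian kernel, the parameters `lam` and `gamma`, and the toy sine data are all assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

def fit_kernel_ridge(X, Y, lam=1e-2, gamma=1.0):
    """Generic kernel ridge regression (an illustrative stand-in, not MPLKR).

    Minimizes ||Y - K a||^2 + lam * a^T K a over the dual coefficients a.
    The objective is convex, and its minimizer solves (K + lam I) a = Y.
    """
    K = rbf_kernel(X, X, gamma)
    return np.linalg.solve(K + lam * np.eye(X.shape[0]), Y)

def predict(X_train, alpha, X_new, gamma=1.0):
    """Evaluate the fitted regression function at the rows of X_new."""
    return rbf_kernel(X_new, X_train, gamma) @ alpha

# Toy data (hypothetical): 20 samples of a sine curve on [0, 3].
X = np.linspace(0.0, 3.0, 20).reshape(-1, 1)
Y = np.sin(X)

alpha = fit_kernel_ridge(X, Y)   # one linear solve, no iterative search
Y_hat = predict(X, alpha, X)     # fitted values at the training inputs
```

Because the regularized system matrix `K + lam * I` is positive definite for `lam > 0`, the single call to `np.linalg.solve` returns the unique global minimizer, which is the property the abstract contrasts with non-convex kernelized adaptation methods.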

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/29745