Algorithm-Dependent Generalization Bounds for Multi-Task Learning

Liu, T; Tao, D; Song, M; Maybank, SJ


Publication Type: Journal Article
Citation: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (2), pp. 227 - 241
Issue Date: 2017-02-01

Closed Access

Filename	Description	Size	Format
Algorithm-Dependent Generalization Bounds for Multi-Task Learning..pdf	Published Version	315.87 kB	Adobe PDF


This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Liu, T	en_US
dc.contributor.author	Tao, D https://orcid.org/0000-0001-7225-5449	en_US
dc.contributor.author	Song, M	en_US
dc.contributor.author	Maybank, SJ	en_US
dc.date.issued	2017-02-01	en_US
dc.identifier.citation	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39 (2), pp. 227 - 241	en_US
dc.identifier.issn	0162-8828	en_US
dc.identifier.uri	http://hdl.handle.net/10453/112885
dc.description.abstract	© 1979-2012 IEEE. Often, tasks are collected for multi-task learning (MTL) because they share similar feature structures. Based on this observation, we present novel algorithm-dependent generalization bounds for MTL by exploiting the notion of algorithmic stability. We focus on the performance of one particular task and the average performance over multiple tasks by analyzing the generalization ability of a common parameter that is shared in MTL. When focusing on one particular task, with the help of a mild assumption on the feature structures, we interpret the function of the other tasks as a regularizer that produces a specific inductive bias. The algorithm for learning the common parameter, as well as the predictor, is thereby uniformly stable with respect to the domain of the particular task and has a generalization bound with a fast convergence rate of order $\mathcal{O}(1/n)$, where $n$ is the sample size of the particular task. When focusing on the average performance over multiple tasks, we prove that a similar inductive bias exists under certain conditions on the feature structures. Thus, the corresponding algorithm for learning the common parameter is also uniformly stable with respect to the domains of the multiple tasks, and its generalization bound is of the order $\mathcal{O}(1/T)$, where $T$ is the number of tasks. These theoretical analyses naturally show that the similarity of feature structures in MTL will lead to specific regularizations for predicting, which enables the learning algorithms to generalize fast and correctly from a few examples.	en_US
dc.relation	http://purl.org/au-research/grants/arc/DP140102164
dc.relation	http://purl.org/au-research/grants/arc/FT130101457
dc.relation.ispartof	IEEE Transactions on Pattern Analysis and Machine Intelligence	en_US
dc.relation.isbasedon	10.1109/TPAMI.2016.2544314	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Algorithm-Dependent Generalization Bounds for Multi-Task Learning	en_US
dc.type	Journal Article
utslib.citation.volume	2	en_US
utslib.citation.volume	39	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0806 Information Systems	en_US
utslib.for	0906 Electrical and Electronic Engineering	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.issue	2	en_US
pubs.publication-status	Published	en_US
pubs.volume	39	en_US

Abstract:

© 1979-2012 IEEE. Often, tasks are collected for multi-task learning (MTL) because they share similar feature structures. Based on this observation, we present novel algorithm-dependent generalization bounds for MTL by exploiting the notion of algorithmic stability. We focus on the performance of one particular task and the average performance over multiple tasks by analyzing the generalization ability of a common parameter that is shared in MTL. When focusing on one particular task, with the help of a mild assumption on the feature structures, we interpret the function of the other tasks as a regularizer that produces a specific inductive bias. The algorithm for learning the common parameter, as well as the predictor, is thereby uniformly stable with respect to the domain of the particular task and has a generalization bound with a fast convergence rate of order $\mathcal{O}(1/n)$, where $n$ is the sample size of the particular task. When focusing on the average performance over multiple tasks, we prove that a similar inductive bias exists under certain conditions on the feature structures. Thus, the corresponding algorithm for learning the common parameter is also uniformly stable with respect to the domains of the multiple tasks, and its generalization bound is of the order $\mathcal{O}(1/T)$, where $T$ is the number of tasks. These theoretical analyses naturally show that the similarity of feature structures in MTL will lead to specific regularizations for predicting, which enables the learning algorithms to generalize fast and correctly from a few examples.
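
The abstract relies on uniform stability: a learning algorithm is uniformly stable if replacing a single training example changes the loss of the learned predictor by only a small amount at every point. The sketch below is an illustration of that notion only, not the paper's MTL algorithm. It assumes a one-dimensional, single-task ridge regression on synthetic data, and every name in it (`fit_ridge_1d`, `sq_loss`, `empirical_stability`, the probe grid, the replacement point) is hypothetical. It probes how much the squared loss moves when one example is swapped out, which gives a rough empirical lower estimate of the stability constant for one sample rather than the supremum over all samples that the formal definition requires.

```python
import random


def fit_ridge_1d(sample, lam):
    """Minimise (1/n) * sum((w*x - y)**2) + lam * w**2 over a scalar w.

    Setting the derivative to zero gives the closed form used here.
    """
    n = len(sample)
    sxy = sum(x * y for x, y in sample)
    sxx = sum(x * x for x, _ in sample)
    return sxy / (sxx + lam * n)


def sq_loss(w, x, y):
    """Squared loss of the linear predictor w*x at the point (x, y)."""
    return (w * x - y) ** 2


def empirical_stability(n, lam=1.0, seed=0):
    """Largest observed loss change when one training example is replaced.

    Assumption: synthetic data with x in [-1, 1] and y = 0.5*x + small noise.
    """
    rng = random.Random(seed)
    sample = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        y = 0.5 * x + rng.uniform(-0.1, 0.1)
        sample.append((x, y))

    w_full = fit_ridge_1d(sample, lam)
    # Probe points at which the loss change is measured (an arbitrary grid).
    probes = [(px, py) for px in (-1.0, 0.0, 1.0) for py in (-1.0, 0.0, 1.0)]
    # A fixed replacement example that disagrees with the data trend.
    replacement = (1.0, -1.0)

    worst = 0.0
    for i in range(n):
        perturbed = sample[:i] + [replacement] + sample[i + 1:]
        w_pert = fit_ridge_1d(perturbed, lam)
        for px, py in probes:
            change = abs(sq_loss(w_full, px, py) - sq_loss(w_pert, px, py))
            worst = max(worst, change)
    return worst


if __name__ == "__main__":
    for n in (50, 100, 200, 400):
        print(n, empirical_stability(n))
```

Running the script prints the observed loss change for several sample sizes. Under these assumptions the value should shrink as n grows, which is the qualitative behaviour that a stability constant decreasing in the sample size describes. The regularization weight `lam` plays the role that, according to the abstract, the other tasks play in MTL: it supplies the inductive bias that keeps the learned parameter from moving much when one example changes.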

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/112885