Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier

Carroll, RJ; Delaigle, A; Hall, P

Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier

Carroll, RJ

Delaigle, A Hall, P

Permalink

Publication Type:: Journal Article
Citation:: Annals of Statistics, 2013, 41 (6), pp. 2739 - 2767
Issue Date:: 2013-12-01

Closed Access

	Filename	Description	Size
	1312.5082v1.pdf	Accepted Manuscript Version	338.58 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Carroll, RJ https://orcid.org/0000-0002-5465-9682	en_US
dc.contributor.author	Delaigle, A	en_US
dc.contributor.author	Hall, P	en_US
dc.date.issued	2013-12-01	en_US
dc.identifier.citation	Annals of Statistics, 2013, 41 (6), pp. 2739 - 2767	en_US
dc.identifier.issn	0090-5364	en_US
dc.identifier.uri	http://hdl.handle.net/10453/117593
dc.description.abstract	The data functions that are studied in the course of functional data analysis are assembled from discrete data, and the level of smoothing that is used is generally that which is appropriate for accurate approximation of the conceptually smooth functions that were not actually observed. Existing literature shows that this approach is effective, and even optimal, when using functional data methods for prediction or hypothesis testing. However, in the present paper we show that this approach is not effective in classification problems. There a useful rule of thumb is that undersmoothing is often desirable, but there are several surprising qualifications to that approach. First, the effect of smoothing the training data can be more significant than that of smoothing the new data set to be classified; second, undersmoothing is not always the right approach, and in fact in some cases using a relatively large bandwidth can be more effective; and third, these perverse results are the consequence of very unusual properties of error rates, expressed as functions of smoothing parameters. For example, the orders of magnitude of optimal smoothing parameter choices depend on the signs and sizes of terms in an expansion of error rate, and those signs and sizes can vary dramatically from one setting to another, even for the same classifier. © Institute of Mathematical Statistics, 2013.	en_US
dc.relation.ispartof	Annals of Statistics	en_US
dc.relation.isbasedon	10.1214/13-AOS1158	en_US
dc.subject.classification	Statistics & Probability	en_US
dc.title	Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier	en_US
dc.type	Journal Article
utslib.citation.volume	6	en_US
utslib.citation.volume	41	en_US
utslib.for	0104 Statistics	en_US
utslib.for	0102 Applied Mathematics	en_US
utslib.for	1403 Econometrics	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Science
pubs.organisational-group	/University of Technology Sydney/Faculty of Science/School of Mathematical and Physical Sciences
utslib.copyright.status	closed_access
pubs.issue	6	en_US
pubs.publication-status	Published	en_US
pubs.volume	41	en_US

Abstract:

The data functions that are studied in the course of functional data analysis are assembled from discrete data, and the level of smoothing that is used is generally that which is appropriate for accurate approximation of the conceptually smooth functions that were not actually observed. Existing literature shows that this approach is effective, and even optimal, when using functional data methods for prediction or hypothesis testing. However, in the present paper we show that this approach is not effective in classification problems. There a useful rule of thumb is that undersmoothing is often desirable, but there are several surprising qualifications to that approach. First, the effect of smoothing the training data can be more significant than that of smoothing the new data set to be classified; second, undersmoothing is not always the right approach, and in fact in some cases using a relatively large bandwidth can be more effective; and third, these perverse results are the consequence of very unusual properties of error rates, expressed as functions of smoothing parameters. For example, the orders of magnitude of optimal smoothing parameter choices depend on the signs and sizes of terms in an expansion of error rate, and those signs and sizes can vary dramatically from one setting to another, even for the same classifier. © Institute of Mathematical Statistics, 2013.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/117593