Entropy evaluation based on confidence intervals of frequency estimates: Application to the learning of decision trees

Serrurier, M; Prade, H

Entropy evaluation based on confidence intervals of frequency estimates: Application to the learning of decision trees

Serrurier, M Prade, H

Permalink

Publication Type:: Conference Proceeding
Citation:: 32nd International Conference on Machine Learning, ICML 2015, 2015, 2 pp. 1576 - 1584
Issue Date:: 2015-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (2.38 MB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Serrurier, M	en_US
dc.contributor.author	Prade, H https://orcid.org/0000-0003-4586-8527	en_US
dc.date.issued	2015-01-01	en_US
dc.identifier.citation	32nd International Conference on Machine Learning, ICML 2015, 2015, 2 pp. 1576 - 1584	en_US
dc.identifier.isbn	9781510810587	en_US
dc.identifier.uri	http://hdl.handle.net/10453/120391
dc.description.abstract	Copyright © 2015 by the author(s). Entropy gain is widely used for learning decision trees. However, as we go deeper downward the tree, the examples become rarer and the faithfulness of entropy decreases. Thus, misleading choices and over-fitting may occur and the tree has to be adjusted by using an early-stop criterion or post pruning algorithms. However, these methods still depends on the choices previously made, which may be unsatisfactory. We propose a new cumulative entropy function based on confidence intervals on frequency estimates that together considers the entropy of the probability distribution and the uncertainty around the estimation of its parameters. This function takes advantage of the ability of a possibility distribution to upper bound a family of probabilities previously estimated from a limited set of examples and of the link between possibilistic specificity order and entropy. The proposed measure has several advantages over the classical one. It performs significant choices of split and provides a statistically relevant stopping criterion that allows the learning of trees whose size is well-suited w.r.t. the available data. On the top of that, it also provides a reasonable estimator of the performances of a decision tree. Finally, we show that it can be used for designing a simple and efficient online learning algorithm.	en_US
dc.relation.ispartof	32nd International Conference on Machine Learning, ICML 2015	en_US
dc.title	Entropy evaluation based on confidence intervals of frequency estimates: Application to the learning of decision trees	en_US
dc.type	Conference Proceeding
utslib.citation.volume	2	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Software
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	open_access
pubs.publication-status	Published	en_US
pubs.volume	2	en_US

Abstract:

Copyright © 2015 by the author(s). Entropy gain is widely used for learning decision trees. However, as we go deeper downward the tree, the examples become rarer and the faithfulness of entropy decreases. Thus, misleading choices and over-fitting may occur and the tree has to be adjusted by using an early-stop criterion or post pruning algorithms. However, these methods still depends on the choices previously made, which may be unsatisfactory. We propose a new cumulative entropy function based on confidence intervals on frequency estimates that together considers the entropy of the probability distribution and the uncertainty around the estimation of its parameters. This function takes advantage of the ability of a possibility distribution to upper bound a family of probabilities previously estimated from a limited set of examples and of the link between possibilistic specificity order and entropy. The proposed measure has several advantages over the classical one. It performs significant choices of split and provides a statistically relevant stopping criterion that allows the learning of trees whose size is well-suited w.r.t. the available data. On the top of that, it also provides a reasonable estimator of the performances of a decision tree. Finally, we show that it can be used for designing a simple and efficient online learning algorithm.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/120391