Semiparametric Estimation of the Distribution of Episodically Consumed Foods Measured With Error

Informa UK Limited
Publication Type: Journal Article
Journal of the American Statistical Association, 2020, pp. 1-13
© 2020 American Statistical Association. Dietary data collected from 24-hour dietary recalls are observed with substantial measurement error. In the nonparametric curve estimation literature, much effort has been devoted to designing methods that remain consistent under contamination by noise, and these methods have traditionally been applied to such data. However, some foods, such as alcohol or fruits, are consumed only episodically and may not be consumed on the day the 24-hour recall is administered. The resulting excess zeros cause existing nonparametric estimators to break down, so new techniques are needed for such data. We develop two new consistent semiparametric estimators of the distribution of episodically consumed foods, making parametric assumptions only on less important parts of the model. We establish their theoretical properties and illustrate the good performance of our fully data-driven methods on simulated and real data. Supplementary materials for this article are available online.