Predictive risk modelling under different data access scenarios: Who is identified as high risk and for how long?

Johnson, TL; Kaldor, J; Falster, MO; Sutherland, K; Humphries, J; Jorm, LR; Levesque, JF

Predictive risk modelling under different data access scenarios: Who is identified as high risk and for how long?

Johnson, TL Kaldor, J Falster, MO Sutherland, K Humphries, J Jorm, LR Levesque, JF

Permalink

Publication Type:: Journal Article
Citation:: BMJ Open, 2018, 8 (2)
Issue Date:: 2018-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published VersionAdobe PDF (1 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Johnson, TL	en_US
dc.contributor.author	Kaldor, J	en_US
dc.contributor.author	Falster, MO	en_US
dc.contributor.author	Sutherland, K	en_US
dc.contributor.author	Humphries, J	en_US
dc.contributor.author	Jorm, LR	en_US
dc.contributor.author	Levesque, JF	en_US
dc.date.issued	2018-01-01	en_US
dc.identifier.citation	BMJ Open, 2018, 8 (2)	en_US
dc.identifier.uri	http://hdl.handle.net/10453/134353
dc.description.abstract	© Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. Objective: This observational study critically explored the performance of different predictive risk models simulating three data access scenarios, comparing: (1) sociodemographic and clinical profles; (2) consistency in high-risk designation across models; and (3) persistence of high-risk status over time. Methods: Cross-sectional health survey data (2006-2009) for more than 260 000 Australian adults 45+ years were linked to longitudinal individual hospital, primary care, pharmacy and mortality data. Three risk models predicting acute emergency hospitalisations were explored, simulating conditions where data are accessed through primary care practice management systems, or through hospital-based electronic records, or through a hypothetical 'full' model using a wider array of linked data. High-risk patients were identified using different risk score thresholds. Models were reapplied monthly for 24 months to assess persistence in high-risk categorisation. Results: The three models displayed similar statistical performance. Three-quarters of patients in the high-risk quintile from the 'full' model were also identified using the primary care or hospital-based models, with the remaining patients differing according to age, frailty, multimorbidity, self-rated health, polypharmacy, prior hospitalisations and imminent mortality. The use of higher risk prediction thresholds resulted in lower levels of agreement in highrisk designation across models and greater morbidity and mortality in identified patient populations. Persistence of high-risk status varied across approaches according to updated information on utilisation history, with up to 25% of patients reassessed as lower risk within 1 year. Conclusion/implications: Small differences in risk predictors or risk thresholds resulted in comparatively large differences in who was classified as high risk and for how long. Pragmatic predictive risk modelling design decisions based on data availability or projected high-risk patient numbers may therefore influence individuals identified as high-risk, overall case mix and risk persistence. Routine data linkage would enable greater flexibility in developing and optimising predictive risk models appropriate to both case-finding and performance measurement applications.	en_US
dc.relation.ispartof	BMJ Open	en_US
dc.relation.isbasedon	10.1136/bmjopen-2017-018909	en_US
dc.title	Predictive risk modelling under different data access scenarios: Who is identified as high risk and for how long?	en_US
dc.type	Journal Article
utslib.citation.volume	2	en_US
utslib.citation.volume	8	en_US
utslib.for	1117 Public Health and Health Services	en_US
utslib.for	1402 Applied Economics	en_US
utslib.for	1103 Clinical Sciences	en_US
utslib.for	1199 Other Medical and Health Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Business
utslib.copyright.status	open_access
pubs.issue	2	en_US
pubs.publication-status	Published	en_US
pubs.volume	8	en_US

Abstract:

© Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. Objective: This observational study critically explored the performance of different predictive risk models simulating three data access scenarios, comparing: (1) sociodemographic and clinical profles; (2) consistency in high-risk designation across models; and (3) persistence of high-risk status over time. Methods: Cross-sectional health survey data (2006-2009) for more than 260 000 Australian adults 45+ years were linked to longitudinal individual hospital, primary care, pharmacy and mortality data. Three risk models predicting acute emergency hospitalisations were explored, simulating conditions where data are accessed through primary care practice management systems, or through hospital-based electronic records, or through a hypothetical 'full' model using a wider array of linked data. High-risk patients were identified using different risk score thresholds. Models were reapplied monthly for 24 months to assess persistence in high-risk categorisation. Results: The three models displayed similar statistical performance. Three-quarters of patients in the high-risk quintile from the 'full' model were also identified using the primary care or hospital-based models, with the remaining patients differing according to age, frailty, multimorbidity, self-rated health, polypharmacy, prior hospitalisations and imminent mortality. The use of higher risk prediction thresholds resulted in lower levels of agreement in highrisk designation across models and greater morbidity and mortality in identified patient populations. Persistence of high-risk status varied across approaches according to updated information on utilisation history, with up to 25% of patients reassessed as lower risk within 1 year. Conclusion/implications: Small differences in risk predictors or risk thresholds resulted in comparatively large differences in who was classified as high risk and for how long. Pragmatic predictive risk modelling design decisions based on data availability or projected high-risk patient numbers may therefore influence individuals identified as high-risk, overall case mix and risk persistence. Routine data linkage would enable greater flexibility in developing and optimising predictive risk models appropriate to both case-finding and performance measurement applications.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/134353