Multi-element comparisons of tapes evidence using dimensionality reduction for calculating likelihood ratios.

Gupta, A; Martinez-Lopez, C; Curran, JM; Almirall, JR

Multi-element comparisons of tapes evidence using dimensionality reduction for calculating likelihood ratios.

Gupta, A

Martinez-Lopez, C Curran, JM Almirall, JR

Permalink

Publisher:: ELSEVIER IRELAND LTD
Publication Type:: Journal Article
Citation:: Forensic science international, 2019, 301, pp. 426-434
Issue Date:: 2019-08

Closed Access

	Filename	Description	Size
	out (3).pdf	Published version	1.28 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Gupta, A https://orcid.org/0000-0003-4957-4008
dc.contributor.author	Martinez-Lopez, C
dc.contributor.author	Curran, JM
dc.contributor.author	Almirall, JR
dc.date.accessioned	2020-06-02T21:42:43Z
dc.date.available	2019-06-03
dc.date.available	2020-06-02T21:42:43Z
dc.date.issued	2019-08
dc.identifier.citation	Forensic science international, 2019, 301, pp. 426-434
dc.identifier.issn	0379-0738
dc.identifier.issn	1872-6283
dc.identifier.uri	http://hdl.handle.net/10453/141064
dc.description.abstract	Computing the likelihood ratio (LR), as a measure of weight of evidence, has traditionally been difficult for multi-element evidence. A solution based on multivariate random effects models has been adopted by the forensic community but suffers from instability and has a tendency toward extreme values. This problem is magnified by increasing the number of variables. In this study, we consider reducing the dimensionality of the problem using principal component analysis (PCA) and a post-hoc calibration step suggested by van Es et al. [1] and evaluate the performance of this method using multi-element data collected from electrical tapes with up to 18 elements measured. A set of 90 tapes known to originate from different sources were analyzed by LA-ICP-MS. We used additive log-ratio transformation with respect to the signal of 208Pb to transform the 18-dimensional data. This transformation altered the scale of the signals and more importantly, the transformed signals exhibited characteristics similar to a normal distribution. We used scores of the first five principal components (PCs) as input to the LR formula given by Aitken and Lucy [2] where we assumed multivariate normal between-sources distribution (LR MVN) to compare the tapes. We observed that the calculated LRs were extremely positive and negative and did not conform with the definition of well-calibrated LRs. Thus, we used the post-hoc calibration method given by van Es et al. [1] to calibrate the likelihood ratios. The calibrated LRs were obtained within an appropriate range. Five scenarios, each related to the number of principal components used to compare the samples formed part of this study. The first scenario made the comparisons using only the first PC, the second scenario used the first two PCs together and so on. The last scenario, LR5, used 5 PCs for the comparisons. Comparing the results of these 5 scenarios provided an understanding around sensitivity of the method based on the percentage of information used for the comparisons. The lowest false exclusion (Type I) and false inclusion (Type II) error rates were obtained for LR5 scenario in comparison to all the other scenarios. False inclusion and false exclusion error rates of 3.7% and 2.2% were reported by using only 5 out of 17 PCs. False exclusion error rates of 2.2% indicated that only two same-source comparisons had LR<1. The proposed method overcomes the problem of using highly-dimensional data for the comparisons, while using a high percentage of information present in the original data.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	ELSEVIER IRELAND LTD
dc.relation.ispartof	Forensic science international
dc.relation.isbasedon	10.1016/j.forsciint.2019.06.002
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.subject.classification	Legal & Forensic Medicine
dc.title	Multi-element comparisons of tapes evidence using dimensionality reduction for calculating likelihood ratios.
dc.type	Journal Article
utslib.citation.volume	301
utslib.location.activity	Ireland
pubs.organisational-group	/University of Technology Sydney/Faculty of Science
pubs.organisational-group	/University of Technology Sydney/Faculty of Science/School of Mathematical and Physical Sciences
pubs.organisational-group	/University of Technology Sydney
utslib.copyright.status	closed_access	*
dc.date.updated	2020-06-02T21:42:38Z
pubs.publication-status	Published
pubs.volume	301
utslib.start-page	426

Abstract:

Computing the likelihood ratio (LR), as a measure of weight of evidence, has traditionally been difficult for multi-element evidence. A solution based on multivariate random effects models has been adopted by the forensic community but suffers from instability and has a tendency toward extreme values. This problem is magnified by increasing the number of variables. In this study, we consider reducing the dimensionality of the problem using principal component analysis (PCA) and a post-hoc calibration step suggested by van Es et al. [1] and evaluate the performance of this method using multi-element data collected from electrical tapes with up to 18 elements measured. A set of 90 tapes known to originate from different sources were analyzed by LA-ICP-MS. We used additive log-ratio transformation with respect to the signal of 208Pb to transform the 18-dimensional data. This transformation altered the scale of the signals and more importantly, the transformed signals exhibited characteristics similar to a normal distribution. We used scores of the first five principal components (PCs) as input to the LR formula given by Aitken and Lucy [2] where we assumed multivariate normal between-sources distribution (LR MVN) to compare the tapes. We observed that the calculated LRs were extremely positive and negative and did not conform with the definition of well-calibrated LRs. Thus, we used the post-hoc calibration method given by van Es et al. [1] to calibrate the likelihood ratios. The calibrated LRs were obtained within an appropriate range. Five scenarios, each related to the number of principal components used to compare the samples formed part of this study. The first scenario made the comparisons using only the first PC, the second scenario used the first two PCs together and so on. The last scenario, LR5, used 5 PCs for the comparisons. Comparing the results of these 5 scenarios provided an understanding around sensitivity of the method based on the percentage of information used for the comparisons. The lowest false exclusion (Type I) and false inclusion (Type II) error rates were obtained for LR5 scenario in comparison to all the other scenarios. False inclusion and false exclusion error rates of 3.7% and 2.2% were reported by using only 5 out of 17 PCs. False exclusion error rates of 2.2% indicated that only two same-source comparisons had LR<1. The proposed method overcomes the problem of using highly-dimensional data for the comparisons, while using a high percentage of information present in the original data.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/141064