Adaptive similarity search for the retrieval of rare events from large time series databases

Schlegl, T; Schlegl, S; Tomaselli, D; West, N; Deuse, J

Adaptive similarity search for the retrieval of rare events from large time series databases

Schlegl, T Schlegl, S Tomaselli, D West, N Deuse, J

Permalink

Publisher:: ELSEVIER SCI LTD
Publication Type:: Journal Article
Citation:: Advanced Engineering Informatics, 2022, 52
Issue Date:: 2022-04-01

Closed Access

	Filename	Description	Size
	1-s2.0-S1474034622000945-main.pdf		1.18 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Schlegl, T
dc.contributor.author	Schlegl, S
dc.contributor.author	Tomaselli, D
dc.contributor.author	West, N
dc.contributor.author	Deuse, J https://orcid.org/0000-0003-4066-4357
dc.date.accessioned	2023-03-10T02:45:17Z
dc.date.available	2023-03-10T02:45:17Z
dc.date.issued	2022-04-01
dc.identifier.citation	Advanced Engineering Informatics, 2022, 52
dc.identifier.issn	1474-0346
dc.identifier.issn	1873-5320
dc.identifier.uri	http://hdl.handle.net/10453/166941
dc.description.abstract	Improving the recall of information retrieval systems for similarity search in time series databases is of great practical importance. In the manufacturing domain, these systems are used to query large databases of manufacturing process data that contain terabytes of time series data from millions of parts. This allows domain experts to identify parts that exhibit specific process faults. In practice, the search often amounts to an iterative query–response cycle in which users define new queries (time series patterns) based on results of previous queries. This is a well-documented phenomenon in information retrieval and not unique to the manufacturing domain. Indexing manufacturing databases to speed up the exploratory search is often not feasible as it may result in an unacceptable reduction in recall. In this paper, we present a novel adaptive search algorithm that refines the query based on relevance feedback provided by the user. Additionally, we propose a mechanism that allows the algorithm to self-adapt to new patterns without requiring any user input. As the search progresses, the algorithm constructs a library of time series patterns that are used to accurately find objects of the target class. Experimental validation of the algorithm on real-world manufacturing data shows, that the recall for the retrieval of fault patterns is considerably higher than that of other state-of-the-art adaptive search algorithms. Additionally, its application to publicly available benchmark data sets shows, that these results are transferable to other domains.
dc.language	English
dc.publisher	ELSEVIER SCI LTD
dc.relation.ispartof	Advanced Engineering Informatics
dc.relation.isbasedon	10.1016/j.aei.2022.101629
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	08 Information and Computing Sciences, 09 Engineering
dc.subject.classification	Design Practice & Management
dc.title	Adaptive similarity search for the retrieval of rare events from large time series databases
dc.type	Journal Article
utslib.citation.volume	52
utslib.for	08 Information and Computing Sciences
utslib.for	09 Engineering
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Mechanical and Mechatronic Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - CAM - Centre for Advanced Manufacturing
utslib.copyright.status	closed_access	*
dc.date.updated	2023-03-10T02:45:16Z
pubs.publication-status	Published
pubs.volume	52

Abstract:

Improving the recall of information retrieval systems for similarity search in time series databases is of great practical importance. In the manufacturing domain, these systems are used to query large databases of manufacturing process data that contain terabytes of time series data from millions of parts. This allows domain experts to identify parts that exhibit specific process faults. In practice, the search often amounts to an iterative query–response cycle in which users define new queries (time series patterns) based on results of previous queries. This is a well-documented phenomenon in information retrieval and not unique to the manufacturing domain. Indexing manufacturing databases to speed up the exploratory search is often not feasible as it may result in an unacceptable reduction in recall. In this paper, we present a novel adaptive search algorithm that refines the query based on relevance feedback provided by the user. Additionally, we propose a mechanism that allows the algorithm to self-adapt to new patterns without requiring any user input. As the search progresses, the algorithm constructs a library of time series patterns that are used to accurately find objects of the target class. Experimental validation of the algorithm on real-world manufacturing data shows, that the recall for the retrieval of fault patterns is considerably higher than that of other state-of-the-art adaptive search algorithms. Additionally, its application to publicly available benchmark data sets shows, that these results are transferable to other domains.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/166941