Test-Time Training for Spatial-Temporal Forecasting

Chen, C; Liu, Y; Chen, L; Zhang, C

Publisher: SIAM
Publication Type: Conference Proceeding
Citation: Proceedings of the 2024 SIAM International Conference on Data Mining, SDM 2024, 2024, pp. 463-471
Issue Date: 2024-01-01

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Chen, C https://orcid.org/0000-0002-3268-7340
dc.contributor.author	Liu, Y
dc.contributor.author	Chen, L https://orcid.org/0000-0002-6468-5729
dc.contributor.author	Zhang, C https://orcid.org/0000-0001-5715-7154
dc.contributor.editor	Papalexakis, V
dc.contributor.editor	Shekhar, S
dc.contributor.editor	Gao, J
dc.contributor.editor	Jiang, Z
dc.contributor.editor	Riondato, M
dc.date	2024-04-18
dc.date.accessioned	2025-01-13T02:01:45Z
dc.date.available	2025-01-13T02:01:45Z
dc.date.issued	2024-01-01
dc.identifier.citation	Proceedings of the 2024 SIAM International Conference on Data Mining, SDM 2024, 2024, pp. 463-471
dc.identifier.uri	http://hdl.handle.net/10453/183304
dc.language	en
dc.publisher	SIAM
dc.relation	http://purl.org/au-research/grants/arc/DP210101347
dc.relation.ispartof	Proceedings of the 2024 SIAM International Conference on Data Mining, SDM 2024
dc.relation.ispartof	SIAM International Conference on Data Mining (SDM)
dc.relation.ispartofseries	Data Mining
dc.relation.isbasedon	10.1137/1.9781611978032.54
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.title	Test-Time Training for Spatial-Temporal Forecasting
dc.type	Conference Proceeding
utslib.location.activity	TX, Houston
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/UTS Groups
pubs.organisational-group	University of Technology Sydney/UTS Groups/Australian Artificial Intelligence Institute (AAII)
pubs.organisational-group	University of Technology Sydney/UTS Groups/Australia China Relations Institute Research Centre (ACRI)
utslib.copyright.status	open_access	*
dc.date.updated	2025-01-13T02:01:43Z
pubs.finish-date	2024-04-20
pubs.publication-status	Published
pubs.start-date	2024-04-18

Abstract:

Despite the recent success of deep neural networks in spatial-temporal forecasting, existing methods suffer from distribution shifts between the training and test data and fail to address non-stationary and abrupt changes at test time. To solve this problem, we propose a novel test-time training framework for spatial-temporal forecasting. Instead of employing a fixed trained model, we adapt the trained model with only one test example or a mini-batch of test examples to address the test data shifts. The unique spatial structure, with hundreds of geographical locations, offers an effective batch size to explore the test-time distribution and avoid overfitting. To implement test-time training on spatial-temporal data, we devise a bidirectional cycle-consistent architecture consisting of a forward and a backward cyclic network. Each network has a shared encoder and two direction-aware decoders. At test time, two self-supervised auxiliary tasks (forward→backward and backward→forward reconstruction) adapt the trained model without accessing the target labels. In addition, the bi-cyclic structure of our model can improve the forecasting task at training time and ensure consistency between training and test time. Comprehensive experiments on various spatial-temporal forecasting datasets demonstrate the effectiveness of the test-time training framework and the bidirectional-cyclic structure.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/183304