Trust-Region Adaptive Frequency for Online Continual Learning

Kong, Y; Liu, L; Qiao, M; Wang, Z; Tao, D

Trust-Region Adaptive Frequency for Online Continual Learning

Kong, Y Liu, L Qiao, M

Wang, Z Tao, D

Permalink

Publisher:: Springer Nature
Publication Type:: Journal Article
Citation:: International Journal of Computer Vision, 2023, 131, (7), pp. 1825-1839
Issue Date:: 2023-07-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (1.58 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Kong, Y
dc.contributor.author	Liu, L
dc.contributor.author	Qiao, M https://orcid.org/0000-0002-0990-5506
dc.contributor.author	Wang, Z
dc.contributor.author	Tao, D
dc.date.accessioned	2023-10-24T22:44:59Z
dc.date.available	2023-10-24T22:44:59Z
dc.date.issued	2023-07-01
dc.identifier.citation	International Journal of Computer Vision, 2023, 131, (7), pp. 1825-1839
dc.identifier.issn	0920-5691
dc.identifier.issn	1573-1405
dc.identifier.uri	http://hdl.handle.net/10453/172844
dc.description.abstract	In the paradigm of online continual learning, one neural network is exposed to a sequence of tasks, where the data arrive in an online fashion and previously seen data are not accessible. Such online fashion causes insufficient learning and severe forgetting on past tasks issues, preventing a good stability-plasticity trade-off, where ideally the network is expected to have high plasticity to adapt to new tasks well and have the stability to prevent forgetting on old tasks simultaneously. To solve these issues, we propose a trust-region adaptive frequency approach, which alternates between standard-process and intra-process updates. Specifically, the standard-process replays data stored in a coreset and interleaves the data with current data, and the intra-process updates the network parameters based on the coreset. Furthermore, to improve the unsatisfactory performance stemming from online fashion, the frequency of the intra-process is adjusted based on a trust region, which is measured by the confidence score of current data. During the intra-process, we distill the dark knowledge to retain useful learned knowledge. Moreover, to store more representative data in the coreset, a confidence-based coreset selection is presented in an online manner. The experimental results on standard benchmarks show that the proposed method significantly outperforms state-of-art continual learning algorithms.
dc.language	en
dc.publisher	Springer Nature
dc.relation.ispartof	International Journal of Computer Vision
dc.relation.isbasedon	10.1007/s11263-023-01775-0
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	0801 Artificial Intelligence and Image Processing
dc.subject.classification	Artificial Intelligence & Image Processing
dc.subject.classification	4603 Computer vision and multimedia computation
dc.subject.classification	4607 Graphics, augmented reality and games
dc.subject.classification	4611 Machine learning
dc.title	Trust-Region Adaptive Frequency for Online Continual Learning
dc.type	Journal Article
utslib.citation.volume	131
utslib.for	0801 Artificial Intelligence and Image Processing
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	open_access	*
dc.date.updated	2023-10-24T22:44:58Z
pubs.issue	7
pubs.publication-status	Published
pubs.volume	131
utslib.citation.issue	7

Abstract:

In the paradigm of online continual learning, one neural network is exposed to a sequence of tasks, where the data arrive in an online fashion and previously seen data are not accessible. Such online fashion causes insufficient learning and severe forgetting on past tasks issues, preventing a good stability-plasticity trade-off, where ideally the network is expected to have high plasticity to adapt to new tasks well and have the stability to prevent forgetting on old tasks simultaneously. To solve these issues, we propose a trust-region adaptive frequency approach, which alternates between standard-process and intra-process updates. Specifically, the standard-process replays data stored in a coreset and interleaves the data with current data, and the intra-process updates the network parameters based on the coreset. Furthermore, to improve the unsatisfactory performance stemming from online fashion, the frequency of the intra-process is adjusted based on a trust region, which is measured by the confidence score of current data. During the intra-process, we distill the dark knowledge to retain useful learned knowledge. Moreover, to store more representative data in the coreset, a confidence-based coreset selection is presented in an online manner. The experimental results on standard benchmarks show that the proposed method significantly outperforms state-of-art continual learning algorithms.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/172844