Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data

Zhang, J; Wei, Q; Liu, F; Feng, L

Publisher: JMLR.org
Publication Type: Conference Proceeding
Citation: ICML'24: Proceedings of the 41st International Conference on Machine Learning, 2024, 235, pp. 60004-60020
Issue Date: 2024-01-01

Closed Access

	Filename	Description	Size	Format
	zhang24bo.pdf	Published version	1.96 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhang, J
dc.contributor.author	Wei, Q
dc.contributor.author	Liu, F https://orcid.org/0000-0002-5005-9129
dc.contributor.author	Feng, L
dc.date	2024-07-21
dc.date.accessioned	2025-03-11T03:28:07Z
dc.date.available	2025-03-11T03:28:07Z
dc.date.issued	2024-01-01
dc.identifier.citation	ICML'24: Proceedings of the 41st International Conference on Machine Learning, 2024, 235, pp. 60004-60020
dc.identifier.issn	2640-3498
dc.identifier.uri	http://hdl.handle.net/10453/185660
dc.description.abstract	Fine-tuning vision-language models (VLMs) with abundant unlabeled data has recently attracted increasing attention. Existing methods that rely on a pseudolabeling strategy suffer from largely incorrect hard pseudolabels when VLMs exhibit low zero-shot performance on downstream tasks. To alleviate this issue, we propose a Candidate Pseudolabel Learning method, termed CPL, to fine-tune VLMs with suitable candidate pseudolabels of unlabeled data in downstream tasks. The core of our method lies in the generation strategy of candidate pseudolabels, which progressively generates refined candidate pseudolabels by both intra- and inter-instance label selection, based on a confidence score matrix for all unlabeled data. This strategy can result in better performance in true label inclusion and class-balanced instance selection. In this way, we can directly apply existing loss functions to learn with the generated candidate pseudolabels. Extensive experiments on nine benchmark datasets with three learning paradigms demonstrate the effectiveness of our method. Our code can be found here.
dc.language	en
dc.publisher	JMLR.org
dc.relation.ispartof	ICML'24: Proceedings of the 41st International Conference on Machine Learning
dc.relation.ispartof	International Conference on Machine Learning
dc.relation.ispartofseries	Proceedings of Machine Learning Research
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data
dc.type	Conference Proceeding
utslib.citation.volume	235
utslib.location.activity	Vienna Austria
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/UTS Groups
pubs.organisational-group	University of Technology Sydney/UTS Groups/Australian Artificial Intelligence Institute (AAII)
pubs.organisational-group	University of Technology Sydney/UTS Groups/Australian Artificial Intelligence Institute (AAII)/Australian Artificial Intelligence Institute (AAII) Associate Members
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2025-03-11T03:28:05Z
pubs.finish-date	2024-07-27
pubs.publication-status	Published
pubs.start-date	2024-07-21
pubs.volume	235

Abstract:

Fine-tuning vision-language models (VLMs) with abundant unlabeled data has recently attracted increasing attention. Existing methods that rely on a pseudolabeling strategy suffer from largely incorrect hard pseudolabels when VLMs exhibit low zero-shot performance on downstream tasks. To alleviate this issue, we propose a Candidate Pseudolabel Learning method, termed CPL, to fine-tune VLMs with suitable candidate pseudolabels of unlabeled data in downstream tasks. The core of our method lies in the generation strategy of candidate pseudolabels, which progressively generates refined candidate pseudolabels by both intra- and inter-instance label selection, based on a confidence score matrix for all unlabeled data. This strategy can result in better performance in true label inclusion and class-balanced instance selection. In this way, we can directly apply existing loss functions to learn with the generated candidate pseudolabels. Extensive experiments on nine benchmark datasets with three learning paradigms demonstrate the effectiveness of our method. Our code can be found here.
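
Illustrative note: the abstract describes selecting candidate pseudolabels from a confidence score matrix using both intra-instance and inter-instance criteria, but this record does not give the exact rules. The sketch below is one possible reading and not the authors' implementation. It assumes a row-normalized score matrix, a cumulative-confidence cutoff for the intra-instance step, and a per-class top-fraction cutoff for the inter-instance step. The function name `candidate_pseudolabels` and the parameters `tau` and `top_frac` are hypothetical.

```python
import numpy as np


def candidate_pseudolabels(scores, tau=0.9, top_frac=0.5):
    """Return a boolean (n_instances, n_classes) mask of candidate labels.

    Assumptions (not taken from the paper): `scores` is a confidence matrix
    whose rows sum to 1; `tau` and `top_frac` are hypothetical knobs.
    """
    n, _ = scores.shape

    # Intra-instance selection (assumed rule): per row, keep the top-ranked
    # classes until their cumulative confidence reaches tau.
    order = np.argsort(-scores, axis=1)
    sorted_scores = np.take_along_axis(scores, order, axis=1)
    mass_before = np.cumsum(sorted_scores, axis=1) - sorted_scores
    keep_sorted = mass_before < tau
    intra = np.zeros(scores.shape, dtype=bool)
    np.put_along_axis(intra, order, keep_sorted, axis=1)

    # Inter-instance selection (assumed rule): per class (column), keep the
    # instances whose score is among the top `top_frac` of that column.
    k = max(1, int(np.ceil(top_frac * n)))
    kth_largest = np.sort(scores, axis=0)[n - k]
    inter = scores >= kth_largest

    # A label is a candidate only if both criteria agree.
    return intra & inter


if __name__ == "__main__":
    demo = np.array([[0.70, 0.20, 0.10],
                     [0.40, 0.35, 0.25],
                     [0.10, 0.10, 0.80]])
    print(candidate_pseudolabels(demo, tau=0.6, top_frac=0.5))
```

Under these assumptions, an instance whose mask row is empty would simply be left out of training for that round. Rows with several candidates could then be passed to a partial-label loss, in line with the abstract's remark that existing loss functions can be applied directly.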

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/185660