A Universal Representation Transformer Layer for Few-Shot Image Classification

Liu, L; Hamilton, W; Long, G; Jiang, J; Larochelle, H

A Universal Representation Transformer Layer for Few-Shot Image Classification

Liu, L Hamilton, W Long, G

Jiang, J Larochelle, H

Permalink

Publication Type:: Journal Article
Citation:: 2020
Issue Date:: 2020-06-21

Closed Access

	Filename	Description	Size
	ICLR - Lu Liu 2021 - review.pdf	Supporting information	265.34 kB		View/Open
	ICLR 2021 - Lu Liu - paper.pdf	Accepted version	14.7 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Liu, L
dc.contributor.author	Hamilton, W
dc.contributor.author	Long, G https://orcid.org/0000-0003-3740-9515
dc.contributor.author	Jiang, J
dc.contributor.author	Larochelle, H
dc.date.accessioned	2021-05-11T00:44:02Z
dc.date.available	2021-05-11T00:44:02Z
dc.date.issued	2020-06-21
dc.identifier.citation	2020
dc.identifier.uri	http://hdl.handle.net/10453/148823
dc.description.abstract	Few-shot classification aims to recognize unseen classes when presented with only a small number of samples. We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data sources. This problem has seen growing interest and has inspired the development of benchmarks such as Meta-Dataset. A key challenge in this multi-domain setting is to effectively integrate the feature representations from the diverse set of training domains. Here, we propose a Universal Representation Transformer (URT) layer, that meta-learns to leverage universal features for few-shot classification by dynamically re-weighting and composing the most appropriate domain-specific representations. In experiments, we show that URT sets a new state-of-the-art result on Meta-Dataset. Specifically, it achieves top-performance on the highest number of data sources compared to competing methods. We analyze variants of URT and present a visualization of the attention score heatmaps that sheds light on how the model performs cross-domain generalization. Our code is available at https://github.com/liulu112601/URT.
dc.language	en
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	A Universal Representation Transformer Layer for Few-Shot Image Classification
dc.type	Journal Article
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	closed_access	*
dc.date.updated	2021-05-11T00:43:59Z

Abstract:

Few-shot classification aims to recognize unseen classes when presented with only a small number of samples. We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data sources. This problem has seen growing interest and has inspired the development of benchmarks such as Meta-Dataset. A key challenge in this multi-domain setting is to effectively integrate the feature representations from the diverse set of training domains. Here, we propose a Universal Representation Transformer (URT) layer, that meta-learns to leverage universal features for few-shot classification by dynamically re-weighting and composing the most appropriate domain-specific representations. In experiments, we show that URT sets a new state-of-the-art result on Meta-Dataset. Specifically, it achieves top-performance on the highest number of data sources compared to competing methods. We analyze variants of URT and present a visualization of the attention score heatmaps that sheds light on how the model performs cross-domain generalization. Our code is available at https://github.com/liulu112601/URT.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/148823