Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks

Publisher:
Institute of Electrical and Electronics Engineers (IEEE)
Publication Type:
Conference Proceeding
Citation:
Proceedings - 2022 IEEE 9th International Conference on Data Science and Advanced Analytics, DSAA 2022, 2022, 00, pp. 1-10
Issue Date:
2022-01-01
Filename:
Conditional Variational Autoencoder with Balanced Pre-training.pdf
Description:
Accepted version (Adobe PDF)
Size:
7.87 MB
Class imbalance occurs in many real-world applications, including image classification, where the number of images in each class differs significantly. With imbalanced data, generative adversarial networks (GANs) lean towards the majority class samples. Two recent methods, Balancing GAN (BAGAN) and improved BAGAN (BAGAN-GP), have been proposed as augmentation tools to handle this problem and restore balance to the data. The former pre-trains the autoencoder weights in an unsupervised manner; however, it is unstable when images from different categories share similar features. The latter builds on BAGAN by facilitating supervised autoencoder training, but its pre-training is biased towards the majority classes. In this work, we propose a novel Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks (CAPGAN) as an augmentation tool to generate realistic synthetic images. In particular, we utilize a conditional convolutional variational autoencoder with supervised and balanced pre-training to initialize the GAN, which is then trained with a gradient penalty. Our proposed method outperforms other state-of-the-art methods on highly imbalanced versions of MNIST, Fashion-MNIST, CIFAR-10, and two medical imaging datasets. Our method can synthesize high-quality minority samples in terms of Fréchet inception distance, structural similarity index measure and perceptual quality. The source code is available at https://github.com/alibraytee/CAPGAN.
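
The abstract describes the pipeline only at a high level. The sketch below is an illustrative, hypothetical outline, not the authors' released code (which is in the linked repository), of the three ingredients the abstract names: class-balanced sampling for the pre-training stage, a label-conditioned variational autoencoder, and a WGAN-GP style gradient penalty. It assumes PyTorch and fully connected layers for brevity; names such as CVAE, balanced_loader and gradient_penalty are placeholders.

# Minimal sketch (not the authors' code): class-balanced pre-training of a
# conditional VAE whose decoder could later initialize a GAN generator,
# plus a WGAN-GP style gradient penalty. Sizes assume flattened MNIST images.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader, WeightedRandomSampler

LATENT, N_CLASSES, IMG = 64, 10, 28 * 28  # illustrative sizes

def balanced_loader(dataset, labels, batch_size=64):
    """Sample every class with equal probability, regardless of imbalance."""
    counts = torch.bincount(labels, minlength=N_CLASSES).float()
    weights = (1.0 / counts)[labels]          # rare classes get larger weight
    sampler = WeightedRandomSampler(weights, num_samples=len(labels), replacement=True)
    return DataLoader(dataset, batch_size=batch_size, sampler=sampler)

class CVAE(nn.Module):
    """Conditional VAE: encoder and decoder both see a one-hot class label."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(IMG + N_CLASSES, 256), nn.ReLU())
        self.mu, self.logvar = nn.Linear(256, LATENT), nn.Linear(256, LATENT)
        self.dec = nn.Sequential(nn.Linear(LATENT + N_CLASSES, 256), nn.ReLU(),
                                 nn.Linear(256, IMG), nn.Sigmoid())

    def forward(self, x, y_onehot):
        h = self.enc(torch.cat([x, y_onehot], dim=1))
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterize
        return self.dec(torch.cat([z, y_onehot], dim=1)), mu, logvar

def pretrain_step(vae, opt, x, y):
    """One supervised, class-balanced pre-training step (reconstruction + KL)."""
    y_onehot = F.one_hot(y, N_CLASSES).float()
    recon, mu, logvar = vae(x, y_onehot)
    rec = F.binary_cross_entropy(recon, x, reduction="sum")
    kld = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    loss = rec + kld
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

def gradient_penalty(disc, real, fake, y_onehot):
    """WGAN-GP style penalty on interpolates between real and fake samples."""
    eps = torch.rand(real.size(0), 1, device=real.device)
    x_hat = (eps * real + (1 - eps) * fake).requires_grad_(True)
    grads = torch.autograd.grad(disc(x_hat, y_onehot).sum(), x_hat, create_graph=True)[0]
    return ((grads.norm(2, dim=1) - 1) ** 2).mean()

In a BAGAN-style setup such as the one the abstract builds on, the pre-trained decoder weights would then initialize the GAN generator before adversarial training with the penalty term above; consult the linked repository for the actual architecture and training schedule.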