Towards query efficient black-box attacks: An input-free perspective

Du, Y; Fang, M; Yi, J; Cheng, J; Tao, D

Towards query efficient black-box attacks: An input-free perspective

Du, Y Fang, M Yi, J Cheng, J Tao, D

Permalink

Publication Type:: Conference Proceeding
Citation:: Proceedings of the ACM Conference on Computer and Communications Security, 2018, pp. 13 - 24
Issue Date:: 2018-10-15

Closed Access

	Filename	Description	Size
	1809.02918.pdf	Published version	3.5 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Du, Y	en_US
dc.contributor.author	Fang, M	en_US
dc.contributor.author	Yi, J	en_US
dc.contributor.author	Cheng, J	en_US
dc.contributor.author	Tao, D https://orcid.org/0000-0001-7225-5449	en_US
dc.date.issued	2018-10-15	en_US
dc.identifier.citation	Proceedings of the ACM Conference on Computer and Communications Security, 2018, pp. 13 - 24	en_US
dc.identifier.isbn	9781450360043	en_US
dc.identifier.issn	1543-7221	en_US
dc.identifier.uri	http://hdl.handle.net/10453/133518
dc.description.abstract	© 2018 Association for Computing Machinery. Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Through extensive experiments, we show that with only 1,701 queries on average, we can perturb a gray image to any target class of ImageNet with a 100% success rate on InceptionV3. Besides, our algorithm has successfully defeated two real-world systems, the Clarifai food detection API and the Baidu Animal Identification API.	en_US
dc.relation.ispartof	Proceedings of the ACM Conference on Computer and Communications Security	en_US
dc.relation.isbasedon	10.1145/3270101.3270106	en_US
dc.title	Towards query efficient black-box attacks: An input-free perspective	en_US
dc.type	Conference Proceeding
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US

Abstract:

© 2018 Association for Computing Machinery. Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Through extensive experiments, we show that with only 1,701 queries on average, we can perturb a gray image to any target class of ImageNet with a 100% success rate on InceptionV3. Besides, our algorithm has successfully defeated two real-world systems, the Clarifai food detection API and the Baidu Animal Identification API.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/133518