Executing SQL queries over encrypted character strings in the Database-As-Service model

Wu, Z; Xu, G; Yu, Z; Yi, X; Chen, E; Zhang, Y

Publication Type: Journal Article
Citation: Knowledge-Based Systems, 2012, 35, pp. 332-348
Issue Date: 2012-11-01

Closed Access

	Filename	Description	Size	Format
	2013002759OK.pdf		1.2 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wu, Z	en_US
dc.contributor.author	Xu, G https://orcid.org/0000-0003-4493-6663	en_US
dc.contributor.author	Yu, Z	en_US
dc.contributor.author	Yi, X	en_US
dc.contributor.author	Chen, E	en_US
dc.contributor.author	Zhang, Y	en_US
dc.date.issued	2012-11-01	en_US
dc.identifier.citation	Knowledge-Based Systems, 2012, 35 pp. 332 - 348	en_US
dc.identifier.issn	0950-7051	en_US
dc.identifier.uri	http://hdl.handle.net/10453/29137
dc.identifier.uri	http://hdl.handle.net/10453/33754
dc.relation.ispartof	Knowledge-Based Systems	en_US
dc.relation.isbasedon	10.1016/j.knosys.2012.05.009	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Executing SQL queries over encrypted character strings in the Database-As-Service model	en_US
dc.type	Journal Article
utslib.citation.volume	35	en_US
utslib.for	080402 Data Encryption	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	15 Commerce, Management, Tourism and Services	en_US
utslib.for	17 Psychology and Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	35	en_US

Abstract:

Rapid advances in networking technologies have prompted the emergence of the "software as service" model for enterprise computing, which is quickly becoming a key industry. The "database as service" model lets users store, modify, and retrieve data from anywhere in the world as long as they have Internet access, and it is therefore increasingly popular in enterprise data management systems. However, this model introduces several challenges, an essential one being how to execute SQL queries over encrypted data efficiently. To ensure data security, sensitive data are generally encrypted at the trusted client site before being stored at the untrusted database service provider, which unfortunately means that SQL queries cannot be executed directly over the encrypted data at the provider. In this paper we focus only on how to query encrypted character strings efficiently. When storing character strings at the database service provider, we store not only the encrypted strings themselves but also characteristic index values generated from them, kept in an additional field. When querying the encrypted strings, we first execute a coarse query over the characteristic index fields at the provider to filter out most of the tuples unrelated to the query conditions, and then decrypt the remaining tuples and execute a refined query over them at the client site. We define an n-phase reachability matrix for a character string and use it as the characteristic index value; based on this definition, we present theorems for splitting a SQL query into a server-side representation and a client-side representation, which partitions the computation of a query across the client and the server and thus improves query performance. Finally, experimental results validate the functionality and effectiveness of our strategy. © 2012 Elsevier B.V. All rights reserved.
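
The two-step strategy in the abstract (a coarse query over index values at the provider, then a refined query over decrypted tuples at the client) can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not define the n-phase reachability matrix, so the sketch swaps in a much simpler stand-in index, a bitmask built from keyed hashes of adjacent character pairs. The cipher is a toy placeholder that is not secure, and every name here (`KEY`, `INDEX_BITS`, `toy_encrypt`, `pair_index`, `client_query`, and so on) is hypothetical.

```python
import hashlib
import hmac

KEY = b"demo-client-key"   # hypothetical secret held only by the client
INDEX_BITS = 64            # assumed width of the stand-in index field


def _keystream(key, n):
    """Toy keystream (NOT secure); a placeholder for a real cipher."""
    out, counter = b"", 0
    while len(out) < n:
        out += hashlib.sha256(key + counter.to_bytes(4, "big")).digest()
        counter += 1
    return out[:n]


def toy_encrypt(key, text):
    data = text.encode("utf-8")
    return bytes(a ^ b for a, b in zip(data, _keystream(key, len(data))))


def toy_decrypt(key, blob):
    plain = bytes(a ^ b for a, b in zip(blob, _keystream(key, len(blob))))
    return plain.decode("utf-8")


def pair_index(key, text):
    """Stand-in index: set one bit per adjacent character pair.

    This is NOT the paper's n-phase reachability matrix, only a simple
    substitute with the same role (a coarse, lossy filter).
    """
    mask = 0
    for i in range(len(text) - 1):
        digest = hmac.new(key, text[i:i + 2].encode("utf-8"), hashlib.sha256).digest()
        mask |= 1 << (digest[0] % INDEX_BITS)
    return mask


def store(table, key, text):
    """Client side: encrypt the string and attach its index value."""
    table.append((toy_encrypt(key, text), pair_index(key, text)))


def server_coarse_query(table, pattern_mask):
    """Server side: sees only ciphertext and index; keeps rows whose
    index contains every bit of the pattern's index."""
    return [row for row in table if (row[1] & pattern_mask) == pattern_mask]


def client_query(table, key, substring):
    """Client side: coarse filter at the server, then decrypt and refine."""
    candidates = server_coarse_query(table, pair_index(key, substring))
    plaintexts = (toy_decrypt(key, blob) for blob, _ in candidates)
    return [p for p in plaintexts if substring in p]


table = []
for name in ["alice johnson", "bob smith", "carol jackson", "dave"]:
    store(table, KEY, name)

matches = client_query(table, KEY, "son")  # roughly: WHERE name LIKE '%son%'
```

Because every adjacent character pair of a substring also occurs in any string that contains it, the coarse step in this sketch never discards a true match. It may let false positives through, which is why the client-side refinement after decryption is still needed.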

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/29137