A Framework for the Design, Development, Testing and Deployment of Reliable Big Data Platforms
- Publisher:
- Institute of Electrical and Electronics Engineers (IEEE)
- Publication Type:
- Conference Proceeding
- Citation:
- Proceedings - 2022 IEEE International Conference on Big Data, Big Data 2022, 2022, 00, pp. 2660-2666
- Issue Date:
- 2022-01-01
Embargoed
| Filename | Description | Size |
| --- | --- | --- |
| 2022 IEEE Big Data Methods - Camera Ready V1.pdf | Accepted version | 287.05 kB |
This item is currently unavailable due to the publisher's embargo.
We consider the problem of reliability in big data science projects that comprise multiple computing platforms and complex data-harnessing architectures. Specifically, we examine their ability to reliably capture, process, and analyze high-frequency streaming data from vast, complex systems, and to scale effectively for deployment in demanding domains such as clinical care, smart cities, or extreme climatic work environments. This paper introduces a framework for building reliable data science projects by integrating the computing principles of autonomy, local responsibility, fault tolerance, symmetry, decentralization, well-understood building blocks, and simplicity. The framework is applied to the development of a decoupled data pipeline, demonstrated through a case study on pre-deployment acclimation strategies, which is continuously monitored so that reliability and availability can be effectively quantified.
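The full paper is embargoed, but the abstract's core idea of a decoupled pipeline whose stages are autonomous, locally responsible for failures, and continuously monitored for availability can be illustrated with a minimal sketch. All names here (`DecoupledPipeline`, `ingest`, `analyze`, `availability`) are illustrative assumptions, not APIs from the paper: stages communicate only through a bounded queue (the decoupling boundary), the processing stage retries failed records as a simple fault-tolerance policy, and availability is quantified as the fraction of successful processing attempts.

```python
import queue


class DecoupledPipeline:
    """Hypothetical sketch of a decoupled capture/process pipeline
    with basic fault tolerance and an availability metric.
    (Names and structure are assumptions, not from the paper.)"""

    def __init__(self, maxsize=100):
        self.buffer = queue.Queue(maxsize=maxsize)  # decoupling boundary between stages
        self.processed = []   # results emitted by the processing stage
        self.failures = 0     # failed processing attempts
        self.attempts = 0     # total processing attempts

    def ingest(self, readings):
        """Capture stage: push raw streaming readings into the buffer."""
        for r in readings:
            self.buffer.put(r)

    def analyze(self, transform, max_retries=2):
        """Process stage: apply `transform` to each buffered record,
        retrying on failure (local responsibility for faults)."""
        while not self.buffer.empty():
            record = self.buffer.get()
            for _ in range(max_retries + 1):
                self.attempts += 1
                try:
                    self.processed.append(transform(record))
                    break
                except Exception:
                    self.failures += 1  # record the fault and retry

    def availability(self):
        """Monitored metric: fraction of processing attempts that succeeded."""
        if self.attempts == 0:
            return 1.0
        return 1 - self.failures / self.attempts
```

Because the two stages share only the queue, either side can be restarted, replaced, or scaled out independently, which is one plausible reading of the "well-understood building blocks" and "decentralization" principles named in the abstract.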