Value, but high costs in post-deposition data Curation
Hoopen, PT
Amid, C
Buttigieg, PL
Pafilis, E
Bravakos, P
O-Tárraga, AMC
Gibson, R
Kahlke, T
Legaki, A
Murthy, KN
Papastefanou, G
Pereira, E
Rossello, M
Toribio, AL
Cochrane, G
- Publication Type:
- Journal Article
- Citation:
- Database, 2016, 2016
- Issue Date:
- 2016-01-01
Open Access
Copyright Clearance Process
- Recently Added
- In Progress
- Open Access
This item is open access.
Full metadata record
| Field | Value | Language |
|---|---|---|
| dc.contributor.author | Hoopen, PT | en_US |
| dc.contributor.author | Amid, C | en_US |
| dc.contributor.author | Buttigieg, PL | en_US |
| dc.contributor.author | Pafilis, E | en_US |
| dc.contributor.author | Bravakos, P | en_US |
| dc.contributor.author | O-Tárraga, AMC | en_US |
| dc.contributor.author | Gibson, R | en_US |
| dc.contributor.author |
Kahlke, T |
en_US |
| dc.contributor.author | Legaki, A | en_US |
| dc.contributor.author | Murthy, KN | en_US |
| dc.contributor.author | Papastefanou, G | en_US |
| dc.contributor.author | Pereira, E | en_US |
| dc.contributor.author | Rossello, M | en_US |
| dc.contributor.author | Toribio, AL | en_US |
| dc.contributor.author | Cochrane, G | en_US |
| dc.date.available | 2015-12-14 | en_US |
| dc.date.issued | 2016-01-01 | en_US |
| dc.identifier.citation | Database, 2016, 2016 | en_US |
| dc.identifier.issn | 1758-0463 | en_US |
| dc.identifier.uri | http://hdl.handle.net/10453/90496 | |
| dc.description.abstract | © The Author(s) 2016. Published by Oxford University Press. Discoverability of sequence data in primary data archives is proportional to the richness of contextual information associated with the data. Here, we describe an exercise in the improvement of contextual information surrounding sample records associated with metagenomics sequence reads available in the European Nucleotide Archive. We outline the annotation process and summarize findings of this effort aimed at increasing usability of publicly available environmental data. Furthermore, we emphasize the benefits of such an exercise and detail its costs. We conclude that such a third party annotation approach is expensive and has value as an element of curation, but should form only part of a more sustainable submitter-driven approach. | en_US |
| dc.relation.ispartof | Database | en_US |
| dc.relation.isbasedon | 10.1093/database/bav126 | en_US |
| dc.subject.mesh | Humans | en_US |
| dc.subject.mesh | Data Collection | en_US |
| dc.subject.mesh | Sequence Analysis | en_US |
| dc.subject.mesh | Computational Biology | en_US |
| dc.subject.mesh | Ecosystem | en_US |
| dc.subject.mesh | Geography | en_US |
| dc.subject.mesh | Semantics | en_US |
| dc.subject.mesh | Databases, Nucleic Acid | en_US |
| dc.subject.mesh | Europe | en_US |
| dc.subject.mesh | Metagenomics | en_US |
| dc.subject.mesh | Molecular Sequence Annotation | en_US |
| dc.subject.mesh | Microbiota | en_US |
| dc.title | Value, but high costs in post-deposition data Curation | en_US |
| dc.type | Journal Article | |
| utslib.citation.volume | 2016 | en_US |
| utslib.for | 0806 Information Systems | en_US |
| utslib.for | 0804 Data Format | en_US |
| utslib.for | 0807 Library and Information Studies | en_US |
| pubs.embargo.period | Not known | en_US |
| pubs.organisational-group | /University of Technology Sydney | |
| pubs.organisational-group | /University of Technology Sydney/Faculty of Science | |
| pubs.organisational-group | /University of Technology Sydney/Strength - C3 - Climate Change Cluster | |
| utslib.copyright.status | open_access | |
| pubs.publication-status | Published | en_US |
| pubs.volume | 2016 | en_US |
Abstract:
© The Author(s) 2016. Published by Oxford University Press. Discoverability of sequence data in primary data archives is proportional to the richness of contextual information associated with the data. Here, we describe an exercise in the improvement of contextual information surrounding sample records associated with metagenomics sequence reads available in the European Nucleotide Archive. We outline the annotation process and summarize findings of this effort aimed at increasing usability of publicly available environmental data. Furthermore, we emphasize the benefits of such an exercise and detail its costs. We conclude that such a third party annotation approach is expensive and has value as an element of curation, but should form only part of a more sustainable submitter-driven approach.
Please use this identifier to cite or link to this item:
Download statistics for the last 12 months
Not enough data to produce graph
