Understanding Metadata Functional Requirements in Genome Curation Work

Hong Huang, Jian Qin, hong huang

Research output: Contribution to journalArticlepeer-review

Abstract

The proliferation of genomic data and their widespread data reuse pose new challenges to effectively manage and curate genomic data. This study contributes towards better understanding of 156 genomics scientists' perception and priorities for metadata functional requirements in genome curation work. Our study was guided by previously identified twenty two metadata functional requirements (Willis, Greenberg, & White, 2012), and intended to define a context-sensitive model of groupings for metadata goals in genome curation. Analysis of the results revealed that genomics scientists recognize specific sets of metadata functional requirements in the genome-curation context. These metadata goals were reduced to six factor constructs. The rankings of these constructs in decreasing order are Portability, Reusability, Manipulability, Sufficiency, Interoperability, and Modularity. The findings indicated that genomics scientists need both domain independent and dependent metadata functional requirements that are primarily related to data comparison, integration, and reuse across platforms and databases. The constructs defined by this study advance the understanding of metadata requirements and their relationships. In addition, the resulting metadata requirement model can serve as a valuable resource to genome scientists, data curators and administrators for designing metadata schemes and developing data-curation policies.

Original languageAmerican English
JournalProceedings of the American Society for Information Science and Technology
Volume50
DOIs
StatePublished - Jan 1 2013

Keywords

  • domain
  • genome curation
  • metadata goals
  • metadata schemes

Disciplines

  • Medicine and Health Sciences

Cite this