Piloting a Data Publication Service for the BD2K Commons
Objective: Develop a scalable cloud-hosted data publication system for the BD2K Commons
Approach: Integrate BDDS data publication capabilities, KnowEnG data preparation capabilities, and National Data Service (NDS) infrastructure and capabilities.
Publication capabilities are delivered through a hosted service with metadata stored and indexed in the cloud and data storage provided by a specified remote storage provider made accessible via Globus. Published datasets are organized by “communities” and their member “collections”. A variety of specific policies can be set on communities or collections to manage:
•Metadata (schema, requirements)
•Access control (user and group based)
•Curation workflow
•Submission and distribution license
•Storage endpoint
•Persistent identifier provider (DOI, Handle)