MetaStore User Documentation
MetaStore is the catch-all data publishing platform of the MPCDF. It is meant as a place to create and publish metadata, describing datasets stored in the various storage systems at MPCDF. Beside this use case, small datasets (<1GB) can be uploaded directly into MetaStore. Digital Object Identifiers (DOI) can be added to the describing metadata.
You can attribute to each dataset a DOI via DataCite. The page of your dataset on MetaStore will be then used as a landing page for the DOI. You can also create a DOI for your dataset later in time, and not necessarily directly at the upload of the dataset. You must just be careful that, once a DOI is created, you cannot delete it anymore.
Each dataset can contain at least one or more resources. A resource can be either an URL or an uploaded file (smaller than 1GB).
Who can use it
If your institute has an account on MetaStore, you can use it with this account. Know that in doing so, you are bound to the agreement with MPCDF about data quality.
MetaStore is not meant as a platform for the individual researcher. Instead, access and permissions are granted to Max Planck institutes, groups, departments or big research projects. As MetaStore supports nested organisations, it is possible to have groups or departments below a Max Planck Institute, but the top level alway has to be a MPI. If you are interested in using MetaStore, please name one or two administrators from your institute who will be in charge to manage your data publications. You can contact us via email@example.com
Creating a dataset
There are several ways to create, update and delete datasets on MetaStore.
Supported dataset schemas
As the main use case of MetaStore is the assignment of DOIs, the default metadata set is DataCite’s metadata schema (in version 4.4). To keep the creation of metadata as easy and simple as possible, the default method of creating a metadata set in MetaStore offers the necessary key value fields as simple full text fields. In contrast, the extended metadata schema supports the full functionality of DataCite’s metadata schema, including the controled vocabulary. There are two metadata schemas allowed on MetaStore, all from DataCite.
Digital Object Identifier (DOI)
Technically spoken, a Digital Object Identifier (DOI) is a link, pointing to a landing page or directly to piece of data which is available via the World Wide Web. It is a so called persistent identifier (PID), which still should be available when the data behind it is no longer available. This property makes a DOI the perfect identifier for big datasets stored at MPCDF: when the data moves to another place, for example from Nexus S3 to the HPSS tape library, the DOI will still stay the same and references in research articles or other kinds of documents will still point to the data’s landing page, which is the record in MetaStore.