Virtual Collection Registry

12 April 2021

Virtual Collection Registry

1. Virtual Collections

Any form of research relies on reproducibility and citability. Usually a scientific publication will come with a bibliography based on persistent references. The same should apply for the documentation of the research data. This has become a topic of growing importance for researchers, as more and more research data becomes accessible online, gets re-used and cited. Furthermore, in the preparation of research data sets, the need for advanced ways to versionise, group, and share data both within and outside organisations has become paramount.

A Virtual Collection (VC) is a convenient means for referencing research data and other sources. It allows researchers to create an aggregation of various data resources connected to a certain research purpose. The VC may cover resources from various repositories or other services, such as the result of database queries, as long as they are persistent. These resources will most probably have been generated by different researchers and teams and managed by different organisations.

Since metadata is open and can be combined in an autonomous way, any researcher can build virtual collections. Ultimately, researchers, as well as other actors of the research cycle, may want to access the resources that are aggregated in such virtual collections and may require access permissions to do so.

While it is impossible to create new data-sets/corpora for every occasion and purpose, VCs will allow the researcher to arrange and re-use existing resources and collections for new purposes. VCs are independent of any particular resource repository implementation, hence the aim is to facilitate their creation and use from as many resource repositories as possible.

2. The Virtual Collection Registry (VCR)

The SSHOC Virtual Collection Registry offers a researcher-friendly way to organise research data references, for example:

The Virtual Collection Registry (VCR), currently operated by CLARIN, the European Research Infrastructure for Language Resources and Technology, enables the creation of DOI-identifiable VCs across the SSH spectrum, and beyond, as the VCR metadata model is not limited to SSH. It provides a way for registering, accessing and discovering VCs by simplifying access for researchers.

The basis for the creation of virtual collections is a joint domain of compatible metadata descriptions. Thus, virtual collections can boost the re-usability of existing resources, therefore facilitating empirically sound e-Science in the arts and humanities.

3. Use the VCR to:

4. Using VCs: Browsing and inspecting, sharing and citing via DOI, sending to switchboard for processing:

1. Browsing the VCR

1. Browsing the VCR

_2a. Inspecting the VCR_

2a. Inspecting the VCR

_2b. Inspecting the VCR_

2b. Inspecting the VCR

_3. Sharing via DOI_

3. Sharing via DOI

5. Creating VCs: How to provide context for research:

Go to https://collections.clarin.eu/public?0, log on and click on Create button.

How to create VCs

Name it in field Name and do fill in other fields, in particular Authors

How to create VCs

and References.

How to create VCs

Press Save Collection:

How to create VCs

6. Third party data catalogue integration: How to create VCs via a search in an external data catalogue

How to create VCs via a search in an external data catalogue

Save results from CLARIN Virtual Language Observatory (VLO)

How to create VCs via a search in an external data catalogue

Submit search results to Virtual Collection Registry

How to create VCs via a search in an external data catalogue

The VCR is one of the EOSC thematic services and registered at the EOSC portal.

Virtual Collections

References: