Article | Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26‚Äď28 October 2016, CLARIN Common Language Resources and Technology Infrastructure | The Curation Module and Statistical Analysis on VLO Metadata Quality
Göm menyn

Title:
The Curation Module and Statistical Analysis on VLO Metadata Quality
Author:
Davor Ostojic: ACDH-OEAW, Vienna, Austria Go Sugimoto: ACDH-OEAW, Vienna, Austria Matej Ďurčo: ACDH-OEAW, Vienna, Austria
Download:
Full text (pdf)
Year:
2017
Conference:
Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26‚Äď28 October 2016, CLARIN Common Language Resources and Technology Infrastructure
Issue:
136
Article no.:
007
Pages:
90-101
No. of pages:
12
Publication type:
Abstract and Fulltext
Published:
2017-05-23
ISBN:
978-91-7685-499-0
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Publisher:
Linköping University Electronic Press, Linköpings universitet


Export in BibTex, RIS or text

The Curation Module is developed to facilitate the metadata ingestion and curation process of the Virtual Language Observatory (VLO) by providing a systematic method to measure metadata quality and a user-friendly interface to inspect profiles, records, and collections of the Component MetaData Infrastructure (CMDI) used for the VLO. A large amount of useful statistics generate a comprehensive data matrix including information about the quality score, publication status, facet coverage, and metadata header, as well as the number of records and concepts. The module helps various stakeholders to automatically and systematically identify the metadata problems. Whilst metadata modellers can evaluate the quality of shared profiles, data creators assess the validity of newly created records. Data providers can use it for the improvement of their metadata for better discoverability and accessibility of valuable linguistic contents, whereas working groups could examine the actual use of profiles and records to define the next version of CMDI and VLO. Thus, the Curation Module supports all stages of metadata management and fosters the analysis and improvement of metadata quality to enhance the CLARIN services. In this article, we present a selection of statistical information on the metadata quality made possible by the Curation Module.

Keywords: Metadata curation, Quality control, Metadata analysis and assessment, Curation module, VLO (Virtual Language Observatory), CMDI (Component Metadata Infrastructure)

Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26‚Äď28 October 2016, CLARIN Common Language Resources and Technology Infrastructure

Author:
Davor Ostojic, Go Sugimoto, Matej Ďurčo
Title:
The Curation Module and Statistical Analysis on VLO Metadata Quality
References:

[Durco 2013] M. Durco. 2013. SMC4LRT - Semantic Mapping Component for Language Resources and Technology. (masters)Technical University, Vienna, Austria. http://permalink.obvsg.at/AC11178534


[Durco and Mörth 2014] M. Durco, and K. Mörth. 2014. Towards a DH Knowledge Hub - Step 1: Vocabularies. In CLARIN Annual Conference Soesterberg, Netherlands.


[Kemps-Snijders 2014] Kemps-Snijders, M. 2014. Metadata quality assurance for CLARIN.


[King, Ostojic, Durco, and Sugimoto 2016] M. King, D. Ostojic, M. Durco, and G. Sugimoto. 2016. Variability of the Facet Values in the VLO‚Äďa Case for Metadata Curation. In Selected Papers from the CLARIN Annual Conference 2015, October 14‚Äď16, 2015, Wroclaw, Poland (pp. 25‚Äď44) Link√∂ping University Electronic Press. http://www.ep.liu.se/ecp/123/003/ecp15123003.pdf


[Odijk 2014] J. Odijk. 2014. Discovering Resources in CLARIN: Problems and Suggestions for Solutions. http://dspace.library.uu.nl/handle/1874/303788


[Trippel, Broeder, Durco, and Ohren 2014] T. Trippel, D. Broeder, M. Durco, and O. Ohren. 2014. Towards automatic quality assessment of component metadata. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (pp. 3851‚Äď3856) Reykjavik, Iceland: European Language Resources Association (ELRA). http://lrec2014.lrec-conf.org/en/

Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26‚Äď28 October 2016, CLARIN Common Language Resources and Technology Infrastructure

Author:
Davor Ostojic, Go Sugimoto, Matej Ďurčo
Title:
The Curation Module and Statistical Analysis on VLO Metadata Quality
Note: the following are taken directly from CrossRef
Citations:
No citations available at the moment


Responsible for this page: Peter Berkesand
Last updated: 2017-02-21