
Title:
Málrómur: A Manually Verified Corpus of Recorded Icelandic Speech
Author:
Steinþór Steingrímsson: The Árni Magnússon Institute for Icelandic Studies, Iceland
Jón Guðnason: Reykjavik University, Iceland
Sigrún Helgadóttir: The Árni Magnússon Institute for Icelandic Studies, Iceland
Eiríkur Rögnvaldsson: Department of Icelandic, University of Iceland, Iceland
Download:
Full text (pdf)
Year:
2017
Conference:
Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden
Issue:
131
Article no.:
029
Pages:
237-240
No. of pages:
4
Publication type:
Abstract and Fulltext
Published:
2017-05-08
ISBN:
978-91-7685-601-7
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Series:
NEALT Proceedings Series
Publisher:
Linköping University Electronic Press, Linköpings universitet



This paper describes the Málrómur corpus, an open, manually verified, Icelandic speech corpus. The recordings were collected in 2011–2012 by Reykjavik University and the Icelandic Center for Language Technology in cooperation with Google. 152 hours of speech were recorded from 563 participants. The recordings were subsequently manually inspected by evaluators listening to all the segments, determining whether any given segment contains the utterance the participant was supposed to read, and nothing else. Out of 127,286 recorded segments 108,568 were approved and 18,718 deemed unsatisfactory.

References:

Jón Guðnason, Oddur Kjartansson, Jökull Jóhannsson, Elín Carstensdóttir, Hannes Högni Vilhjálmsson, Hrafn Loftsson, Sigrún Helgadóttir, Kristín M. Jóhannsdóttir, and Eiríkur Rögnvaldsson. 2012. Almannarómur: An Open Icelandic Speech Corpus. In Proceedings of SLTU ’12, 3rd Workshop on Spoken Languages Technologies for Under-Resourced Languages, Cape Town, South Africa.

Sigrún Helgadóttir and Eiríkur Rögnvaldsson. 2013. Language Resources for Icelandic. In K. De Smedt, L. Borin, K. Lindén, B. Maegaard, E. Rögnvaldsson, and K. Vider, editors, Proceedings of the Workshop on Nordic Language Research Infrastructure at NODALIDA 2013, pages 60–76. NEALT Proceedings Series 20. Linköping Electronic Conference Proceedings, Linköping, Sweden.

Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro Moreno, and Mike LeBeau. 2010. Building Transcribed Speech Corpora Quickly and Cheaply for Many Languages. In Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010), pages 1914–1917, Makuhari, Chiba, Japan.


