Article | NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland | Nefnir: A high accuracy lemmatizer for Icelandic Linköping University Electronic Press Conference Proceedings

Title:
Nefnir: A high accuracy lemmatizer for Icelandic
Author:
Svanhvít Ingólfsdóttir: Department of Computer Science, Reykjavik University, Iceland
Hrafn Loftsson: Department of Computer Science, Reykjavik University, Iceland
Jón Daðason: The Árni Magnússon Institute for Icelandic Studies, University of Iceland, Iceland
Kristín Bjarnadóttir: The Árni Magnússon Institute for Icelandic Studies, University of Iceland, Iceland
Year:
2019
Conference:
NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland
Issue:
167
Article no.:
033
Pages:
310--315
No. of pages:
6
Publication type:
Abstract and Fulltext
Published:
2019-10-02
ISBN:
978-91-7929-995-8
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Series:
NEALT Proceedings Series
Publisher:
Linköping University Electronic Press, Linköpings universitet



Lemmatization, finding the basic morphological form of a word in a corpus, is an important step in many natural language processing tasks when working with morphologically rich languages. We describe and evaluate Nefnir, a new open-source lemmatizer for Icelandic. Nefnir uses suffix substitution rules, derived from a large morphological database, to lemmatize tagged text. Evaluation shows that for correctly tagged text, Nefnir obtains an accuracy of 99.55%, and for text tagged with a PoS tagger, the accuracy obtained is 96.88%.
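
To make the idea of suffix substitution rules concrete, here is a minimal Python sketch. It is not Nefnir's actual implementation, whose rule derivation and selection may differ. The function names, the tag labels (`noun-dat-pl`, `noun-nom-pl`) and the toy entries standing in for the morphological database are all assumptions made for illustration. Each rule is obtained by stripping the longest common prefix of an inflected form and its lemma, and at lookup time the longest matching suffix for the given tag is replaced.

```python
from collections import Counter, defaultdict


def derive_rule(form, lemma):
    """Strip the longest common prefix; return (form_suffix, lemma_suffix)."""
    i = 0
    while i < min(len(form), len(lemma)) and form[i] == lemma[i]:
        i += 1
    return form[i:], lemma[i:]


def learn_rules(entries):
    """Count suffix rules per tag from (form, tag, lemma) triples."""
    rules = defaultdict(Counter)
    for form, tag, lemma in entries:
        rules[tag][derive_rule(form, lemma)] += 1
    return rules


def lemmatize(form, tag, rules):
    """Apply the longest matching rule for this tag (ties: most frequent).

    Falls back to returning the form unchanged when no rule applies.
    """
    candidates = [
        (len(form_suffix), count, form_suffix, lemma_suffix)
        for (form_suffix, lemma_suffix), count in rules.get(tag, {}).items()
        if form.endswith(form_suffix)
    ]
    if not candidates:
        return form
    _, _, form_suffix, lemma_suffix = max(candidates)
    return form[: len(form) - len(form_suffix)] + lemma_suffix


# Hypothetical toy entries standing in for a morphological database.
TOY_ENTRIES = [
    ("hestum", "noun-dat-pl", "hestur"),
    ("bátum", "noun-dat-pl", "bátur"),
    ("hestar", "noun-nom-pl", "hestur"),
]

rules = learn_rules(TOY_ENTRIES)
print(lemmatize("fiskum", "noun-dat-pl", rules))  # fiskur
```

Because the rules are keyed by tag, the quality of the output depends on the tag being correct, which is consistent with the gap the abstract reports between correctly tagged text and text tagged with a PoS tagger.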

Keywords: lemmatization, morphologically rich languages, morphological database
