Article | NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland | The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services Linköping University Electronic Press Conference Proceedings
Göm menyn

Title:
The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services
Author:
Mikko Aulamo: Department of Digital Humanities / HELDIG, University of Helsinki, Finland Jörg Tiedemann: Department of Digital Humanities / HELDIG, University of Helsinki, Finland
Download:
Full text (pdf)
Year:
2019
Conference:
NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland
Issue:
167
Article no.:
046
Pages:
389-394
No. of pages:
6
Publication type:
Abstract and Fulltext
Published:
2019-10-02
ISBN:
978-91-7929-995-8
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Series:
NEALT Proceedings Series
Publisher:
Linköping University Electronic Press, Linköpings universitet


Export in BibTex, RIS or text

This paper presents a flexible and powerful system for creating parallel corpora and for running neural machine translation services. Our package provides a scalable data repository backend that offers transparent data pre-processing pipelines and automatic alignment procedures that facilitate the compilation of extensive parallel data sets from a variety of sources. Moreover, we develop a web-based interface that constitutes an intuitive frontend for end-users of the platform. The whole system can easily be distributed over virtual machines and implements a sophisticated permission system with secure connections and a flexible database for storing arbitrary metadata. Furthermore, we also provide an interface for neural machine translation that can run as a service on virtual machines, which also incorporates a connection to the data repository software.

NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Author:
Mikko Aulamo, Jörg Tiedemann
Title:
The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services
References:
No references available

NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Author:
Mikko Aulamo, Jörg Tiedemann
Title:
The OPUS Resource Repository: An Open Package for Creating Parallel Corpora and Machine Translation Services
Note: the following are taken directly from CrossRef
Citations:
No citations available at the moment


Responsible for this page: Peter Berkesand
Last updated: 2019-11-06