Article | NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland | Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content Linköping University Electronic Press Conference Proceedings
Göm menyn

Title:
Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content
Author:
José Carlos Rosales Nuñez: Université Paris Sud, LIMSI, France / Université Paris Saclay, France / INRIA Paris, France Djamé Seddah: INRIA Paris, France Guillaume Wisniewski: Université Paris Sud, LIMSI, France / Université Paris Saclay, Franc
Download:
Full text (pdf)
Year:
2019
Conference:
NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland
Issue:
167
Article no.:
001
Pages:
2--14
No. of pages:
12
Publication type:
Abstract and Fulltext
Published:
2019-10-02
ISBN:
978-91-7929-995-8
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Series:
NEALT Proceedings Series
Publisher:
Linköping University Electronic Press, Linköpings universitet


Export in BibTex, RIS or text

This work compares the performances achieved by Phrase-Based Statistical Machine Translation systems (PB- SMT) and attention-based Neuronal Machine Translation systems (NMT) when translating User Generated Content (UGC), as encountered in social medias, from French to English. We show that, contrary to what could be expected, PBSMT outperforms NMT when translating non-canonical inputs. Our error analysis uncovers the speci- ficities of UGC that are problematic for sequential NMT architectures and suggests new avenue for improving NMT models.

Keywords: Machine Translation User Generated Content Neural Machine Translation PBSMT

NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Author:
José Carlos Rosales Nuñez, Djamé Seddah, Guillaume Wisniewski
Title:
Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content
References:
No references available

NEAL Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa), September 30 - October 2, Turku, Finland

Author:
José Carlos Rosales Nuñez, Djamé Seddah, Guillaume Wisniewski
Title:
Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content
Note: the following are taken directly from CrossRef
Citations:
No citations available at the moment


Responsible for this page: Peter Berkesand
Last updated: 2019-11-06