Article | Proceedings of the NoDaLiDa 2019 Workshop on Constraint Grammar - Methods, Tools and Applications, 30 September 2019, Turku, Finland | Tagging What Isn’t There: Enriching CG Annotation With Implicit Information Linköping University Electronic Press Conference Proceedings

Title: Tagging What Isn’t There: Enriching CG Annotation With Implicit Information
Author: Eckhard Bick (Institute of Language and Communication, University of Southern Denmark, Denmark)
Download: Full text (pdf)
Year: 2019
Conference: Proceedings of the NoDaLiDa 2019 Workshop on Constraint Grammar - Methods, Tools and Applications, 30 September 2019, Turku, Finland
Issue: 168
Article no.: 002
Pages: 5-11
No. of pages: 7
Publication type: Abstract and Fulltext
Published: 2019-12-03
ISBN: 978-91-7929-918-7
Series: Linköping Electronic Conference Proceedings
ISSN (print): 1650-3686
ISSN (online): 1650-3740
Series: NEALT Proceedings Series
Publisher: Linköping University Electronic Press, Linköpings universitet



This paper examines ways to make existing Constraint Grammar (CG) annotation grammatically more explicit, allowing corpus users and application programs, such as machine translation (MT), to refer to context-implied grammatical features in a more direct fashion. Two types of categories are addressed. First, morphological categories are propagated to words that leave them under-specified (e.g. number and definiteness for Danish adjectives) or unexpressed (e.g. person-number for Danish verbs). Second, we also introduce new categories, such as aspect and future tense for Danish, that may be morphologically explicit in a given MT target language, but do not exist in the source language. In a pilot evaluation of four categories in the context of Danish-Greenlandic MT, the implemented enrichment grammar for Danish achieved F-scores of 97% for propagated categories and 85% for new categories. In addition to feature tagging, structural annotation is also made more explicit, adding secondary dependency links for e.g. the subjects of relative and infinitive clauses, or attribute links between subject complements and subjects.

Keywords: Constraint Grammar, Morphology, Feature Propagation, Machine Translation, Tense-Aspect-Mode (TAM) tagging
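
As a rough illustration of the first enrichment type described in the abstract (feature propagation), the sketch below copies number and definiteness from a noun head to a dependent adjective that leaves them unspecified. It is not the paper's CG grammar: the `Token` class, the `propagate_from_head` function and the tag names (`SG`, `PL`, `DEF`, `IDF`, `ADJ`, `N`) are hypothetical placeholders chosen for this example, and a real rule set would consult more context than a single head link.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set

# Hypothetical tag inventories; the paper's actual tagset may differ.
NUMBER = {"SG", "PL"}
DEFINITENESS = {"DEF", "IDF"}


@dataclass
class Token:
    form: str
    pos: str
    tags: Set[str] = field(default_factory=set)
    head: Optional[int] = None  # index of the dependency head, if any


def propagate_from_head(tokens: List[Token]) -> List[Token]:
    """Copy number/definiteness from a noun head to adjectives lacking them."""
    for tok in tokens:
        if tok.pos != "ADJ" or tok.head is None:
            continue
        head = tokens[tok.head]
        if head.pos != "N":
            continue
        for category in (NUMBER, DEFINITENESS):
            # Only fill a category the adjective leaves unspecified.
            if not (tok.tags & category):
                tok.tags |= head.tags & category
    return tokens


# "store huse" ("big houses"): the adjective form carries no tags here,
# so both features are taken from the plural indefinite noun.
sentence = [
    Token("store", "ADJ", set(), head=1),
    Token("huse", "N", {"PL", "IDF"}),
]
propagate_from_head(sentence)
print(sorted(sentence[0].tags))  # ['IDF', 'PL']
```

Features the adjective already carries are left untouched, so the sketch only adds information and never overrides an existing analysis.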




Responsible for this page: Peter Berkesand
Last updated: 2019-11-06