Article | Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden | Quote Extraction and Attribution from Norwegian Newspapers

Title:
Quote Extraction and Attribution from Norwegian Newspapers
Author:
Andrew Salway: Language and Language Technology Group, Uni Research, Bergen, Norway
Paul Meurer: Language and Language Technology Group, Uni Research, Bergen, Norway
Knut Hofland: Language and Language Technology Group, Uni Research, Bergen, Norway
Øystein Reigem: Language and Language Technology Group, Uni Research, Bergen, Norway
Download:
Full text (pdf)
Year:
2017
Conference:
Proceedings of the 21st Nordic Conference on Computational Linguistics, NoDaLiDa, 22-24 May 2017, Gothenburg, Sweden
Issue:
131
Article no.:
041
Pages:
293-297
No. of pages:
5
Publication type:
Abstract and Fulltext
Published:
2017-05-08
ISBN:
978-91-7685-601-7
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Series:
NEALT Proceedings Series
Publisher:
Linköping University Electronic Press, Linköpings universitet



We present ongoing work that, for the first time, seeks to extract and attribute politicians’ quotations from Norwegian Bokmål newspapers. Our method – using a statistical dependency parser, a few regular expressions and a look-up table – gives modest recall (a best of .570) but very high precision (.978) and attribution accuracy (.987) for a restricted set of speaker names. We suggest that this is already sufficient to support some kinds of important social science research, but also identify ways in which performance could be improved.




Responsible for this page: Peter Berkesand
Last updated: 2017-02-21