Article | Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands | Spokes - a search and exploration service for conversational corpus data
Göm menyn

Title:
Spokes - a search and exploration service for conversational corpus data
Author:
Piotr Pezik: University of Lodz, Corpus & Computational Linguistics Laboratory, Poland
Download:
Full text (pdf)
Year:
2015
Conference:
Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands
Issue:
116
Article no.:
009
Pages:
99-109
No. of pages:
11
Publication type:
Abstract and Fulltext
Published:
2015-08-26
ISBN:
978-91-7685-954-4
Series:
Linköping Electronic Conference Proceedings
ISSN (print):
1650-3686
ISSN (online):
1650-3740
Publisher:
Linköping University Electronic Press, Linköpings universitet


Export in BibTex, RIS or text

Spokes is an online service for conversational corpus data search and exploration, currently developed as part of CLARIN-PL ‚Äď the Polish CLARIN infrastructure. This paper describes the data sets currently available through Spokes, the architecture of the service and the data and metadata search functionality it provides to its users. We also introduce some of the more experimental features which have been developed to facilitate more advanced research on multimodal conversational corpora.

Keywords: conversational corpora;multimedia corpus search engine;CLARIN-PL

Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands

Author:
Piotr Pezik
Title:
Spokes - a search and exploration service for conversational corpus data
References:

Boersma2002.. Paul Boersma. 2002. Praat, a system for doing phonetics by computer. Glot international, 5(9/10):341‚Äď345.


Bolinger1986. Dwight Bolinger. 1986. Intonation and its parts: Melody in spoken English. Stanford University Press.


Coleman et al.2012. John Coleman, Ladan Baghai-Ravary, John Pybus, and Sergio Grau. 2012. Audio BNC: the audio edition of the Spoken British National Corpus.


Douglas2003. Fiona M Douglas. 2003. The scottish corpus of texts and speech: Problems of corpus design. Literary and linguistic computing, 18(1):23‚Äď37.


Du Bois et al.2000. John W. Du Bois, Wallace L. Chafe, Charles Meyer, Sandra A. Thompson, Robert Englebretson, and Nii Martey. 2000. Santa Barbara corpus of spoken American English.


Evert2004. Stefan Evert. 2004. The statistics of word cooccurrences. Ph.D. thesis, PhD Dissertation, Stuttgart University.


Freitas and Santos2008. Tiago Freitas and Fabíola Santos. 2008. Corp-oral: Spontaneous speech corpus for european portuguese. In Proceedings of LREC.


Gasch2010. Joachim Gasch. 2010. Dgd 2.0: A web-based navigation platform for the visualization, presentation and retrieval of german speech corpora. Sprache und Datenverarbeitung, 34(1):27‚Äď38.


Hirschberg and Pierrehumbert1986. Julia Hirschberg and Janet Pierrehumbert. 1986. The intonational structuring of discourse. In Proceedings of the 24th annual meeting on Association for Computational Linguistics, pages 136‚Äď144. Association for Computational Linguistics.


Johannessen et al.2009. Janne Bondi Johannessen, Joel Priestley, Kristin Hagen, Tor Anders √Öfarli, and √ėystein Alexander Vangsnes. 2009. The nordic dialect corpus-an advanced research tool. In Proceedings of the 17th Nordic conference of computational linguistics NODALIDA 2009. NEALT proceedings series, volume 4, pages 73‚Äď80.


M√ľller2007. Meinard M√ľller. 2007. Dynamic time warping. Information retrieval for music and motion, pages 69‚Äď84.


Pezik2012] Piotr Pezik. 2012. Jezyk m√≥wiony w NKJP. In Adam Przepi√≥rkowski, Miroslaw B√°nko, Rafal G√≥rski, and Barbara Lewandowska-Tomaszczyk, editors, Narodowy Korpus J?ezyka Polskiego, pages 37‚Äď47. Wydawnictwo Naukowe PWN, Warszawa.


Walinski and P?ezik2007. Jacek Walinski and Piotr P?ezik. 2007. Web access interface to the PELCRA referential corpus of polish. pages 65‚Äď86. Lang.


Wells and others1997. John C Wells et al. 1997. Sampa computer readable phonetic alphabet. Handbook of standards and resources for spoken language systems, 4.


Wittenburg et al.2006. Peter Wittenburg, Hennie Brugman, Albert Russel, Alex Klassmann, and Han Sloetjes. 2006. Elan: a professional framework for multimodality research. In Proceedings of LREC, volume 2006.

Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands

Author:
Piotr Pezik
Title:
Spokes - a search and exploration service for conversational corpus data
Note: the following are taken directly from CrossRef
Citations:
No citations available at the moment


Responsible for this page: Peter Berkesand
Last updated: 2017-02-21