Workshop on Text and Speech Annotation
Linguistic Description and the Creation of Digital Language Resources
Aim of the Workshop
The workshop will address the creation and usage of digital speech and text resources. A particular focus lies on the role that digital resources play in linguistic research. The course will give a general introduction to the generation of data collections, small language corpora and on-line linguistic knowledge bases.
The target group of the workshop are linguists interested in strengthening the empirical underpinning of their typological and/or theoretical work. Graduate students as well as faculty members are welcome.
Our focus will be on collaborative speech and text annotation making use of the new facilities that modern web-technology and linguistic software offer to linguists.
The workshop will feature 4 days of course work covering practical issues relating to linguistic annotation of speech and text as well as web-editing as a means to create public linguistic knowledge bases through collaborative on-line editing.
We will offer hands-on introductory courses to TypeCraft, a multi-lingual on-line database for text annotation developed at the University for Science and Technology, Trondheim, Norway by Dorothee Beermann and Pavel Mihaylov, and Praat, a freely available signal analysis software developed by Paul Boersma and David Weenink of the University of Amsterdam, The Netherlands.
Next to an introduction to text and speech annotation we are planning several guest lectures on selected linguistic topics of particular relevance to language annotation and linguistic analysis.