Typecraft v2.5

Difference between revisions of "Parallel Annotation of Speech and Text"

Line 15:
 
<flashmp3>PSTA03.mp3</flashmp3><br>
 
 
Download files for viewing in the Praat application: Sound: [[File:PSTA03.mp3]], TextGrid: [[File:PSTA03.txt]]
 
<br>
 
----
 
<br>
 
Sentences 4 to 7:<br>Speaker dialect: Trondheim
 
  
<Phrase>10906</Phrase>
+
[[Parallel Processing of Speech and Text Data - Part 3]]
<flashmp3>PSTA04.mp3</flashmp3><br>
+
Download files for viewing in the Praat application: Sound: [[File:PSTA04.mp3]], TextGrid: [[File:PSTA04.txt]]
+
<br>
+
<Phrase>10907</Phrase>
+
<flashmp3>PSTA05.mp3</flashmp3><br>
+
Download files for viewing in the Praat application: Sound: [[File:PSTA05.mp3]], TextGrid: [[File:PSTA05.txt]]
+
<br>
+
<Phrase>10908</Phrase>
+
<flashmp3>PSTA06.mp3</flashmp3><br>
+
Download files for viewing in the Praat application: Sound: [[File:PSTA06.mp3]], TextGrid: [[File:PSTA06.txt]]
+
<br>
+
<Phrase>10909</Phrase>
+
<flashmp3>PSTA07.mp3</flashmp3><br>
+
Download files for viewing in the Praat application: Sound: [[File:PSTA07.mp3]], TextGrid: [[File:PSTA07.txt]]
+
<br>
+
----
+
<br>
+
Sentences 8 to 10:<br>Speaker dialect: Eastern Norway
+
  
<Phrase>10910</Phrase>
 
<flashmp3>PSTA08.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA08.mp3]], TextGrid: [[File:PSTA08.txt]]
 
<br>
 
<Phrase>10911</Phrase>
 
<flashmp3>PSTA09.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA09.mp3]], TextGrid: [[File:PSTA09.txt]]
 
<br>
 
<Phrase>10912</Phrase>
 
<flashmp3>PSTA10.mp3</flashmp3><br>
 
Download files for viewing in the Praat application: Sound: [[File:PSTA10.mp3]], TextGrid: [[File:PSTA10.txt]]
 
<br>
 
----
 
<br>
 
 
'''About the TextGrid files:'''
 
  

Revision as of 18:46, 18 March 2010

Test sentences

(taken from the "Sound to Sense" project)

Sentences 1 to 3:
Speaker dialect: Bergen

Jeg ser bildet, kan du si, litt på skrått ned, ovenifra.
“I see the picture, say, somewhat diagonally downwards, from above.”
Jeg | e | 1SG | PN
ser | se:r | seePRES | V
bildet | bilde | pictureDEFSG | N
kan | kan: | canPRES | V
du | ʉ | 2SG | CL
si | si: | sayINF | V
litt | lit: | a.little | ADVm
på | po | onDIR | PREP
skrått | skro:t | diagonalADJ>ADV | ADVm
ned | ned | downDIR | ADVm
ovenifra | ovenifra | from.aboveDIRSRC | ADVm

Download files for viewing in the Praat application: Sound: File:PSTA01.mp3, TextGrid: File:PSTA01.txt
Det dekker omtrent hele det venstre…mest…altså, venstreste kortsiden.
“It covers approximately the whole left…most…that is, the leftest short side.”
Det | de | 3SGNEUT | PN
dekker | dek:er | coverPRES | V
omtrent | umtrent | approximately | ADVm
hele | he:le | wholeDEF | ADJ
det | de | DEFSGNEUT | ART
venstre | venstre | left | ADVm
mest | mest | mostSUP | ADJ
altså | aso | that.isDM | ADVm
venstreste | venstreste | leftSUPMUDEF | ADJ
kortsiden | kortsiden | shortsideDEFSG | N

Download files for viewing in the Praat application: Sound: File:PSTA02.mp3, TextGrid: File:PSTA02.txt
Hun står med ryggen mot veggen opp og ser på han som skal kaste ballen som står utenfor og peker på boksene.
“She's standing with her back up against the wall and looking at him, who is standing outside and about to throw the ball, and pointing towards the boxes.”
Hun | hun | 3SGFEM | PN
står | sto:r | standPRES | V
med | med | withMNR | PREP
ryggen | ryɡ:en | backDEFSG | N
mot | mut | againstDIR | PREP
veggen | veɡ:en | wallDEFSG | N
opp | up | upDIRMU | PREP
og | o | and | CONJC
ser | se:r | seePRES | V
på | po | atDIR | PREP
han | han | 3SGMASC | PN
som | som | | PNrel
skal | skal: | shallPRES | V
kaste | kaste | throwINF | V
ballen | bal:en | ballDEFSG | N
som | som | | PNrel
står | sto:r | standPRES | V
utenfor | ʉtenfor | outside | ADVm
og | o | and | CONJC
peker | pe:ker | pointPRES | V
på | po | atDIR | PREP
boksene | boksene | boxDEFPL | N

Download files for viewing in the Praat application: Sound: File:PSTA03.mp3, TextGrid: File:PSTA03.txt

Parallel Processing of Speech and Text Data - Part 3

About the TextGrid files:

The TextGrid files are opened together with the matching sound files for viewing in the Praat application. The TextGrid files consist of three tiers: 'Word' (rendered in Bokmål orthography), 'Phoneme' (shows underlying segments), and 'Note' (shows surface realisation with IPA symbols, and other notes). Here is a list of glosses used in the 'Note' tier:

Phonology/Phonetics:
BrV = Segment realised with breathy voice
CrV = Segment realised with creaky voice
DV = Underlying voiced segment realised devoiced
EPN = Epenthesis
RD = Reduction of segment (e.g. corner vowel realised as schwa or plosive as fricative).
V = Underlying non-voiced segment realised voiced
Morphophonology/Syntax:
CL = Clitic
Other:
ERR = The speaker errs and corrects himself
HES = (Audible) hesitation from speaker
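
For readers who process the 'Note' tier programmatically, the gloss list above can be kept as a small lookup table. The following is a minimal sketch, not part of the original materials: the names `NOTE_GLOSSES` and `expand_note_glosses` are hypothetical, and it assumes that a note consists of whitespace-separated tokens, which the TextGrid files themselves may not follow.

```python
# Glosses used in the 'Note' tier, copied from the list above.
NOTE_GLOSSES = {
    # Phonology/Phonetics
    "BrV": "Segment realised with breathy voice",
    "CrV": "Segment realised with creaky voice",
    "DV": "Underlying voiced segment realised devoiced",
    "EPN": "Epenthesis",
    "RD": "Reduction of segment",
    "V": "Underlying non-voiced segment realised voiced",
    # Morphophonology/Syntax
    "CL": "Clitic",
    # Other
    "ERR": "The speaker errs and corrects himself",
    "HES": "(Audible) hesitation from speaker",
}


def expand_note_glosses(note):
    """Expand known gloss abbreviations in a note string.

    Assumption (not confirmed by the source): tokens in a note are
    separated by whitespace. Tokens that are not in the gloss list,
    such as IPA symbols, are returned unchanged.
    """
    expanded = []
    for token in note.split():
        description = NOTE_GLOSSES.get(token)
        if description is None:
            expanded.append(token)
        else:
            expanded.append(f"{token} = {description}")
    return expanded
```

Calling `expand_note_glosses("DV CL")` would return the two expanded entries for devoicing and clitic status, while an IPA symbol such as `ʉ` passes through untouched. Note that `V` here is the 'Note' tier gloss for voicing and is unrelated to the part-of-speech label for verbs used in the sentence annotations.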