Typecraft v2.5
Jump to: navigation, search

Difference between revisions of "Documenting Lule Sami"

 
(88 intermediate revisions by 4 users not shown)
Line 1: Line 1:
Documenting Lule Sami is a pilot study for the manual, in-depth annotation of Lule Sami text which is conducted at the [http://www.ntnu.no/hf Department for Language and Communication Studies] at the Norwegian University of Science and Technology. The project started in Mai 2008 and will end in November 2008. [[A Pilot Study in Documenting Lule Sami]] tells you more about the project itself. [[About Lule Sami]] gives a short introduction to the Lule Sami people and their language. Here you also find links to other relevant sites about the Sami and their language.
+
Documenting Lule Sami is a pilot study for the in-depth manual annotation of written Lule Sami (for more information about the project itself see [[Annotation_of_representative_texts_from_Lule_Sami_-_An_NTNU_project]]). The study was conducted by [[User:Dorothee|Dorothee Beermann]] (see also [[About TypeCraft]]) at the Norwegian University of Science and Technology [http://www.ntnu.no/]. The project started in Mai 2008 and ended in December 2008. Under [[About Lule Sami]] you find a short introduction to the Lule Sami language; there you also find references to other relevant links about the Sami and their language.
  
==ADJECTIVES==
+
Lule Sami is a morphologically rich, highly inflected and very often fusional language which makes its in-depth morpho-syntactic annotation an interesting, yet at the same time a difficult and very time-consuming task. None of the texts we have collected has been annotated before, and since Lule Sami is, with fewer than 1,500 speakers in Norway and Sweden, of which only a few speak Lule Sami as their first language, one of the highly endangered languages in Europe, it is important that efforts in documenting Lule Sami are made now.  
'''Adjectives''' are '''nouns''' in Lule Saami. It is in most cases impossible to say that an adjective is not a noun and vice versa.
+
  
What makes the adjectives to '''adjectives''' - and not pure nouns - is that they can be '''compared''' and that they are used '''attributive'''.
+
Below we discuss some of the issues that were raised during the annotation process. 
 +
[[Image:The_inner_room.jpg|thumbnail|250px|right]]
  
<Phrase>4203</Phrase>
 
Here vuora-s [vuorrasa] [-s is Adj mark/derivation] gets case inflection - just as a noun does.  (Kristin)
 
 
  
 +
---------------------------------------------------------
 +
-----------------
 +
--------------
  
We need gloss tags '''ATT''' for attributive form and '''PRED''' for predicative form of the adjectives. Some forms are equal in both forms - then perhaps, it is sufficient to mark it only with ''ADJ'' pos.  (Kristin)
+
====Annotating Lule Sami - Questions and some answers====
  
==DERIVATIONS==
 
  
===Verbal Aspectual derivations===
+
=====Categories and Functions=====
  
====INCHOATIVE====
+
'''ADJ and N'''
<Phrase>4785</Phrase>
+
'''oaddá'''-t er '''inchoative''' of '''oade'''-t.
+
  
The derivation is seen as a fortification of the consonant cluster and lengthening of the last vowel in the stem.
+
Anders Kintel writes:
  
'''Inchoative''' verbs express that the doing or the state '''is starting'''.  (Kristin)
+
'''"Vi gjør oppmerksom på at de fleste adjektiv i samisk kan også fungere som substantiv og også motsatt, derfor står det ikke alltid en markering bak ordet som tilsier at dette er et adjektiv eller et substantiv".'''
  
 +
Here a free translation:
  
===V > N===
+
''We would like to draw attention to the fact that most adjectives in Sami can function as nouns, as well as nouns can function as adjectives; therefore not always a specification'' [n. or adj.] '' is given after the word that expresses that this is an adjective or a noun.''
I think it is NOT enough to annotate V > N, V > V, etc.
+
  
We should mark every derivation with what kind of fx N derivation it is. I have written something about it under.
+
Reference:
 +
Kintel A. ''Lulesamisk-norsk''. Ajluokta /Drag, Biehtsemanon 2005. Unpublished manuscript.
  
gähttjalibmáj gæhttjat+V+TV+Der1+Der/l+V+Actio+Der2+Der/ibme+N+Sg+Ill
 
<Phrase>3066</Phrase>
 
Hva slags '''NOMZ''' - er det interessant? Kalles '''Actio''' i den samiske grammatikken.
 
  
Avledningen '''-li-''' i '''gæhttja-li-t''' er SUB-FREQ 
+
An example of how we have annotated de-adjectival and other derivational lexemes comes from one of the texts from the TypeCraft database.
  
We have '''HANDLERnomen''':
+
In the example below we annotate ''vuorra'' meaning ''old'' on the POS tier as '''N''':
+
Den som utfører handlingen - som kalles '''?''' og forkortes '''?'''.
+
 
+
og '''HANDLINGsnomen'''
+
+
- Betegner selve handlingen: '''tjállem''' - skrivning - som kalles '''?''' og forkortes '''?'''
+
 
   
 
   
- Betegner redskap, midlet til å utføre handlingen med: '''gåjvun''' - skuffel - som kalles '''?''' og forkortes '''?'''
 
  
- Gjenstand for handlingen: '''gåbtjås''' - dekke - som kalles '''?''' og forkortes '''?'''
+
<Phrase>4203</Phrase>
 +
In the case of ''vuorro'' we in fact do find derivational morphology. The -s in vuorra-s [vuorrasa] marks the noun as derived. The -s is followed by case inflection. Clearly, the function of ''vuorra'' is that of a noun, and accordingly it has been inflected for case.
  
- Resultatet (produktet) av handlingen: '''tjála''' - skriv - som heter '''?''' og forkortes '''?'''
+
In general we will assign POS categories according to the function that a word has in a given context. On the glossing tier we will in addition indicate the word's derivation. For example in cases where the suffix -s reflects nominalization, we will provide the ADJ-> N tag in the gloss line.
 +
  
- Vær- og føreforhold, eller stedet hvor handlingen skjer: '''jådådahka''' - godt føre ; '''tjuoladahka''' - sted hvor en har hugget ved - som heter '''?''' og forkortes '''?'''  (Kristin)
+
Notice that we use the gloss tags '''ATT''' for attributive forms and '''PRED''' for predicative forms of the adjective. Some forms are equal morphologically, in this case we will only use the POS tier to indicate the adjective status.
  
  
===V > Adj===
+
'''V > N'''
<Phrase>4763</Phrase>
+
(Kristin): I think it is NOT enough to annotate V > N, V > V, etc. We should mark every derivation with what kind of N derivation it is. I have written something about it below:
'''oahpásmuvvat''': '''oahpás-''' is an '''ADJ''' ('oahpás-' in compounds, 'oahpes' (ATT) otherwise)
+
  
It should be possible to note somewhere that the verb(s) is derived from an ADJ.  
+
In the phrase below the word for ''temptation'' is derived from the verb 'watch/look'', which in LS is ''gähttjalibmáj''. When we decompose this word we get: '''gæhttjat+V+TV+Der1+Der/l+V+Actio+Der2+Der/ibme+N+Sg+Ill'''
  
There are words thats start out as verbs > is derived into nouns > is again derived into another verb...
+
Consider the following phrase:
  
Maybe there should be one more level for derivations only?
+
<Phrase>3066</Phrase>
The translation is of no help: while '''oahpás-''' is an ADJ in ATT form, '''known''' is a V in PERF.PART form. Translation gives us only sketchy semantics.  (Kristin)
+
In descriptive Sami grammars the nominalizer ''li'' is called '''Actio'''. The nominalizer seems to be internally complex:
 +
'''-li-''' i '''gæhttja-li-t''' = subjunctive-FREQ
  
 +
(Dorothee) At this point it is not clear which subtypes of nominalizing suffixes we should distinguish. Should we for example introduce NMLZ.actio and other subtypes of nominalizer. How useful would that be?
 +
Perhaps we should wait until we have a clearer overview over which categories are needed, and try to use the tags of the type V>N etc. for the time being.
  
==GERUND==
 
  
===Gerund I===
+
'''V > Adj but how about ADJ->V'''
Is expressing: while..., at the same time as... something happens at the same time as the doing the predicate verb is expressing.
+
  
sån oaddá-j bårå-'''dijn''' = he fell asleep while eating
+
In the phrase below we need the tag ADJ->V
 
+
'''-dijn''' - used after the last vowel of the week stem of a pair-syllabic verb
+
 
+
Ex: jåhte-t -> jåde'''-dijn''' (= while moving)
+
 
+
- used after the last vowel of the stem of a contracted verb
+
 
+
Ex: tjieggi-t -> tjieggi'''-dijn''' (= while travelling)
+
 
+
...(a)'''-ttjin''' - used after the last vowel of an unpair-syllabic verb (the last stem vowel changes to '-a')
+
 
+
Ex: tjåhkani-t -> tjåhkana'''-ttjin''' (= while assembling)  (Krsitin)
+
 
+
 
+
===Gerund II===
+
Is expressing: someone is doing something or a doing is going on, keeps on, or somethin has started but is not finished. Is used with AUX '''liehke-t''' (= to be). Compare: 'He is reading'.
+
 
+
sån '''la''' låhkå'''-min''' = he is reading
+
 
+
'''-min''' - used after the last vowel of the strong stem of a pair-syllabic verb
+
 
+
Ex: sån '''la''' goarro'''-min''' (= she is sewing)
+
 
+
- used after the last vowel of a contract verb:
+
 
+
Ex: sån '''la''' guolli'''-min''' (= she is fishing)
+
 
+
''' -me''' - used after the last vowel of an unpair-syllabic verb
+
 
+
Ex: sån '''la''' malesti'''-me''' (= he is cooking)  (all examples from Spiik) (Kristin)
+
 
+
 
+
==NEGATIONAL V== 
+
Og så må vi ha en POS for Negasjons-verbet: '''VNEG'''?  (Kristin)
+
 
+
 
+
==NORVAGISMS==
+
 
<Phrase>4763</Phrase>
 
<Phrase>4763</Phrase>
'''oahpásmuvvat doarromuseajn''' - oahpásmuvvat takes '''ILL''', but here is used '''COMIT'''.
+
'''oahpásmuvvat''': '''oahpás-''' is an '''ADJ''' ('oahpás-' in compounds, 'oahpes' (ATT) otherwise)
+
- '''muvva-t''' (with '''ILL''': to become what the word says: get to know (people and concrete things), get customed to, get experience with, get familiar with. 
+
  
(- '''tuvva-t''' (with '''COMIT'''): to become what the word says: learn to know)(Kristin)
+
(Kristin): It should be possible to note somewhere that verb(s) can be derived from adjectives. Maybe there should be one more level for derivations only? Also, the translation is of no help: while '''oahpás-''' is an ADJ in ATT form, '''known''' is a V in PERF.PART form. Translation gives us only sketchy semantics.   
  
 +
(Dorothee): '''UPS''' yes, we need the tag ADJ->V
  
==NOTES==
 
I remember there was a place where it was possible to write comments on the grammar in the sentence analyzed. There should be such a place. (Kristin)
 
  
 +
'''PRON.POSS vs. PRON'''
  
==NOUNS==
 
'''DEM''' are nouns,
 
'''QUANT''' are also nouns,
 
'''NUMB''' are nouns too,
 
 
They can all be a free noun. (Krsitin)
 
 
 
==PRON.POSS vs. PRON==
 
 
<Phrase>3039</Phrase>
 
<Phrase>3039</Phrase>
I would like to use only PRON when it is used Attributive, but PRON.POSS when it is used Predicative.  
+
Above is a nominal construction where the possessive pronoun follows the noun.
What do you think about that? (Kristin)
+
Possessive pronouns may also precede the noun.
  
 +
Question:
 +
Are both syntactic pattern in free distribution?
 +
Is one of the two constructions preferred? So do we find one of the constructions more often in our texts?
  
==VERBAL FINITE FORMS==
+
Why really should we use only PRON when the possessive is used attributive, but PRON.POSS when it is used as modifier?.
  
===Imperative===
+
=====VERBAL FORMS=====
'''IMP.1''' brukes som en direkte ordre.
+
======more verbal tags...======
'''IMP.2''' brukes vanligvis om sterkt ønske og forslag, særlig i 1P og 3P (omtrent som konjunktiv i romanske språk)
+
While annotating verb forms in Lule Sami we noticed that TypeCraft did not provide all the tags we
(inf: Arnhild) (Kristin)
+
needed. In the following we exemplify some of the verb forms, and discuss the right use of tags.
  
'''* comment 1    IMP2'''
+
Please see the updated list of gloss tags [[Special:TypeCraft/GlossTags/]] ([[User:Dorothee|Dorothee]] 15:51, 15 December 2008 (CET))
Also Conjunctive is used for IMP2. This may be a more correct term. (Kristin)
+
  
==VERBAL INFINITE FORMS==
+
'''GERUND'''
 +
we need two tags for two distinct gerunds:
  
===Supinum===
+
'''Gerund I'''
  
====''ANNOTATION''====
+
expresses: 'while'..., 'at the same time as'... Gerund I expresses the partial overlap of two events.
<Phrase>3575</Phrase>
+
====-tji====
+
How should one annotate the suffix '''-tji''' in the above sentence.
+
Kristin suggests to use 'supinum'. I am not so sure that is is right. As far as I know the supinum
+
is one of the infinite forms if LS next to the infinitive, the gerund, the participle forms and others.
+
But is '''-tji''' the marker of an infinite form?  (Dorothee)
+
=====* comment 1  -tji-t or -tjit=====
+
It is possible to look at '''-t''' as the infinitive marker and 'supinum' is the '''-tji-''' pluss the infinitive marker.
+
  
May be it is better to say that 'supinum' is '''tjit''' and an infinite marker.  (Kristin)
+
sån oaddá-j bårå-'''dijn'''       = he fell asleep while eating
  
====''USE''====
+
jåhte-t -> jåde'''-dijn'''       = while moving
Ex: "Dån la má smidá '''váttsá-tjit'''!" - "You are clever '''at walking'''!"  (Arnhild/Kristin)
+
====''? ? ?''====
+
Is it possible to say that '''supinum'' a derivation? It is mentioned among the '''ordavledninger'''.  (Kristin)
+
  
==VERBAL INFLECTIONS==
+
tjieggi-t -> tjieggi'''-dijn'''  = while traveling
  
===PRES vs. PAST===
+
tjåhkani-t -> tjåhkana'''-ttjin'''= while assembling
  
With '''Toolbox''' it was possible to put in that when a particular infl as '''-v 1SG''' connect to a''' weak stem''', we have '''PRES''', while when the infl '''-v 1SG '''connect with a '''strong stem''', we have '''PAST'''.
+
Note:
 +
'''-dijn''' is used after the last vowel of the week stem of a pair-syllabic verb or after the last vowel of the stem of a contracted verb.
 +
'''-ttjin'''is used after the last vowel of an unpair-syllabic verb (the last stem vowel changes to '-a').
  
'''How''' do we annotate this '''in TypeCraft?'''  
+
'''Gerund II'''
  
(In school they resolve the inflectional problem by adding '''the last vowel of the stem''' to the '''infl'''.)
+
expresses: someone is doing something, or something is going on, or something has started but is not finished.
 +
The Gerund II is build through the use of the auxiliary '''liehke-t''' (to be).  
  
I think we should annotate like this: 
+
sån '''la''' låhkå'''-min''' = he is reading
<Phrase>5745</Phrase>
+
We just say there are two different inflections '''-v''': one for '''PRES''', the other for '''PAST'''.
+
  
There are similar problems for other inflections too, as for the '''PAST''' inflection '''-j'''. For '''3SG''' the '''PAST''' inflection is not followed by any inflection for '''3SG''': '''-Ø'''.
+
'''-min''' - used after the last vowel of the strong stem of a pair-syllabic verb
  
For this case there is an allomorph for '''-j''' that is '''PAST.3SG'''.    (Kristin)
+
Ex: sån '''la''' goarro'''-min''' (= she is sewing)
  
=Translation=
+
- used after the last vowel of a contract verb:
  
==PN PLACES==
+
Ex: sån '''la''' guolli'''-min''' (= she is fishing)
In English (as in other languages too - but not so much in Norwegian..) it is quite normal to give places other names than the name they use in the language of that country.
+
  
Ex: '''München > Munich''' (eng); '''Monaco''' (it) - '''Firenze > Florence''' (eng) - '''København > Copenhagen''' (eng) - '''Köln > Cologne''' - etc. etc. etc.
+
''' -me''' - used after the last vowel of an unpair-syllabic verb
  
In English are used the same names as we use in Norwegian, so there is no reason not to use those names.
+
Ex: sån '''la''' malesti'''-me''' (= he is cooking)  (all examples from Spiik) (Kristin)
  
A compromise is also possible: to use the official names: 
+
'''Imperative'''
  
'''Ájluokta-Drag''';  '''Gásluokta-Kjøpsvik''';  '''Guovdageaidnu-Kautokeino''';  '''Divttasvuodna-Tysfjord''' and so on.
+
Also here we need two distinct tags to distinguish between
  
The reason for double names is that Saami and Norwegian names are equal.
+
'''IMP.1'''  which expresses a direct order.
  
The Saami name is to use when writing in Saami, the Norwegian one when writing in Norwegian.
+
'''IMP.2'''  which expresses a strong wish or suggestion
  
But the places have double names officially.  (Kristin)
 
  
==SÁMI - SAMI - SAAMI==
+
'''INCHOATIVE'''
  
I just talked with the employee at the museum at Arran who has the responsibility for the exibitions, Anne Kalstad Mikkelsen. She has checked '''the spelling''' of '''sáme''' with the Norvegian Sami Parliament. The Sami Parliament has decided that '''sáme''' is to be written '''Sami''' in English. So the museum has to follow this norm.
+
In the gloss tier we need a tag for inchoative verbs. Here an example:
 +
<Phrase>4785</Phrase>
 +
'''oaddá'''-t er '''inchoative''' of '''oade'''-t.
  
So - we should then follow the same norm! (Shouldn't we?)  (Kristin)
+
Done. Look at [[Special:TypeCraft/GlossTags/]] ([[User:Dorothee|Dorothee]] 14:20, 15 December 2008 (CET))
  
=Presentational page=
+
Phonologically inchoatives are marked by a fortification of the consonant cluster and lengthening of the last vowel in the stem.
  
'''bajedihtte'''
+
'''NEGATIVE VERBS'''
  
'''åjvijdihtte'''
+
The tag '''Vneg''' in the POS tier is needed. Please see the list of gloss tags [[Special:TypeCraft/GlossTags/]]
  
'''ja bigodihtte'''
+
-------------------------------------------------
  
'''slehpájdihtte'''
+
=====Supinum=====
  
 +
<Phrase>3575</Phrase>
  
This poem is nice! And the grammar too! Now I leave Trondheim... '''See you on TypeCraft!...''' (Kristin - Oct. 20 18:00]
+
How should one annotate the suffix '''-tji''' in the above sentence.
<Phrase>5749</Phrase>
+
[[User:Kristin]] had suggested to use 'supinum'. The supinum is one of the infinite forms of LS next to the infinitive, the gerund, the participle and possibly others. In the examples above the suffix '''-tji''' was first annotated as supinum and the the -t as an infinitive marker, which did not
==ÁJLUOVTA SKÅVLÅ...==
+
make to much sense. [[User:Kristin]]  then suggested that 'supinum' is '''tjit''' and as such an infinite marker. At present we have an annotation as shown in the example above. 
Here is the printing-friendly page in the Lokalavisa NordSalten:
+
  
http://www.nord-salten.no/nyheter/samisk/ajluovta_skavla_buosjes_oahppe_vas_svierigis_vadtsin#tips
+
The examples below illustrates the supinum
  
It is possible to put it somewhere. (Kristin)
+
Ex: "Dån la má smidá '''váttsá-tjit'''!" - "You are clever '''at walking'''!" 
  
 +
It seems that most instances of infinitives so far occur after modal verbs, yet here seems to be a different case:
  
==ILLUSTRATIONS==
+
<Phrase>3663</Phrase>
Here is a picture from the museum at Árran, from the exibition '''Viessom''': A couple in bout, fishing:
+
How should one really annnotate that one? ([[User:Dorothee|Dorothee]] 16:35, 15 December 2008 (CET))
[[http://arran.custompublish.com/getfile.php/684547.927.fybqeudtpc/1.jpg]] (Kristin)
+
  
==SAAMI PEOPLE==
 
  
===Language/Dialect - Dorothee===
 
Concerning the following sentence and the paragraph that contains it in the main text:
 
  
'''"This long history and the fact that they are usually not mutually intelligible makes them different languages, not different dialects as they are often mistakenly described." '''  
+
'''Derivational or inflectional ??'''
  
It seems that the referent of the demonstrative in the sentence above should be 'Saami languages', but that did not come out well I think. Yet. it would be nice to get an answer to the question, why LS is called a language rather than a dialect of Saami!
+
We would like to mention that the '''supinum''' is characterised as a  derivational suffix in descriptive grammars of LS. It is mentioned among the '''ordavledninger''', the Norwegian word for word derivation.
  
Another thing is that I guess nobody would really claim that Estonian, Finish and Saami are the same language. So one probably does not have to assert that!  (Dorothee)
 
  
====comment 1 - Kristin====
+
----------------------------------------------------
There are dialects that are more different from each other than the Saami languages.
+
'''Strong and weak verb stems'''
There are languages that are 'identical' - just with minor differences.
+
  
I (when I talk about it!) explain Saami to have three main language groups: Eastern Saami Languages group, Central Saami Languages group, and Southern Saami Languages group. Sea Saami, Inari Saami, North Saami, Lule Saami and Pite Saami belongs to the Central Saami Languages group. The Central Saami Languages group have quantity change in common. There are 10 different tounges of Saami - and each of them can be divided in dialectal groups (see Sammallahti). Differrent language groups is one factor that counts for conidering South Saami and East (Skolt) Saami as different 'languages' than the languages belonging in the Central Saami Languages group.
+
Verbs in LS can either have a weak or a strong stem, so for example the verb ''wash''  which has two stem forms
  
These Saami languages are written with different orthography: Latin and cyrillic - and several different solutions at least for the latin alphabets. So in Norway East Saami, North Saami, Lule Saami and South Saami are all written with different alphabet (Pite Saami and Ume Saami I do not know about in modern written form). This situation makes communication between the different Saami groups  difficult. This is a second factor that makes these languages considered 'languages' and not dialects.
+
'''''basá'' and '' bassi'''''
  
Neighbouring Saami dialects are not so different that it is a hindrance for mutual understanding, but the dialects are gradually decreasing in mutual intelligibility. Samis have 80% of their vocabulary in common, but the semantics of the words can differ substantially; then comes differences in syntax, morphology and phonology. Samis talk their own language inside their own group, outside the group the majority language is usually used. Lack of mutual intelligibility is a third factor. (Kristin)
+
the 1P, present tense is expressed as '''basá-v''' while the 1P past tense is '''bassi-v'''.
  
===Older than... - Dorothee===
+
We will use the tag '''WEAK''' and '''STRONG''' to distinguish these two different types of verb stems.
  
It also seems to me that the next statement does not really follow. But if in fact it is true that Saami is an older language than the Germanic and the Romance language then there should be a reference to a known scholar claiming that!!
+
====Grammatical Changes====
 +
LS is changing...
  
BTW, is it true that the Romance language family is half as old as the Germanic, does not quite sound right? We definitely need references if we want to leave this sentence in the text. I would simply opt for omitting it. It is not central to our concern.  (Dorothee)
+
<Phrase>4763</Phrase>
 +
'''oahpásmuvvat doarromuseajn'''
  
====comment 1 - Kristin====
+
According to grammars of LS  - oahpásmuvvat takes '''ILL''', but in the example sentence above it is used with a '''COMIT'''
I do not know of any knowledge that says Saami is 'older' than other languages. This way of thinking about languages is absurd! - Languages change all the time, and at one point they have been through so many quantitative changes (many small changes) that it experiences a change in quality. In Norwegian there were quantitative changes after the Great Plauge which resulted in a quantitative change around 1550: New Norwegian was a fact (the name in opposition to Old Norwegian - or Norse).  
+
case. This leads to a change in meaning:
 +
 +
- '''muvva-t''': used with '''ILL''': the meaning is: get to know (people and concrete things), get accustomed to, get experience with, get familiar with.
  
If we go thousand years back, the Saami languages (or dialect groups) were more near (similiar) than today. I suppose Pekka Sammallahti is one that knows a lot about this. In his book "The Saami Languages" he writes also about diacrony.
+
(- '''tuvva-t''' used with '''COMIT''' the meaning is: learn to know)(Kristin)
  
If we compare today's Saami languages with Norwegian and the languages N. is related too, the older all-Sami language (do not remember what it is called just now) may be compared with Germanic. I can look up when mayor changes have taken place in the history of the Saami languages, - but this will still not 'prove' anything in a discussion of 'how old' Saami languages are. It tells only of when linguists will consider a change to be a change of quality.
+
====Translation of place names====
  
So may be we quit talking about 'old' languages... (Kristin)
+
In English (as in other languages too - but not so much in Norwegian..) it is quite normal to translate proper names, e.g. München > Munich, Firenze > Florence, København > Copenhagen,  etc.
  
===Since praehistoric times... - Kristin===
+
Lule Sami place names have been translated into Norwegian, such as:
'''"Saami has been spoken on the Scandinavian and Kola Peninsulas in Northwest Europe since prehistoric times."''' This is not a correct statement. So it has to go. (Krsitin)
+
  
===Linguists currently recognize 9 living Saami languages - Kristin===
+
'''Ájluokta-Drag''';  '''Gásluokta-Kjøpsvik''';  '''Guovdageaidnu-Kautokeino''';  '''Divttasvuodna-Tysfjord''', etc.
  
"Note that the Sea Saami no longer have an independent language, but have adopted North Saami, Lule Saami or Norwegian."
+
In Norway place names have officially a Sami and a Norwegian name, and the Sami name is used when writing in Sami, while the Norwegian one is used when writing in Norwegian.
  
"no longer have an independent language"? Well, Sea Saami have never "had" an independent language. What does this mean?
+
As for free translation into English this could mean that we either use the Sami name, since we translate from Sami, or that
 +
we use the Norwegian name, since the Norwegian name is also internationally better known.
  
Sea Saami is an independent language which today is talked by old people in Kvænangen and Varanger.
+
'''Which one should it be?'''
  
Sea Saami has not a own written language, but this lacs also Pite Saami and Ume Saami.  (Kristin)
+
====SÁMI - SAMI - SAAMI====
  
=Making TypeCraft better=
+
(Kristin): I just talked with the employee at the museum at Arran who has the responsibility for the exibitions, Anne Kalstad Mikkelsen. She has checked '''the spelling''' of '''sáme''' with the Norvegian Sami Parliament. The Sami Parliament has decided that '''sáme''' is to be written '''Sami''' in English. So the museum has to follow this norm.
  
 +
So - we should then follow the same norm! (Shouldn't we?) 
  
==EDITING...==
+
(Dorothee): Definitely !
===Save===
+
There should be an SAVE icon to push also at the top of the editing window :) [I do not like to have to scroll down to the bottom every time I want to save :( ]. (Kristin)
+
 
+
 
+
==SEARCH FOR...==
+
===Search for Gloss and POS===
+
 
+
It should be possible to search for glosses and poses too.
+
 
+
Fx it be possible to search for, '''PRES''', '''PAST''', '''IMP''', etc. and get to see the different '''paradigms''' and the '''words''' (from the text) in '''citation''' form that belonged into the particular paradigms. (Then there should be possible to put one of the words into the paradigm ('''show this paradigm for ...(word)''' - and the paradigm would show the word for all the paradigm instances that is used in the text- without muliplying when it is used more than once.)
+
 
+
It should also be possible to search for '''spatial''' words or '''temporal''' words and then get all Nspt and ADVspt - or all Ntmp and ADVtmp - in separated lists - the Ns together with the word they are tied to. All alphabetically with the '''citation''' form
+
 
+
Then it should be possible to search for '''VI''' and '''VT''' and the '''citation''' forms should pop up alphabetically.(Kristin)
+
 
+
 
+
==<SENTENCE>==
+
<Phrase>4785</Phrase>
+
Here the '''<:CASE>''' coming ''before'' the '''ILL-SG''' of which it is an explanation.
+
 
+
Better is:    '''ILL.SG <:CASE>''' (Kristin)
+
 
+
 
+
==ALFABETIC PRESENTATION of texts==
+
 
+
Lule Sami '''Árranis''': Viessom: Sáme vanntsabiggárin [museum exibition]>10
+
Lule Sami '''Árran''' - julevsáme guovdásj [homepage] (19 sentences; 0 annotated)
+
 
+
This is not the right way to do it! It has to be like this:
+
'''a'''
+
'''ab'''
+
'''ac'''
+
'''aba'''
+
'''abb'''
+
'''abc'''
+
'''ac'''
+
'''aca''' etc etc
+
 
+
It has to be the opposite way:
+
 
+
Lule Sami '''Árran''' - julevsáme guovdásj [homepage] (19 sentences; 0 annotated)
+
...
+
...
+
Lule Sami '''Árranis''': Viessom: Sáme vanntsabiggárin [museum exibition]>10
+
 
+
This has to be corrected. (Kristin)
+
 
+
=Previous Discussion - Dorothee=
+
 
+
Hi Kristin and Svenn Egil,
+
 
+
This will be our project page!!! (click back to ''article'')
+
 
+
It's purpose is ofc to report on the annotation of Lule Saami texts that we do, but written up in a way that it is interesting to all ppl interested in languages or Lule Saami or both.
+
 
+
NOT JUST LINGUISTS
+
 
+
We should make sure that this becomes a page that has information that cannot be found on the Wikipedia or any other well known web information source.
+
 
+
Moreover, it should be interesting and beautiful information - so not too much text; instead some text made attractive by pictures, music files, cool links, and also with a internal link to your personal
+
page in this TC wiki so that ppl can see who the ppl are that annotate Lule Saami... and so forth.
+
 
+
Here now some links and text snippets that I found interesting in this connection:
+
 
+
http://boreale.konto.itv.se/laante.htm
+
http://boreale.konto.itv.se/samieng1.htm
+
 
+
----------------------------------
+
Bluegreen: LuleSami.
+
Mountain and ForestSami culture in Norway and Sweden, many
+
famous handicrafters and chanters from this group, language is
+
locally quite strong.
+
Separate educational institutions with several textbooks in the
+
language, but not for all subjects teached.
+
A small number of books are published each year in this language.
+
Traditional and present day cultural center: Jokkmokk, Sweden.
+
 
+
--------------------------------
+
Webmasters note:
+
 
+
As a curiosity I'd like to mention that there's one Sami word that has made it into several of the major languages of this world, that word is Tundra -doesn't it speak volumes about which part of the world this is. :)
+
+
-----------------------------------------
+
When Nils-Aslak Valkeapää in his book The Sun—my Father (1991), chooses to create a metaphorical poem of a migrating herd of reindeer and uses [in his poem] some of the wealth of names that exist in Sami to describe the reindeer’s appearance, age and sex, he does so not only to demonstrate the wealth of terminology within the Sami language—he does something beyond that: He plays with the language, conjuring up concepts that have never been used before in that fashion. He conceives, in a sense, new fictional animals by combining familiar words in new ways. And he creates different reindeer which, in terms of their being a part of the herd or outside of it, can easily be viewed as parallels to the artist and his or her position in society, as well as to all human beings in their common experiences of being part of a "flock" or alone.
+
 
+
To this wealth of words can be added a great number of Sami onomatopoetical expressions for sounds pertaining to migration, words for working the herd, for the baying of dogs,and the sounds of a thousand hoofs on frozen ground, for undulating moors over which reindeer horns move, for the sound of bells that, like a blanket of clouds, lift the sky up and give the basis for life in these northern regions. And, as if that isn’t enough, there are allusions to the Sami national anthem, and tracks left behind by the herd, both concrete tracks where it has walked and abstract tracks for us, the readers, to follow back into history. Whether we journey with the herd or only pass by it as we wander, it is impossible for us to survive into the future without the tracks, without nature: The River of Life, the daughter of spring, sap, the mosquito maidens and "the sun/red and warm/moved happiness/ into the morning." Because "nothing remains of us/but a yoik in the singing wind/a dream about being." But even so: "and time does not exist, no end, none/and time is, eternal, always, is," and we are all part of "the life’s circle/infinite/without/beginning/or end/fulfills/changes/colors"…"the horizon’s red dawn/ the starry peaks."
+
 
+
taken from: http://weberstudies.weber.edu/archive/archive%20C%20Vol.%2016.2-18.1/Vol.%2016.2/haraldgaski.html
+
------------------------------------------
+
 
+
At the beginning it might be confusing to edit this wiki, but I can tell you that one learns it rather fast :=)
+
I used the following link to get the information I needed:
+
 
+
http://meta.wikimedia.org/wiki/Help:Editing
+
 
+
===comment 1 - Kristin===
+
We cannot use information that tells us that Lule Sami culture is a mountain and Forest Sami Culture in Norway! This concerns only Samis on the Swedish side of the border.
+
+
On the Norwegian side of the border we find a Costal Sami culture - that differs from Costal Sami Culture further north!
+
 
+
I can write shortly about this.
+

Latest revision as of 21:57, 7 October 2015

Documenting Lule Sami is a pilot study for the in-depth manual annotation of written Lule Sami (for more information about the project itself see Annotation_of_representative_texts_from_Lule_Sami_-_An_NTNU_project). The study was conducted by Dorothee Beermann (see also About TypeCraft) at the Norwegian University of Science and Technology [1]. The project started in Mai 2008 and ended in December 2008. Under About Lule Sami you find a short introduction to the Lule Sami language; there you also find references to other relevant links about the Sami and their language.

Lule Sami is a morphologically rich, highly inflected and very often fusional language which makes its in-depth morpho-syntactic annotation an interesting, yet at the same time a difficult and very time-consuming task. None of the texts we have collected has been annotated before, and since Lule Sami is, with fewer than 1,500 speakers in Norway and Sweden, of which only a few speak Lule Sami as their first language, one of the highly endangered languages in Europe, it is important that efforts in documenting Lule Sami are made now.

Below we discuss some of the issues that were raised during the annotation process.

The inner room.jpg





Annotating Lule Sami - Questions and some answers

Categories and Functions

ADJ and N

Anders Kintel writes:

"Vi gjør oppmerksom på at de fleste adjektiv i samisk kan også fungere som substantiv og også motsatt, derfor står det ikke alltid en markering bak ordet som tilsier at dette er et adjektiv eller et substantiv".

Here a free translation:

We would like to draw attention to the fact that most adjectives in Sami can function as nouns, as well as nouns can function as adjectives; therefore not always a specification [n. or adj.] is given after the word that expresses that this is an adjective or a noun.

Reference: Kintel A. Lulesamisk-norsk. Ajluokta /Drag, Biehtsemanon 2005. Unpublished manuscript.


An example of how we have annotated de-adjectival and other derivational lexemes comes from one of the texts from the TypeCraft database.

In the example below we annotate vuorra meaning old on the POS tier as N:


Várrá vuolgget, guollit ja vuorrasij siegen tjåhkkåhit ja gulldalit gå subtsasti…
“Walk the mountains, go fishing, and sit with the elders listening to their stories... ”
Várrá
várrá
mountainNOMSG
N
vuolgget
vuolgget
goINF
Vitr
guollit
guollit
fishN>VINF
Vitr
ja
ja
and
CONJC
vuorrasij
vuorrasij
old GENPL
N
siegen
siegen
withINESSSG
Nspat
tjåhkkåhit
tjåhkkåhit
sitDURINF
Vitr
ja
ja
and
CONJC
gulldalit
gulldalit
listenDURINF
 
when
COMP
subtsasti…
subtsasti…
taleN>V3PLPRES
Vitr

In the case of vuorro we in fact do find derivational morphology. The -s in vuorra-s [vuorrasa] marks the noun as derived. The -s is followed by case inflection. Clearly, the function of vuorra is that of a noun, and accordingly it has been inflected for case.

In general we will assign POS categories according to the function that a word has in a given context. On the glossing tier we will in addition indicate the word's derivation. For example in cases where the suffix -s reflects nominalization, we will provide the ADJ-> N tag in the gloss line.


Notice that we use the gloss tags ATT for attributive forms and PRED for predicative forms of the adjective. Some forms are equal morphologically, in this case we will only use the POS tier to indicate the adjective status.


V > N (Kristin): I think it is NOT enough to annotate V > N, V > V, etc. We should mark every derivation with what kind of N derivation it is. I have written something about it below:

In the phrase below the word for temptation is derived from the verb 'watch/look, which in LS is gähttjalibmáj. When we decompose this word we get: gæhttjat+V+TV+Der1+Der/l+V+Actio+Der2+Der/ibme+N+Sg+Ill

Consider the following phrase:

Ja ale mijáv gæhttjalibmáj lájddi, ájnat várjjala mijáv bahás.
“And lead us not into temptation, but deliver us from evil: ”
Ja
ja
and
CONJC
ale
ale
notIMP2SG
 
mijáv
mijávv
us1PLACC
PN
gæhttjalibmáj
gæhttjalibmáj
watch/lookDIMFREQV>NILLSG
N
lájddi
lájddi
leadIMP2SG
Vtr
ájnat
ájnat
but
CONJS
várjjala
várjjala
deliverIMP2SG
 
mijáv
mijáv
us1PLACC
PN
bahás
bahás
evilELATSG
N

In descriptive Sami grammars the nominalizer li is called Actio. The nominalizer seems to be internally complex: -li- i gæhttja-li-t = subjunctive-FREQ

(Dorothee) At this point it is not clear which subtypes of nominalizing suffixes we should distinguish. Should we for example introduce NMLZ.actio and other subtypes of nominalizer. How useful would that be? Perhaps we should wait until we have a clearer overview over which categories are needed, and try to use the tags of the type V>N etc. for the time being.


V > Adj but how about ADJ->V

In the phrase below we need the tag ADJ->V

Dá bale bessin oahppe vehi oahpásmuvvat doarromuseajn Narvijkan ja sáme ásadusáj Jåhkåmåhken åvdås vádtsájin.
“ ”
theseNOMPL
DEM
bale
bale
timeGENSG
N
bessin
bessin
be_allowed3PLPAST
Vitr
oahppe
oahppe
pupilNOMPL
N
vehi
vehi
little
ADVm
oahpásmuvvat
oahpásmuvvat
  get_to_beINF
Vtr
doarromuseajn
doarromuseajn
warNOMSGmuseumwithCOMITSG
N
Narvijkan
Narvijkan
NarvikatINESSSG
Np
ja
ja
and
CONJC
sáme
sáme
SaamiGENSG
N
ásadusáj
ásadusáj
arrangemetmedCOMITPL
N
Jåhkåmåhken
Jåhkåmåhken
river_bendatINESSSG
Np
åvdås
åvdås
before
 
vádtsájin
vádtsájin
leave3PLPAST
Vitr

oahpásmuvvat: oahpás- is an ADJ ('oahpás-' in compounds, 'oahpes' (ATT) otherwise)

(Kristin): It should be possible to note somewhere that verb(s) can be derived from adjectives. Maybe there should be one more level for derivations only? Also, the translation is of no help: while oahpás- is an ADJ in ATT form, known is a V in PERF.PART form. Translation gives us only sketchy semantics.

(Dorothee): UPS yes, we need the tag ADJ->V


PRON.POSS vs. PRON

Áhttje mijá guhti le almen.
“Our Father which art in Heaven,”
Áhttje
áhttje
fatherNOMSG
N
mijá
mij
ourGENPL
PN
guhti
guhti
whoNOMSG
PROint
le
le
is3SGPRES
COP
almen
almen
heaveninINESSSG
N

Above is a nominal construction where the possessive pronoun follows the noun. Possessive pronouns may also precede the noun.

Question: Are both syntactic pattern in free distribution? Is one of the two constructions preferred? So do we find one of the constructions more often in our texts?

Why really should we use only PRON when the possessive is used attributive, but PRON.POSS when it is used as modifier?.

VERBAL FORMS
more verbal tags...

While annotating verb forms in Lule Sami we noticed that TypeCraft did not provide all the tags we needed. In the following we exemplify some of the verb forms, and discuss the right use of tags.

Please see the updated list of gloss tags Special:TypeCraft/GlossTags/ (Dorothee 15:51, 15 December 2008 (CET))

GERUND we need two tags for two distinct gerunds:

Gerund I

expresses: 'while'..., 'at the same time as'... Gerund I expresses the partial overlap of two events.

sån oaddá-j bårå-dijn = he fell asleep while eating

jåhte-t -> jåde-dijn = while moving

tjieggi-t -> tjieggi-dijn = while traveling

tjåhkani-t -> tjåhkana-ttjin= while assembling

Note: -dijn is used after the last vowel of the week stem of a pair-syllabic verb or after the last vowel of the stem of a contracted verb. -ttjinis used after the last vowel of an unpair-syllabic verb (the last stem vowel changes to '-a').

Gerund II

expresses: someone is doing something, or something is going on, or something has started but is not finished. The Gerund II is build through the use of the auxiliary liehke-t (to be).

sån la låhkå-min = he is reading

-min - used after the last vowel of the strong stem of a pair-syllabic verb

Ex: sån la goarro-min (= she is sewing)

- used after the last vowel of a contract verb:

Ex: sån la guolli-min (= she is fishing)

-me - used after the last vowel of an unpair-syllabic verb

Ex: sån la malesti-me (= he is cooking) (all examples from Spiik) (Kristin)

Imperative

Also here we need two distinct tags to distinguish between

IMP.1 which expresses a direct order.

IMP.2 which expresses a strong wish or suggestion


INCHOATIVE

In the gloss tier we need a tag for inchoative verbs. Here an example:

Hyhto sisi manájma ja jus riekta de oaddát galgajma, valla ejma ájn ájgo.
“We went inside the cabin and, if doing right, then we would go to sleep, but we did not yet intend to do that.”
Hyhto
hyhto
cabinGENSG
N
sisi
sisi
insideILLSG
Nspat
manájma
manájma
goPAST1PL
Vitr
ja
ja
and
CONJC
jus
jus
if
CONJS
riekta
riekta
right
ADVm
de
de
then
CONJS
oaddát
oaddát
sleepINCEPINF
Vitr
galgajma
galgajma
shallPAST1PL
AUX
valla
valla
but
CONJC
ejma
ejma
notNEGPAST1PL
Vtr
ájn
ájn
yet
ADVtemp
ájgo
ájgo
intendNEG
PTCP

oaddá-t er inchoative of oade-t.

Done. Look at Special:TypeCraft/GlossTags/ (Dorothee 14:20, 15 December 2008 (CET))

Phonologically inchoatives are marked by a fortification of the consonant cluster and lengthening of the last vowel in the stem.

NEGATIVE VERBS

The tag Vneg in the POS tier is needed. Please see the list of gloss tags Special:TypeCraft/GlossTags/


Supinum
Iŋŋgá: Mån dal biejav mállásav duoldatjit.
“Inggá: I now put the dinner to cook.”
Iŋŋga:
Iŋŋgá:
Inggá:NOMSG
Np
Mån
mån
I1SGNOM
PN
dal
dal
now
ADVtemp
biejav
biejav
put1SGPRES
Vtr
mállásav
mállásav
dinnerACCSG
N
duoldatjit
duoldatjit
cookforINF
V


How should one annotate the suffix -tji in the above sentence. User:Kristin had suggested to use 'supinum'. The supinum is one of the infinite forms of LS next to the infinitive, the gerund, the participle and possibly others. In the examples above the suffix -tji was first annotated as supinum and the the -t as an infinitive marker, which did not make to much sense. User:Kristin then suggested that 'supinum' is tjit and as such an infinite marker. At present we have an annotation as shown in the example above.

The examples below illustrates the supinum

Ex: "Dån la má smidá váttsá-tjit!" - "You are clever at walking!"

It seems that most instances of infinitives so far occur after modal verbs, yet here seems to be a different case:

Mån galgav suhkkát sájdev bivdátjit.
“I shall row out to fish pollock.”
Mån
mån
I1SGNOM
PN
galgav
galgav
shall1SGPRES
AUX
suhkkát
suhkkát
rowINF
Vitr
sájdev
sájdev
pollockACCSG
N
bivdátjit
bivdátjit
fishto
 

How should one really annnotate that one? (Dorothee 16:35, 15 December 2008 (CET))


Derivational or inflectional ??

We would like to mention that the supinum is characterised as a derivational suffix in descriptive grammars of LS. It is mentioned among the ordavledninger, the Norwegian word for word derivation.



Strong and weak verb stems

Verbs in LS can either have a weak or a strong stem, so for example the verb wash which has two stem forms

basá and bassi

the 1P, present tense is expressed as basá-v while the 1P past tense is bassi-v.

We will use the tag WEAK and STRONG to distinguish these two different types of verb stems.

Grammatical Changes

LS is changing...

Dá bale bessin oahppe vehi oahpásmuvvat doarromuseajn Narvijkan ja sáme ásadusáj Jåhkåmåhken åvdås vádtsájin.
“ ”
theseNOMPL
DEM
bale
bale
timeGENSG
N
bessin
bessin
be_allowed3PLPAST
Vitr
oahppe
oahppe
pupilNOMPL
N
vehi
vehi
little
ADVm
oahpásmuvvat
oahpásmuvvat
  get_to_beINF
Vtr
doarromuseajn
doarromuseajn
warNOMSGmuseumwithCOMITSG
N
Narvijkan
Narvijkan
NarvikatINESSSG
Np
ja
ja
and
CONJC
sáme
sáme
SaamiGENSG
N
ásadusáj
ásadusáj
arrangemetmedCOMITPL
N
Jåhkåmåhken
Jåhkåmåhken
river_bendatINESSSG
Np
åvdås
åvdås
before
 
vádtsájin
vádtsájin
leave3PLPAST
Vitr

oahpásmuvvat doarromuseajn

According to grammars of LS - oahpásmuvvat takes ILL, but in the example sentence above it is used with a COMIT case. This leads to a change in meaning:

- muvva-t: used with ILL: the meaning is: get to know (people and concrete things), get accustomed to, get experience with, get familiar with.

(- tuvva-t used with COMIT the meaning is: learn to know). (Kristin)

Translation of place names

In English (as in other languages too - but not so much in Norwegian..) it is quite normal to translate proper names, e.g. München > Munich, Firenze > Florence, København > Copenhagen, etc.

Lule Sami place names have been translated into Norwegian, such as:

Ájluokta-Drag; Gásluokta-Kjøpsvik; Guovdageaidnu-Kautokeino; Divttasvuodna-Tysfjord, etc.

In Norway place names have officially a Sami and a Norwegian name, and the Sami name is used when writing in Sami, while the Norwegian one is used when writing in Norwegian.

As for free translation into English this could mean that we either use the Sami name, since we translate from Sami, or that we use the Norwegian name, since the Norwegian name is also internationally better known.

Which one should it be?

SÁMI - SAMI - SAAMI

(Kristin): I just talked with the employee at the museum at Arran who has the responsibility for the exibitions, Anne Kalstad Mikkelsen. She has checked the spelling of sáme with the Norvegian Sami Parliament. The Sami Parliament has decided that sáme is to be written Sami in English. So the museum has to follow this norm.

So - we should then follow the same norm! (Shouldn't we?)

(Dorothee): Definitely !