Tags used for Morphisto morphological analyses, extracted from the analysis of the Potsdam Commentary Corpus.
10/02/01 created, Christian Chiarcos, chiarcos@uni-potsdam.de
Tags of the form "V (Inf|Imp|PPast|PPres) ..."
quantifiers (INDEF) trigger strong or weak inflection of adjectives, I assume that the tags ... oD and ... mD refer to this phenomenon.
The tag VPRE ... marks various particles, adverbs, prepositions and some
verbs. Possibly, this tag specifies a displaced verbal particle as in
"[lieb]_VPRE V gewinnen", "Er sprang [ab]_VPRE", "Er stand [da]_VPRE Adv".
However, even with this hypothesis in mind, I can't think of something
reasonable to make out of "darauf" being tagged as VPRE ProAdv.
I assume that the tag originally referred specifically to prepositions that occur frequently in this role, hence the -PRE element.
Apparently certain lexical classes are distinguished. I've found Simp and Old, but there might be more. As long as it is uncertain, whether these are to be represented as subclasses of Noun or as MorphologicalFeature, there is no property to express this relation.
demonstrative pronouns and demonstrative determiners, tagged DEM...
Personal and reflexive pronouns that can occur as substitive pronouns only.
Tagged PPRO ...
Possessive pronouns and possessive determiners, tagged POSS
tagged IP ...
indefinite pronouns and indefinite determiners, including quantifiers, tagged INDEF...
Represented in Morphisto only indirectly by the lack of Pred or Adv.
1
Morphological features identified by Morphisto (as accessible under http://ingrid.sub.uni-goettingen.de/cgi-bin/analyze.cgi): everything following the POS tag, e.g. "Masc Nom Sg" in "Test +NN Masc Nom Sg".
Note that these features are not identical to the richer feature set used directly in the lexicon.
PPRO refl
word classes that share the properties SyntacticPronounType or PronounType
POS tag V
Tags of the form "V (1|2|3) (Sg|Pl) (Past|Pres) ...", for tense and aspect see properties.
tagged PTKL ...
tagged KONJ ...
word classes that share the reeignCase property
PPRO pers
tag ITJ
1
1
Note that for hasTagContaining, hasTagEndingWith, etc., I assume a representation of alternative analyses beginning with the POS tag and connected by a pipe (|). So, for every hasTagStartingWith('X'), there should exist the corresponding property hasTagContaining('|X'), etc.
"NNMasc Nom Sg|NN Masc Dat Sg|NN Masc Akk Sg"
Parts of speech produced by Morphisto (as accessible under http://ingrid.sub.uni-goettingen.de/cgi-bin/analyze.cgi), the POS tag is the first uninterrupted character sequence after +, e.g., NN in "Test +NN Masc Nom Sg"
Elements tagged with ...ADV or adv
Relative pronouns and relative determiners, tagged REL
word classes that share the properties AdjectiveType and AdjectiveInflectionType, i.e., ADJ and ORD.
Interrogative pronouns and determiners, tagged WPRO
Word classes that share Gender, Case and Number as the only obligatory properties. Tags starting with N...
V 3
|V 3
third person
|IP links
left delimiters of parentheses and quotations: ', (, -, /
IP links
3
3
3|
|IP
IP
punctuation in general
1
1|
1
conjunction in general
|KONJ
KONJ
ORD
|ORD
ordinal numbers
2
2|
2
past participle
|V PPast
V PPast
PTKL Adj
|PTKL Adj
particles that express the degree of comparison of adjectives, e.g., allzu, Am, zu
examples: all, alle, allem, allen, aller, Alles, andere, anderen, anderer, anderes, beide, beiden, bisschen, jede, jeden, jeder, jedes, meiste, meisten, paar, solch, solche, solchen, solcher, viele, vielen, vieler, wenig, Wenige, wenigen, weniger, wenigsten
mD|
mD
|KONJ Inf
KONJ Inf
conjunction with infinitive
ADJ Adv
|ADJ Adv
This tag is used for "lieben".
Alternative analyses include the normal analyses as verb or adjective: "ADJ Pos Neut Dat Sg Sw/Mix|V 1 Pl Pres Ind|V 3 Pl Pres Konj|V Inf|ADJ Pos Neut Gen Sg|ADJ Pos Masc Gen Sg|ADJ Pos Fem Gen Sg Sw/Mix|ADJ Pos NoGend Akk Pl Sw/Mix|ADJ Pos Masc Akk Sg|V 1 Pl Pres Konj|ADJ Pos Fem Dat Sg Sw/Mix|VPRE V|V 3 Pl Pres Ind|ADJ Pos NoGend Nom Pl Sw/Mix|ADJ Pos Masc Dat Sg Sw/Mix|ADJ Pos NoGend Dat Pl|ADJ Pos NoGend Gen Pl Sw/Mix"
VPRE V
|VPRE V
Masc
Masc|
Masc
NN
|NN
Subj
Subj|
Konj
Konj|
Konj
Subj
Apparently, the guesser uses this tag instead of normal Konj if the word is not found in the lexicon.
Fem
Fem|
Fem
|POSS
POSS
VPRE ProAdv
this tag is used for some pronominal adverbs, e.g., "darauf". Alternative analyses include PROADV
|VPRE ProAdv
PREP
|PREP
adverb derived from an interrogative stem, including, but not restricted to pronominal adverbs, e.g., Wann, warum, weshalb, Wieso, wo, wobei, Wofür, woher, wohin, wonach, worin, wovon, Wozu
|WADV
WADV
"proper" adjectives
|ADJ
ADJ
imperative
V Imp
|V Imp
PTKL Ant
particles that represent answers to questions or requests, e.g., Bitte, bitteschön, ja, Nein
|PTKL Ant
V
|V
|V|
V
|PTKL Neg
particles that express negation
PTKL Neg
V 1
first person
|V 1
ART Def
|ART Def
DEM
|DEM
|WPRO
WPRO
|REL
REL
conjunctions that express comparison or similarity, e.g., "als", "wie"
KONJ Vgl
|KONJ Vgl
IP rechts
|IP rechts
right delimiters of parentheses or quotations: )
The tag VPRE alone marks all occurrencies of "ab", possibly in its role as displaced verbal particle, see comments on VerbalParticle.
VPRE
VPRE|
|VPRE|
PROADV
non-interrogative pronominal adverbs, also including adverbs derived from Indo-European pronominal stems, e.g., bevor, Dabei, dadurch, dafür, dagegen, daher, dahin, dahinter, Damit, Danach, Daran, darauf, Daraufhin, daraus, darin, darum, darüber, Davon, davor, dazu, Dazwischen, Deshalb, deswegen, drauf, drin, Drum, flugs, gewissermaÃYen, her, herum, heut, indessen, rum, Seitdem, soeben, Trotzdem, Willens, zuerst, zurzeit, zurück
|PROADV
used for plural, where gender differentiation is not expressed
NoGend
NoGend|
NoGend
Neut
Neut
Neut|
oD|
examples: Einige, einigem, einigen, einiger, einiges, ersteres, etliche, etwas, irgendein, jeglichen, kein, Keine, keinem, keinen, keiner, lauter, Letzterer, manch, manche, Manchem, manchen, mancher, Manches, mehr, Mehrere, mehreren, reichlich, viel, Welche, welchem, welchen, Welcher, weniger
oD
|ART Indef
ART Indef
Dat
Dat
Dat|
comma
IP Komma
|IP Komma
Nom
Nom
Nom|
PPRO refl
reflexive pronoun
refl
refl
Acc
Akk
Acc
Acc|
Akk
Akk|
interjection, e.g., "aha"
|INTJ
INTJ
|KONJ Sub
KONJ Sub
subordinating conjunctions, e.g., da, ehe, falls, indem, Nachdem, Obschon, obwohl, sobald, sofern, Solange, Weil, wenn, wenngleich
|PTKL
PTKL
particle
Pl
Pl|
Pl
St
St|
St
Comp
Comp
Comp|
this tag is used for "da", alternative analyses include " KONJ Sub" and "ADV"
|VPRE Adv
VPRE Adv
Gen
Gen|
Gen
|ADV
non-derived adverbs
ADV
NE
|NE
Note that for hasTagContaining, hasTagEndingWith, etc., I assume a representation of alternative analyses beginning with the POS tag and connected by a pipe (|). So, for every hasTagStartingWith('X'), there should exist the corresponding property hasTagContaining('|X'), etc.
"NNMasc Nom Sg|NN Masc Dat Sg|NN Masc Akk Sg"
|KONJ Kon
coordinating conjunctions ???
examples: beziehungsweise, dass, desto, doch, Entweder, indessen, oder, sondern, Sowas, sowie, Sowohl, und, weder,
KONJ Kon
Pos
Pos|
Pos
Ind
Ind|
Ind
second person
V 2
|V 2
pers
personal pronouns
pers
PPRO pers
Pres
Pres|
Pres
|POSTP
POSTP
Sw/Mix
Sw/Mix|
Sw/Mix
|IP Norm
sentence-level punctuation: !, ., :, ?
IP Norm
Simp
Simp
Simp|
present participle
V PPres
|V PPres
Old
Old|
Old
INDEF
|INDEF
personal and reflexive pronouns
|PPRO
PPRO
|ART
ART
Sup
Sup|
Sup
PREP/ART
|PREP/ART
preposition fused with definite article
Past
Past|
Past
St/Mix
St/Mix
St/Mix|
|NPROP
NPROP
Sw
Sw|
Wk
Wk|
Sw
Wk
V Inf
|V Inf
infinitive
Infinitive with particle "zu"
V Inf zu
|V Inf zu
"zu" as a particle
|PTKL zu
PTKL zu
prfl
prfl
forms that can be either personal pronouns or reflexive pronouns
PPRO prfl
ADJ Pred
Pred
Sg
Sg|
Sg