TODO: integrate syntactic edge labels from http://193.185.105.50/demo/machinese/doc/MS-all-tags.html
annotation model for connexor machinese syntax, English version, currently, only part of speech tags considered
http://193.185.105.50/demo/machinese/doc/enfdg3-tags.html<br/>
2008/03/27 created
2010/01/08 linked with system.owl
2010/01/25 added subclassification of verbs, aspect, gender, revised participle, etc., distinction between hasTag, hasTagContaining etc.
Christian Chiarcos, chiarcos@uni-potsdam.de
1
1
morph
With adverbs, the obligatory tags include 'ADV'. The anglebracket tag again occurs in front of the 'ADV' tag, when relevant.
Features specific to determiners
features specific to Adverb
Also applied for German conjunctive.
With adjectives, the obligatory tags include 'A' and degree of comparison.
This is actually a combination of two annotations: "PRON adj"
verb; used only for finite verbs and infinitives.
With verbs, the obligatory tags include 'V' and one of the following: 'AUXMOD', 'INF', 'IMP', or tense. With tense, person is possible. The subfeature <N+> is placed before the 'V' tag when relevant.
features specific to Verb and subclassification of Verb
Subfeatures of PartOfSpeech tags.
With nouns, the obligatory tags include 'N' and case. In Conexor FDG, the obligatory tags for nouns include 'N', case, and number.
Subfeatures specific to Numeral
With pronouns, the only obligatory tag is 'PRON'. The anglebracket tags occur in front of the 'PRON' tag, when relevant. After the 'PRON' tag come the tags 'PERS', 'RECIPR', 'WH', 'DEM', or degree of comparison, when relevant. Next is the place for case, then number, and last, 'INDEP', when relevant.
With abbreviations, the obligatory tags include 'ABBR' and case. In Conexor FDG, the obligatory tags for abbreviations include 'ABBR', case and number.
With determiners, the only obligatory tag is 'DET'. In Conexor FDG, the obligatory tags for determiners include 'DET' and number. In the ordering of the morphological tags, the number tag is the last one and the case tag second to last, when relevant.
With numerals, the obligatory tags include 'NUM' and either 'CARD' or 'ORD'.
Connexor produces space-separated lists of tags, every tag element assigned by this property represents one token in such a list. With different tag-tokenization, hasTag is to be renamed to "containsTag".
V IMP
V IMP
V IMP
V IMP
imperative e.g. John, <b>come</b> here!
IMP
IMP
V IND
V IND
V IND
V IND
past tense, e.g. <b>were</b>
PAST
PAST
reflexive pronoun, e.g. <b>herself</b>
Refl
Refl
PREP
e.g. <b>of</b>
PREP
SUP
superlative, e.g. <b>biggest</b>, <b>most</b>, <b>furthest</b>
SUP
N+
N+
N+
N+
noun-verb combination, e.g. <b>India's</b> got..
V SUB
V SUB
V SUB
V SUB
P2
P2
P2
2nd person (verbs)
P2
NOM
nominative, e.g. These integrated <b>algorithms</b> are stored on the computer's hard disk., <b>others</b>
NOM
NOM
A
e.g. These integrated algorithms are stored on the computer's <b>hard</b> disk, from which they are downloaded into the DSP board's <b>random</b> access memory (RAM).
A
FEM
FEM
FEM
FEM
e.g. <b>if</b>
CS
CS
CS
CS
SG3
singular, third person, of pronouns, e.g. <b>him</b>
SG3
SG3
SG
SG
singular, e.g. These integrated algorithms are stored on the <b>computer's</b> hard <b>disk</b>, <b> other</b>
EN
e.g. These <b>integrated</b> algorithms are <b>stored</b> on the computer's hard disk, from which they are <b>downloaded</b> into the DSP board's random access memory (RAM).
EN
EN
EN
Dem
demonstrative determiner, e.g. <b>this</b> option
Dem
Dem
Dem
genitive, e.g. These integrated algorithms are stored on the <b>computer's<b> hard disk., <b>other's<b>
GEN
GEN
interrogative pronoun, e.g. <b>why</b>
Interr
Interr
PRON adj
PRON adj
PRON adj
A combination of two annotation values.
PRON adj
V
V
e.g. Others <b>go</b> further and <b>define</b> software to <b>be</b> programs plus documentation though this <b>does</b> not <b>correspond</b> with common usage.
V
NEG-PART
NEG-PART
e.g. are <b>not</b>, are<b>n't</b>
NEG-PART
NEG-PART
V INF
V INF
V INF
VINF
infinitive
INDEP
INDEP
INDEP
INDEP
independent genitive form of pronouns, i.e. the independent genitive form functioning always as head of a noun phrase, e.g. <b>theirs</b>
SG1
singular, first person, of pronouns, e.g. <b>me</b>
SG1
SG1
NUM
NUM
e.g. Software can be split roughly into <b>two</b> main types - system software and application software or programs.
NUM
NUM
INF
infinitive , e.g. would <b>be</b>
INF
INF
INF
e.g. The Berkeley UNIX mechanism for <b>creating</b> a virtual connection between processes.
ING
ING
ING
ING
PRON
PRON
PRON
e.g. <b>Others</b> go further and define software to be programs plus documentation though <b>this</b> does not correspond with common usage.
PRON
absolutive degree of comparison, e.g. <b>big</b>, <b>much</b>, <b>many</b> options, <b>far</b>
ABS
ABS
ABS
ABS
WH
relative or interrogative pronoun beginning with the letters 'wh' or 'how', e.g. <b>which</b>
WH
WH
WH
PROG
PROG
PROG
PROG
PRES
PRES
present tense, e.g. <b>am</b>
PRES
PRES
SUB
subjunctive, e.g. The casket <b>be</b> brought in.
SUB
SUB
SUB
DAT
Dative
DAT
WH
WH
WH
determiner beginning with the letters 'wh' or 'how', e.g. <b>which</b> option
WH
plural, e.g. These integrated <b>algorithms</b> are stored on the computer's hard disk., <b>others</b>
PL
PL
PL
PL
comparative degree of comparison, e.g. <b>bigger</b>, <b>more</b>, <b>further</b>
CMP
CMP
CMP
CMP
singular, third-person verb, e.g. <b>is</b>
SG3
SG3
SG3
SG3
INTERJ
INTERJ
INTERJ
e.g. <b>Hey</b>, so-and-so needs an instruction to do such-and-such.
INTERJ
MSC
MSC
MSC
MSC
singular, first-person verb, e.g. <b>am</b>
SG1
SG1
SG1
SG1
WH
WH
WH
adverb beginning with the letters 'wh' or 'how', e.g. <b>why</b>
WH
<Ex>
Ex
Ex
Ex
Ex
existential <b>there</b>, e.g. <b>There</b> are various models of the software life-cycle, and many methodologies for the different phases.
e.g.
"SODA Manual of Operation", R. C. Brigham and C. G. Bell, School of Elec <b>Eng</b>, <b>U</b> New <b>S</b> Wales, Sydney, <b>NSW</b> (1958)
ABBR
ABBR
ABBR
ABBR
Rel
Rel
Rel
Rel
relative pronoun, e.g. <b>which</b>
CC
e.g. <b>and</b>
CC
PL3
plural, 3rd person pronoun, <b>them</b>
PL3
PL1
PL1
plural, 1st person pronouns, e.g. <b>us</b>
CARD
CARD
cardinal number, e.g. <b>2010</b>
CARD
CARD
demonstrative pronoun, e.g. <b>these</b>
Dem
Dem
Dem
Dem
AUXMOD
AUXMOD
AUXMOD
modal auxiliary, e.g. <b>would</b>
AUXMOD
ordinal number, e.g. <b>first</b>
ORD
ORD
ORD
ORD
Heur
Heur
Heur
Heur
mark for unknown word; occurs in front of a part-of-speech tag, e.g. <b>mechansim</b>
RECIPR
RECIPR
reciprocal pronoun, e.g. <b>each other</b>
RECIPR
RECIPR
N
N
e.g.
These <b>integrated</b> algorithms are stored on the <b>computer's</b> hard <b>disk</b>.
N
P1
P1
1st person (verbs)
P1
P1
participle
V PCP
V PCP
V PCP
V PCP
PERF
PERF
PERF
PERF
DET
DET
e.g. If you're running <b>the</b> mess-dos emulator, control-alt-insert will cause <b>a</b> soft boot of <b>the</b> emulator, while leaving <b>the</b> rest of <b>the</b> system running.
DET
DET
ADV
ADV
e.g. Others go <b>further</b> and define software to be programs plus documentation though this does not correspond with common usage.
ADV
ADV
NEU
NEU
NEU
INFMARK
INFMARK
INFMARK
e.g. <b>to</b> do this <b>in order to</b> do that
INFMARK
P3
P3
P3
P3
ACC
ACC
accusative (of pronouns), e.g. <b>him</b>
ACC
ACC
Pers
Pers
Pers
Pers
personal pronoun, e.g. <b>us</b>