2007 created on the basis of data samples (no other documentation available)
2010-04-06 updated
Christian Chiarcos, chiarcos@uni-potsdam.de
Annotation scheme for part of speech annotation used by Serge Sharoff's
TreeTagger module, cf. Sharoff et al. (2008).<br/>
Sharoff, S. and Kopotev, M.
and Erjavec, T. and Feldman, A. and Divjak, D. (2008), Designing and evaluating Russian
tagsets. In Proceedings of the 6th International Conference on Language Resources and
Evaluation (LREC 2008), Marrakech, Morocco, May 2008
These are conjunctions,
e.g. и ("and"), чтобы ("that").
These are adverbs, e.g.
сгоряча ("in a temper"), очень ("very much").
These are expressions used
as a predicative, thus are non-inflected, e.g. жаль ("it is a pity"), хорошо ("well"), пора
("it is time").
These are punctuation marks
marking sentence ending.
This is an idiomatic phrase
marking the beginning of a new turn in a dialog, or a new topic in a discourse, e.g. кстати
("by the way"), по-моему ("in my opinion").
These are verbs, e.g.
пользоваться ("to use"), обрабатывать ("to process").
These are adverbal
pronouns, e.g. где ("where; where is"), вот ("there; there is").
pos
These are adjectives, e.g.
коричневый ("brown"), таинственный ("mysterious").
These are initials, e.g.
(M., P.).
These are adjectival
pronouns, e.g. который ("which"), твой ("your").
These are ordinal numbers,
e.g. один ("one"), седьмой ("the seventh"), восьмидесятый ("the eightieth").
These are predicatives
pronouns, e.g. некого ("noone"), нечего ("nothing").
These are interjections,
e.g. увы ("alas"), Ай ("Aye"), Ой ("Oh"), Ах ("Oh"), Ох ("Oh"), Ух ("Uh").
Theses are nouns, e.g.
яблоня ("an apple tree"), лошадь ("a horse"), корпус ("the case"), вечность
("eternity").
These are numerals, e.g.
четыре ("four"), десять ("ten"), много ("it is a lot of").
These are personal
pronouns, e.g. она ("she"), что ("what").
These are punctuation
marks.
These are prepositions,
e.g. под ("under"), напротив("opposite").
These are particles, e.g.
бы, же, пусть.
ADV
PART
INIT
A-PRO
INTJ
PARENTH
A
ADV-PRO
V
PRAEDIC-PRO
PRAEDIC
SENT
PRAEDIC-PRO
CONJ
PUNCT
S
NUM
S-PRO
A-NUM