2006 created by Christian Chiarcos
2006-2008 maintained by Christian Chiarcos, Johannes Bubenzer and Angelika Adam
2010-04-07 updated, 0ld-comments removed, partially proof-read, Christian Chiarcos
Christian Chiarcos, chiarcos@uni-potsdam.de
TODO: syntax
Annotation Model of the morphosyntactic component of the SUSANNE scheme as
applied to the British English SUSANNE corpus (Sampson, 1995), also
covering the simplified SUSANNE tag set as used by the TnT Tagger
(Brants 2000).
<br/>
Brants, T. (2000),
TnT--a statistical part-of-speech tagger,
In Proc. ANLP 2000 <br/>
Sampson, G. (1995), English for the computer:
The SUSANNE corpus and analytic scheme,
Oxford University Press
SUSANNE tags beginning with PP... represent substitutive personal pronouns
SUSANNE tags beginning with V... represent verbs.
Note that the subclassification follows the following EAGLES recommendation:
Since it is impractical, however, given the current capabilities of tagging software,
to resolve automatically the ambiguity of these six morphological functions, it is a
common practice to assign a single value to the base form, or else to assign two values,
one for the finite and one for the non-finite functions. Because of this, the tables below
show two tagsets: one tagset representing the 6 attribute-values above, and a reduced tagset
(`RTags'), which resembles most tagsets so far used for the English language in reducing the
six values to two.
http://www.ilc.cnr.it/EAGLES96/morphsyn/node150.html#SECTION00054000000000000000
SUSANNE tags begining with J... represent predicative adjectives, attributive adjectives and general adjectives.
Tags which begin with VV... are used for non-modal verbs or non-auxiliary verbs.
LexicalVerbPresentParticiple is a class for present participle verbforms.
(Sampson 1995: 118f)
SUSANNE tags beginning with Y represent punctuation, text-structuring symbols, font instructions and pauses.
Common nouns where the meaning of the word is a topographical feature and the noun is used as the head word in a place-name.
(Sampson 1995, p. 94)
XX is the SUSANNE tag for the word "not".
(Sampson 1995, p. 119)
TODO: check linking
SUSANNE tags beginning with ICS apply to prepositions (I) that can also occur as subordinating conjunctions (CS), e.g.,
"after", "before", "since" (ICSt), "considering" (ICS), "save" (ICSx).
This ambiguity is not resolved in SUSANNE.
(Sampson 1995, p.108f.)
SUSANNE tags beginning with V and containing 0 represent base forms of verbs, i.e., infinitive and identical word forms (indicative, imperative).
TODO: check linking
This is used for the indefinite article "a", "an".
(Sampson 1995, p. 105)
SUSANNE tags beginning with R... represent adverbs.
SUSANNE tags beginning with VV... are used for non-modal verbs or non-auxiliary verbs.
LexicalVerbPastForm is a class for past tense verbforms.
(Sampson 1995, p. 118f)
SUSANNE tags beginning with CC represent coordinating conjunctions ("and"/CC, "as"/CC31, "but"/CCB).
(Sampson 1995, p. 105)
FA is the SUSANNE tag used for a suffix which is seperated by a hyphen.
(Sampson 1995, p. 107)
SUSANNE tags beginning with JB... represent attributively used adjectives, e.g., "inner", "interior", "joint".
(Sampson 1995, p. 109)
SUSANNE tags beginning with APP represent attributive pronouns.
This is the negative substitutive pronoun "none".
(Sampson 1995, p. 114)
SUSANNE tags beginning with RRQ represent interrogative or relative adverbs, e.g., "where", "how", "when".
(Sampson 1995, p. 117)
SUSANNE tags beginning with NN represent common nouns and direction nouns.
Also, ND has been included here as it is comparable with temporal situating as in NNa, NNp, NNT....
SUSANNE tags beginning with PPI... represent first person pronouns.
(Sampson 1995, p. 115)
The class Be contains the SUSANNE tags for the forms of "be".
(Sampson 1995, p. 118)
This is the second person personal pronoun "you".
(Sampson 1995, p. 115)
These are general adjectives.
(Sampson 1995, p. 109)
SUSANNE tags beginning with V and containing N (VDN, VHN and VVN) represent past participles.
The tag VHD is used for the past tense form "had".
(Sampson 1995, p. 119)
The tag VBM is used for "am".
(Sampson 1995, p. 118)
FD is used for a distorted word, for spoken language only.
(Sampson 1995, p. 107)
MFn is the tag used for fractons written digitally e.g. "2/3".
(Sampson 1995, p. 111)
The SubstitutiveIndefinitePronoun class subsumes the SUSANNE tags PN1 and PNX1.
The SUSANNE VH0 is used for "have".
(Sampson 1995, p. 119)
SUSANNE tags for forms of "to have".
(Sampson 1995, p. 118)
SUSANNE tags beginning with RT... are temporal adverbs.
(Sampson 1995, p.118)
Present participle of "to do".
Unit nouns, i.e., units of measurement, wether written in full ("inch", "kilogramm") or abbreviated either alphabetically or symbolically, as head of a noun phrase denoting a measured quantity. (Sampson 1995, p. 97)
SUSANNE tag AT pertains to articles and other determiners, i.e., "every", the indefinite article, the negative determiner and the definite article.
(Sampson 1995, p.105)
Tags which begin with VV... are used for non-modal verbs or non-auxiliary verbs.
LexicalVerbThirdPerson is used for the third person form of full verbs.
(Sampson 1995, p. 119)
SUSANNE tags beginning with RP... represent prepositions used as adverb.
(Sampson 1995, p.116f)
The NonPOS class subsumes separated morphological elements, formulae, equations, foreign words etc.
Also, GG (~ FA) and ZZ (single Letter) are added here
FB is the SUSANNE tag used for a prefix which is seperated by a hyphen.
(Sampson 1995, p. 107)
MC is the SUSANNE tag used for cardinal numbers from "zero" upwards, spelled out. This includes "umpteen".
(Sampson 1995, p. 111)
These are substitutive possessive pronouns. The SUSANNE tags begin with PPG... . (G is "genitive", as English possessives are derived from Genitive forms of personal pronouns)
(Sampson 1995, p. 115)
Postmodifying adverbs and adverb-like phrases.
(Sampson 1995, p.115f.)
The SUSANNE tag VD0 is used for "do" as infinitive or as indicative.
(Sampson 1995, p. 119)
SUSANNE tag for "in order" introducing infinitive (cf. TO);
according to http://www.ilc.cnr.it/EAGLES96/morphsyn/node623.html#SECTION000124300000000000000 this is a subordinating conjunction
The PresentParticiple class contains the SUSANNE tags VBG, VDD, VHG and VVG which stand for present participle forms.
VDZ is used for "does".
(Sampson 1995, p. 118)
The class of ModalVerb contains the SUSANNE tags VMd, VMo and VMK.
MD is the SUSANNE tag used for ordinal numbers, whether used as ordinal adjective or adverb or as fraction e.g. "third", "fourth".
(Sampson 1995, p. 111)
The Numeral class subsumes cardinal numbers, ordinal numbers and fraction numbers. The SUSANNE tags begin with M.. .
This used for the past participle of "be".
This is used for "no" as determiner or qualifier.
(Sampson 1995, p. 105)
SUSANNE tags beginning with APPG... are used for attributive possessive pronouns ("her", "his", "its", "my", "our", "your", "their")
(Sampson 1995, p.105)
The SUSANNE tag VMo is used for modals in present form e.g. "can", "dare", "may", "must", "need", "shall", "will".
(Sampson 1995, p. 118)
"o" (Sampson 1995, p. 118) is possibly misspelled for "0".
SUSANNE TagSet (Sampson 1995) augmented with
English examples from the Susanne corpus and from "Morphosyntactic
Phenomena Encoded in Lexicons and Corpora A Common Proposal and
Applications to European Languages EAG---CLWG---MORPHSYN/R
Version of 31st Aug, 1996"
(http://www.ilc.cnr.it/EAGLES96/morphsyn/)
The hierarchical structure is inferred from the
structure of SUSANNE tags: Letters on the left tend
to represent superconcepts, letters to the right define
their subclassification. There are, however, some
deviations to this rule, e.g., APPGf for "her" as
a determiner, where
the A is a prefix ("attributive")
attached to a tag of the PPG family ("possessive
pronoun")
The FO class subsumes formulas, mathematical expressions, adresses etc..
(Sampson 1995, p. 107f.)
PNX1 is the SUSANNE tag used for the singular indefinite reflexive "oneself".
(Sampson 1995, p. 115)
SUSANNE tag used for "every" (Sampson 1995, p. 105), according to http://www.ilc.cnr.it/EAGLES96/morphsyn/node454.html#SECTION00084100000000000000, an indefinite determiner
The SUSANNE tag VB0 is used for "be".
(Sampson 1995, p. 119)
The SUSANNE tag VBG is used for the present participle of "be", i.e., "being".
(Sampson 1995, p. 118)
The SUSANNE tag VHN is used for the past participle form "had".
(Sampson 1995, p. 119)
SUSANNE tags beginning with V and containing D,d or K represent past tense verbs (VBD, VHD, VMd, VMK and VVD).
These are lexical verbs.
These are apposition-introducing elements e.g. "for example", "for instance", "namely".
(Sampson 1995, p.116)
The SUSANNE tag VHG is used for "having".
(Sampson 1995, p. 119)
SUSANNE tags beginning with RR... are general adverbs.
(Sampson 1995, p.117)
The class DoType contains the SUSANNE tags for the forms of "do".
(Sampson 1995, p. 118)
The SUSANNE tag VBZ is used for "is".
(Sampson 1995, p. 118)
These are the reflexive personal pronouns.
Their tags begin with PPX.. .
(Sampson 1995, p. 115)
The SUSANNE tag ND1 is used for direction e.g. "north", "N", "southeast".
(Sampson 1995, p. 111)
Common nouns of style and title. An S term is a status-indicating item which either accompanies one or more individual names within the full title of a person or is used to addres a person of appropriate status, or both. (Sampson 1995, p. 95, 112-113)
The VDN tag describes "done".
(Sampson 1995, p. 118)
This is a single-word comparative adverb other than "more" and "less".
(Sampson 1995, p. 117)
The SUSANNE tag VHZ is used for "has".
(Sampson 1995, p. 119)
This is a superlative adverb other than "most" and "least".
(Sampson 1995, p. 117)
Past forms of "be".
SUSANNE tags beginning with N... represent nouns (proper nouns and common nouns).
The Conjunction class tags conjunctions and subsumes the classes BTO, coordinating conjunctions, subordinating conjunctions and pre-coordinators.
The SUSANNE tags begin with C... . for logical reasons, pre-coordinator (LE) is added here.
Demonstratives
These are predicatively used adjectives ("akin", "alike", "asleep").
(Sampson 1995, p. 109)
The class QuantifyingPronoun subsumes quantifying determiners or pronouns ("all", "both", "half").
(Sampson 1995, p. 106)
SUSANNE tags begin with DA... represent quantifiers ("few", "little", "less", "many", "several").
(Sampson 1995, p. 106)
SUSANNE tags beginning with RL... represent adverbs of place or direction.
(Sampson 1995, p.116)
The SubstitutiveRelativePronoun class contains the SUSANNE tags PNQOr and PNQSr.
These are interjections tagged UH in SUSANNE.
In the CHRISTINE corpus replaced with a range of different U-tags.
ZZ is used for letters and letter names of the (roman or other) alphabet.
(Sampson 1995, p. 120)
Following the EAGLES recommendations, the Unique class subsumes different 'particles'.
These are isolate forms, uninflectable, which have not been included anwhere else
(see http://www.ilc.cnr.it/EAGLES96/morphsyn/node700.html#SECTION000153100000000000000 for a list).
Includes "to" (+ Infin.) (UI), negative "not", "n't" (UN), and existential "there" (UX)
SUSANNE tags beginning with CS... represent subcoordinating conjunctions ("although", "how", "if").
(Sampson 1995, p.106)
The tags BTO and TO are also included here, according to van Valin and Lapolla (1997), English "to" (+infinitive) is a subordinating
conjunction.
VDD is used for "did".
(Sampson 1995, p. 118)
SUSANNE tags beginning with I... represent prepositions, but also PrepositionOrSubordinatingConjunction.
The SUSANNE tag LE applies to pre-coordinators, i.e., the first element of paired coordination markers (cf. http://www.ilc.cnr.it/EAGLES96/morphsyn/node622.html#SECTION000124200000000000000, §3.71).
In the Susanne Corpus "pre-co-ordinator" means the first part of paired co-ordinating markers, e.g. in "both... and", "neither... nor".
(Sampson 1995, p.110)
The SubstitutiveInterrogativePronoun class contains the SUSANNE tags PNQOq and PNQSq.
SUSANNE tags beginning with P... represent substitutive pronouns (EAGLES "pronouns" as opposed to "determiner").
These are interrogative or relative pronouns in attributive use.
(Sampson 1995, p. 107)
SUSANNE tags beginningwith VVN represent past participles of lexical verbs.
SUSANNE tags beginning with NP... apply to several subtypes of proper nouns.
(Sampson 1995, p. 113f.)
In SUSANNE terminology, a "qualifier" is an adverb-modifying adjective or adverb.
Tags beginning with RG... apply to qualifiers which having no other adverbial use.
(Sampson 1995, p.116)
SUSANNE tag beginning with FW... apply to foreign words which are not capable of being allocated a more specific tag by reference to their English context.
(Sampson 1995, p. 108)
These are nouns for time.
(Sampson 1995, p. 113)
The SUSANNE tag VMK is used for "ought" and "used" as modal catenatives.
(Sampson 1995, p. 118)
SUSANNE tags which begin with VV... are used for non-modal/non-auxiliar verbs.
(Sampson 1995, p. 118f)
added by Johannes Bubenzer (09.09.2006) for a TnT tag used in a data sample from
Collaborative Research Center (SFB) 538 "Multilingualism" (Hamburg), project K4
The SUSANNE tag VBR is used for "are".
(Sampson 1995, p. 118)
SUSANNE tags beginning with D... form a very heterogeneous class of expressions include quantifiers, DB and determiners.
According to the examples, the prototypical forms which are collected here can be used as determiners. Some have ambiguity with pronouns.
The Present class contains the SUSANNE tags VBM, VBZ, VDZ, VHZ and VVZ.
SUSANNE V...-tags which contain "M" represent first person singular of "be", those which contain "Z" represent third person singular verbs.
Other present verbs are tagged as BaseForm.
This is the definite article "the".
(Sampson 1995, p. 105)
These are third person personal pronouns.
(Sampson 1995, p. 115)
Organization nouns are common nouns that denote is an organization and that are used as the head word in the name of organizations of that kind. (Sampson 1995, p. 94)
SUSANNE tags which begin with PNQ... represent SubstitutiveWHPronouns. This class contains PNQV ("whosever", "whomever", "whoever"), SubstitutiveInterrogativePronoun and SubstitutiveRelativePronoun.
EX is the SUSANNE tag used for existential "there".
(Sampson 1995, p. 107)
The SUSANNE tag VMd is used for modals in past form e.g. "could", "might", "should", "would".
(Sampson 1995, p. 118)
"d" in the tag name is possibly misspelled for "D".
SUSANNE tag beginning with DD... represent a very heterogeneous group including demonstratives, quantifiers and wh-determiner.
(Sampson 1995, p. 106)
GG is the SUSANNE tag used for the Germanic genitive inflection +´s, or + ´ after a plural stem and certain other stems ending in -s.
(Sampson 1995, p. 108)
Germanic genitive 's is modelled as NonPOS here because of its diachronic origin as a morphological element, cf. FA. (Chiarcos)
Alternatively, 's could be treated as postposition (http://www.ilc.cnr.it/EAGLES96/morphsyn/node56.html#SECTION00044000000000000000,
http://www.ilc.cnr.it/EAGLES96/morphsyn/node586.html#SECTION000114100000000000000)
DA2R is the SUSANNE tag used for "fewer".
DA2R
pos
pos
MDo
MDo is the tag for "first", "second".
VVDi is used for the past tense form of intransitive verbs.
VVDi
pos
DA2 is the SUSANNE tag used for "few" and "many".
DA2
pos
pos
MDt
MDt is used for "next", "last".
AT1e
AT1e is used for "every".
pos
pos
NNU1n
NNU1n is used for unit nouns which can be used in countable singular or as uncountable mass noun e.g. "metre".
NP1f
pos
NP1f tags a feminine forename.
ICSt
pos
ISCt describes "after", "before", "ere", "since", "until", "til" as prepositions (with a complement) or subordinating conjunctions.
NPM1
NPM1 is the SUSANNE tag used for a month name, e.g. "October"
pos
pos
PPGm is used for "his".
PPGm
DD1a
DD1a is used for "that" as determiner, demonstrative pronoun, or qualifier (e.g. that slowly).
pos
NNU1c is the tag for unit countable nouns in singular e.g. "inch", "kilogram".
NNU1c
pos
pos
FA is the SUSANNE tag used for a suffix which is seperated by a hyphen.
FA
CCB is used for "but" as coordinating conjunction.
CCB
pos
pos
These are the plural forms of the reflexive personal pronouns.
PPX2
pos
AT
The SUSANNE tag AT is used for "the".
DA1
pos
DA1 is used for "much", "little".
PPGy is used for "yours"in singular or plural.
NPM
pos
APPGi1
pos
APPGi1 is the tag for "my" as possessive.
BTO
BTO stands for " in order" and occurs within a infinitival clause as a left sister of an infinitival verb/ verb group.
(Sampson 1995, p.105, 268)
pos
NP2f
pos
NP2f is used for plural feminine forenames.
RGQV
RGQV is the tag for "however", "no matter how" as qualifier.
pos
pos
PN1 is used for e.g. "anybody", "anyone", "anything", "everybody", "everything", "naught", "nobody", "no one", "nothing".
(Sampson 1995, p. 114)
PN1
pos
NP2j is the SUSANNE tag used for plural organization names, e.g. "ICP's", "Unilevers".
NP2j
pos
NNO
MF
pos
MF is used for fraction numerals.
VVG
pos
VVG is used for present participle verbforms.
DD2
DD2 is used for "a few", "a good few", "a good many", "a good great".
pos
PPH
pos
These are third person pronouns.
NNSA is used for an item of style or title fllowing name, with or without a countable singular use, e.g. "Jr", "Sr".
pos
NNSA
These are adverbs.
pos
RA
RRQV is used for wh-...-ever adverbs, e.g. "however", "no matter how", "whenever", "wherever", "wheresoever".
RRQV
pos
MCs
MCs is the desciption for integers written digitally with a leading zero e.g. "007".
pos
NNT1h
pos
NNT1h is the tag for holiday or season, e.g. "Christmas", "Boxing Day".
NPD1 is the SUSANNE tag used for a day of the week e.g. "Monday".
pos
NPD1
NP1m NP1f tags a masculine forename.
pos
NP1m
MCr
pos
MCr is used for roman numerals from " I " upwards.
pos
DA2q is used for "several".
DA2q
EX
pos
EX is the SUSANNE tag used for the existential "there".
pos
PPIO2 is the SUSANNE tag used for "us".
PPIO2
DAr is used for "former"/ " latter" in all uses.
DAr
pos
PPHO1m for "him".
pos
PPHO1m
VVNi used for the past participle of intransitive verbs
pos
VVNi
DDQ is the SUSANNE tag used for "what".
pos
DDQ
pos
RAq
RAq is the SUSANNE tag used for "apiece", distributive uses for "each".
NNmm stands for the plural of NNm abbreviation, e.g. "Figs.", "Nos.".
NNmm
pos
RRR
RRR is used for a single-word comparative adverb other than "more", e.g. " better", "closer", "deeper", "earlier" and so on.
pos
"Whose" without a specification of its use.
pos
DDQG
DBh is used for "half" as determiner or pronoun.
pos
DBh
NNSS
pos
NNSS is used for style or title-only adjectival items in plural, e.g. "Messrs", "Mmes".
VBD
VBD is used for the past forms of be.
pos
MC2n is used for the plural of cardinals written digitally.
MC2n
pos
PPGf
PPGf is used for "hers".
pos
pos
DAg
DAg is used for "own" as part of a genitive construction.
pos
JBy
JBy is used for "only" as an adjective like in the example "the only thing".
YC is the SUSANNE tag used for comma.
pos
YC
FD is the SUSANNE tag used for a distorted word. It is only used in speech analysis.
pos
FD
NNLb are descriptive heads of place-name nouns with tendency to precede the specific name "Costa".
pos
NNLb
NNJ
pos
NNJ is used for a noun which appears only as a desciptive head of an organisation name, e.g. "Corp", "Bros". Singular or plural is possible.
pos
LE
LE is used for "both", "not only" as pre-co-ordinator.
YBL describes the beginning of heading: <bmajhd> <bminhd>.
YBL
pos
pos
PPX1h
PPX1h tags "itself".
VH0
The SUSANNE tag VH0 is used for "have".
pos
pos
NPM is the tag for an abbreviated month name, e.g. "Oct".
NPM
CSN is used for "than" in all uses.
CSN
pos
pos
RRg is used for "long" as adverb.
RRg
PPX1f
PPX1f is the SUSANNE tag used for "herself".
pos
DDo
DDo is used for "a lot".
pos
pos
This is the time of the day written digitally, e.g. "10:30", "10.30".
NNa
PPGi1
pos
PPGi1 is used for "mine".
PNQV is used for substitutive wh pronouns without a specification of case.
PNQV
pos
pos
YPR is the SUSANNE tag used for the closing bracket ")".
YPR
DB2
DB2 is used for "both" as determiner or pronoun.
pos
This is a noun which appears as a countable singular and is a desciptive head of an organisation name
NNJ1c
pos
YBR
pos
YBR tags the end of heading <emajhd> <eminhd>.
IIb
IIb is used for "by".
pos
PNQOr
pos
This tag is used for the relative "whom".
(Sampson 1995, p. 115)
RRe
pos
RRe is the tag for "enough" as clause adverb.
The SUSANNE tag VBM is used for "am".
VBM
pos
pos
CSg
CSg is used for "though" as subordinating conjunction.
pos
ICSk
ICSk is the tag for "like" as preposition, subordinating conjunction, or in a relative use.
UH is the SUSANNE tag used for an interjection e.g. "blimey", "hello", "please", "well", "yes".
UH
pos
pos
RLh is used for "here", "there" as adverbs of place
RLh
MC1n is the tag for 1 written digitally, including uses as ordinal e.g. "February 1".
MC1n
pos
FOx
pos
FOx is the tag for an algebraic expression with nominal as opposed to equative function e.g. "a", "pi".
DDQGr for "whose" in relative uses.
pos
DDQGr
RLw is used for "somewhere", "someplace", "anywhere", "anyplace", "everywhere", "nowhere" are adverbs of place or direction.
RLw
pos
YPL
YPL is the SUSANNE tag used for the opening bracket "(".
pos
AT1 is used for the indefinite article "a", "an".
pos
AT1
RTo
RTo is used for "now".
pos
NP2z is used for the pluralized form of a 'code name' functioning as countable noun, e.g. "1-11´s".
NP2z
pos
pos
NNOc is used for e.g. "dozen", "score", "gross" etc. .
NNOc
VBR
VBR is used for "are".
pos
added by Johannes Bubenzer (2006-09-09) according to a TnT-tagged
dataset provided by SFB 538 (Hamburg), project K4.
NNL1 is used for topographical nouns in singular.
pos
NNL1
The SUSANNE tag VMd is used for modals in past form e.g. "could", "might", "should", "would".
pos
VMd
pos
NNu is the tag for plural mass nouns e.g. "data", "measles".
NNu
NP1g
NP1g is used for miscellaneous singular geographical proper names, e.g. "Adriatic", "America", "Sherwood".
pos
The SUSANNE tag VHD is used for the past tense form "had".
VHD
pos
pos
APPGi2
APPGi2 is used for "our".
DD1t
DD1t is used for "a little".
pos
RAe
pos
RAe is the tag for "else" in all uses.
YX
YX is the SUSANNE tag used for the exclamation mark "!".
pos
NP1z is used for a 'code name', an acronym or arbitrary alphanumeric expression functioning as countable noun.
NP1z
pos
PPX2h
PPX2h is the SUSANNE tag used for "themselves", "each other" or "one another".
pos
PPI
pos
PPI is used for substitutive first person pronouns.
added by Johannes Bubenzer (2006-09-09) in accordance to a
TnT-tagged data set provided by SFB 538 (Hamburg), project K4
NNT1 is sued for temporal nouns in singular.
NNT1
pos
DD1 is used for determiner in singular.
added by Johannes Bubenzer (2006-09-09) in accordance to a
TnT-tagged data set provided by SFB 538 (Hamburg), project K4
pos
DD1
pos
The SUSANNE tag VHG is used for "having".
VHG
pos
These are wh-adverbs.
RRQ
DD1q
pos
DD1q is used for "another", "each", "one and the same" as determiner or pronoun.
YR is the SUSANNE tag used for an interruption point. It is only used in speech analysis.
YR
pos
pos
YG
YG is the SUSANNE tag used for a logical position of a transformationally moved/deleted item.
VVDt
pos
The past tense form of transitive verbs is tagged with VVDt.
pos
CSW is used for "whether" in all uses.
CSW
PPHS1m id used for "he".
PPHS1m
pos
pos
FW
FW is the SUSANNE tag used for a foreign word which is not capable of being allocated a more specific tag by reference to its English context.
pos
The SUSANNE tag VMo is used for modals in present form e.g. "can", "dare", "may", "must", "need", "shall", "will".
VMo
NNux
NNux is used for plural mass nouns in "-ics", e.g. "mechanics".
pos
The SUSANNE tag VHZ is used for "has".
VHZ
pos
RLe
RLe is used for "elsewhere".
pos
pos
VBG
VBG is used for "being".
DDQr
pos
DDQr for "which" in relative uses.
NP1i is the tag for intitials of personal names.
pos
NP1i
RL is the SUSANNE tag used for an adverb of place or direction e.g. "along", "back", "outside".
RL
pos
NP2m tags plural masculine forenames.
pos
NP2m
MCn is used for cardinal numerals written in digits, e.g "February 28".
MCn
pos
RAj is used for postnomianl adjectives e.g. "destinate", "centigrade", "elect", "galore".
RAj
pos
pos
PPH1 is used for "it".
PPH1
pos
DAz
DAz is used for "such".
RTt
RTt is used for "today", "tomorrow", "tonight", "yesterday".
pos
NNp is the tag for the time of a day written digitally in 24-hour notation from 13:00 on, e.g. "13:30".
pos
NNp
These are plural proper names.
NP2
pos
PNQVS
pos
This tag is used for "no matter who", "whoever", "whosever".
RRx stands for "only" as adverb.
pos
RRx
NP2s
pos
NP2s describes plural surnames, as in the "Robinsons".
pos
The VDN tag describes "done".
VDN
pos
The base form of transitive verbs is tagged with VV0t.
VV0t
pos
JA is used for predicatively used adjectives e.g. "ablaze", "alone".
JA
PNX1 is the SUSANNE tag used for the singular indefinite reflexive "oneself".
pos
PNX1
pos
II is the SUSANNE tag used for general prepositions, including prepositional use of words which can function either as preposition or as adverb. e.g. "into", "near", "off", "on", "onto", "opposite", "outside", "over", "past", "through".
(Sampson 1995, p. 109)
II
VVZ is used for the third person singular verbforms.
pos
VVZ
MFn
MFn is the SUSANNE tag used for fractions written digitally e.g. "2/3".
pos
VVGi is used for present participle intransitive verbforms.
pos
VVGi
ICS
ICS is used for "considering", "notwithstanding".
pos
pos
VVNv for the past participle of verbs having transitive and intransitive uses.
VVNv
RT is the SUSANNE tag used for "again", "hereafter", "overnight".
RT
pos
LEe is used for "either" as pre-co-ordinator.
LEe
pos
MDn
MDn are ordinal numbers written digitally e.g. "1 st", "2 nd".
pos
pos
NNUp
NNUp is used for "%", "percent", "per cent".
pos
This tag is used for "no matter whose", "whosever".
PNQVG
CSn
CSn is used for "when" as subordinating conjunction.
pos
YQ is the SUSANNE tag used for the question mark "?".
pos
YQ
NNT1c is for a singular time noun that can head a noun phrase functioning adverbially, e.g. "hour", "day", "evening".
pos
NNT1c
pos
NP
These are proper nouns.
RP
pos
RP is used for "across", "down", "in", "off", "on", "out", "over", "through", "up" in adverberbial uses.
pos
RGQq
RGQq for "how" as qualifier.
ND is used for direction nouns.
ND
pos
DD1i
pos
DD1i is used for "this" in al uses including as qualifier.
pos
DAT
DAT is used for "least" and "most" in all uses.
YND for the colon-dash ":".
YND
pos
FWs
pos
FWs is the tag for a biological Latin species (or lower rank) name e.g. "sapiens officinale".
PNQS is used for "who". This tag is taken from the reduced TnT tagset.
pos
PNQS
RGa
Rga is used for "as" as qualifier. RGb is the tag for "quite" as qualifier or before an article e.g. in "quite a good idea". RGf stands for "too" as qualifier.
pos
pos
NPM2
NPM2 is the SUSANNE tag used for plural of month names, e.g. "Octobers".
pos
DDQV
The SUSANNE tag DDQV describes e.g. "whatever", "whichever", "whichsever", "no matter which".
JB
pos
JB is used for attributively used adjectives e.g. "chief", "entire", "future", "inverse", "major".
pos
NPD2 is the SUSANNE tag used for a day of the week in plural, e.g. "Tuesdays".
NPD2
TO is the SUSANNE tag used for the infinitival "to". It could be analyzed as a sub-type of subordination.
(Sampson 1995, p. 118)
TO
pos
YD is the SUSANNE tag used for dash (em-dash or larger).
YD
pos
pos
VDD is used for "did".
VDD
RAz
pos
RAz tags "or so" as in "fifty or so".
pos
ATn
ATn stands for "no" as determiner or qualifier.
pos
ZZ1 is the SUSANNE tag used for a singular letter of the (roman or other) alphabet.
ZZ1
RPK
pos
RPK is used for "about" in adverbial and catenative use.
pos
NNUc
NNUc is used for unit nouns which can stand in countable singular or in plural, e.g. "hertz", "yen".
VBN
This is the SUSANNE tag used for "been".
pos
JJR
JJRis used for comparative adjectives, including "elder" and "further".
pos
NNL1n is used for descriptive heads of place-name in countable singular or uncountable mass nouns, e.g. "drive", "water", "green".
NNL1n
pos
VDZ is used for "does".
VDZ
pos
PPG
These are substitutive possessive pronouns.
pos
pos
LEn is used for "neither" as pre-co-ordinator.
LEn
NNS
NNS is used for nouns of style or title in singular, e.g. "Mr", "Mrs", "Dr".
pos
YH is the SUSANNE tag used for a hyphen, "-" dash.
YH
pos
pos
NNOn
NNOn is used for abbreviated NNOc words and is ambiguous between singular and plural, e.g. "m" for million(s).
pos
RGAf for "enough" as qualifier.
RGAf
DB
pos
DB is used for "all", "both" or "half" as pronoun or determiner.
JJh
JJh is used for pseudo-adjectives formed by suffixing "-ed" to the last word of a nominal compound.
pos
RAa is the SUSANNE tag used for "ago", "since" as synonym for age.
RAa
pos
pos
RGz for "so" as qualifier.
RGz
VBDR is used for "were".
pos
VBDR
NN1
This is a common noun in singular.
pos
APPGh1 is the tag for "its".
APPGh1
pos
pos
FOc tags a formula or an acronym for chemical substances, molecules or subatomic particles e.g. "TNT", "DDT".
FOc
RAn
pos
RAn is used for "whatever", "whatsever" after a negative or non-assertive nominal head.
YE
pos
YE is the SUSANNE tag used for the ellipsis mark "...".
RLn is used for "downstairs", "upstairs" .
RLn
pos
RRz is used for "so" introducing a main clause or a clause of purpose or result, or as an adverb of manner or degree.
pos
RRz
PPX2i describes "ourselves"/ "ourself".
PPX2i
pos
VMK
pos
The SUSANNE tag VMK is used for "ought used" as modal catenative.
MD is the SUSANNE tag used for ordinal numbers, whether used as ordinal adjective or adverb or as fraction e.g. "third", "fourth".
pos
MD
MCe is for numbers containing a non-decimal separator, other than a time-of-day expression, e.g. "4:11", "4.11"(stands for four years and 11 months), "3.4.93".
pos
MCe
RAi
pos
RAi is the tag for "inst", "ult" etc. .
NNUn
pos
NNUn is used unit nouns which can be used as countable singular, plural or as uncountable mass noun.
NNm
NNm is the tag for a singular abbreviation normally followed by a numeral functioning as a proper name e.g. "Fig.", "No.".
pos
pos
DA
DA is used for quantifiers.
pos
added by Johannes Bubenzer (2006-09-09) in accordance to a
TnT-tagged data set provided by SFB 538 (Hamburg), project K4
VM is used for modal verbforms.
VM
RRy
RRy is used for "any" as qualifier with comparative.
pos
NNL1cb
pos
NNL1cb is used for descriptive heads of place-name in countable singular with tendency to precede the specific name, e.g. "camp", "lake", "mount".
pos
NP2g
NP2g is used for miscellaneous plural geographical proper names, e.g. "Alps", "Americas", "Antilles".
NNT1m
pos
NNT1m is used for a point-of-time noun, which cannot head a noun phrase functioning adverbially, e.g. "noon", "midnight", "midyear".
VVGv describes the present participle form of verbs which have transitive and intransitive uses.
VVGv
pos
pos
FOt is used for a telephone number.
FOt
pos
MC2
MC2 tags the plural of cardinals.
APPGy is used for "your".
APPGy
pos
CSi
pos
CSi is used for "if".
DD1n is used for "neither" as determiner or pronoun.
DD1n
pos
pos
CCn is used for "nor".
CCn
RGf
pos
RGf stands for "too" as qualifier.
CS
pos
These are subordinating conjunctions.
pos
NNS2
NNS2 is used for nouns of style or title in plural, e.g. "doctors".
pos
PPX1m is used for "himself".
PPX1m
pos
FOr is the tag for a road name e.g. "M6 B6480 I-95".
FOr
NNSj is used for style or title-only adjectival items, e.g "Rev", "Very Rev".
NNSj
pos
PPX1y
PPX1y for "yourselves".
pos
pos
NP1t is the tag for a town name.
NP1t
pos
RGb
RGb is the tag for "quite" as qualifier or before an article e.g. in "quite a good idea".
PNQSq
This tag is used for the interrogative "who".
(Sampson 1995, p. 115)
pos
YT
pos
YT is used for bolds.
DD1b is used for "a bit".
DD1b
pos
APPGm is used for "his" except as pronoun.
APPGm
pos
pos
NNU
NNU is used for unit-only nouns in singular, plural, or neutral.
pos
These are interrogative (RRQq) wh- adverbs e.g. "how", "when", "where".
RRQq
pos
CSA is the SUSANNE tag for "as" as subordinating conjunction or as preposition in comparative sense.
CSA
pos
VVN iS used for the past participle verbforms.
VVN
pos
DDf is used for "enough" as pronoun or pre- or postmodifying a noun.
DDf
VVDv
VVDv describes the past tense form of verbs which have transitive and intransitive uses.
pos
pos
NNS1n
NNS1n is used for nouns of style or title in countable singular or uncountable mass noun, e.g. "justice".
DAy
DAy is the tag for "same" / "selfsame".
pos
VVZi is used for the third person singular form of intransitive verbs.
pos
VVZi
pos
NPD is the SUSANNE tag for an abbreviated day of the week, e.g. "Tues".
NPD
This is a countable singular noun, e.g. "clock", "crowd", "Buddhist".
NN1c
pos
NP1p
NP1p is the SUSANNE tag used for 'province' name, name of an US state, an English county etc. .
pos
CCr
pos
CCr is the tag for "or".
pos
RAb
RAb is used for "A.D./ Anno Domini" and for counterparts used in the pre-date position in other calendars.
pos
MC2y
MC2y is used for the plural of year names e.g. "1960s".
This is the tag for "we".
PPIS2
pos
pos
VVD
VVD is used for past tense verbforms.
pos
VDG
This is the SUSANNE tag used for "doing".
CSr
pos
CSr is used for "where" as subordinating conjunction.
pos
JBT is the SUSANNE tag used for e.g. "utmost", "uttermost".
JBT
PPHS1f
PPHS1f describes "she".
pos
pos
YIL
YIL is the SUSANNE tag used for left quotes, opening (single or double) inverted commas.
MCb is used for 'labels' for cross-reference within and between texts, comprising combinations of digits, letters and/ or non-alphanumeric characters.
vMCb
pos
pos
GG
GG is the SUSANNE tag used for the Germanic genitive inflection +´s, or + ´ after a plural stem and certain other stems ending in -s.
NNU2
pos
NNU2 is the tag for unit nouns in plural, e.g. "inches".
NN1u
pos
This is a uncountable/ mass noun, e.g. "snow", "Buddhism".
pos
VV0i is used for the base form of intransitive verbs.
VV0i
RRQr
RRQr is used for relative wh- adverbs (having overt antecedent - not "fused" relative) e.g. "when", "where", "whereat", "whereby".
pos
pos
DA2T is used for "fewest".
DA2T
VVGK
VVGK is used for "going" as catenative (in BE "going to").
pos
These are base form of verbs.
VV0
pos
pos
PN1o
PN1o is the SUSANNE tag usedfor the singular indefinite pronoun ("one").
(Sampson 1995, p. 114)
ICSx
ISCx is the SUSANNE tag used for "but", "except", "save" as prepositions or subordinating conjunctions.
pos
pos
NN1ux
This is a uncountable/ mass noun whch is ending on "-ics", e.g. "athletics", "thermodynamics".
RAy
pos
RAy describes "B.C." and counterparts following dates in other calendars.
pos
FOs
FOs is the tag for registrtation/ reference/ serial/ model number.
YM is the SUSANNE tag used for a filled pause. Its only used in speech analysis.
YM
pos
PPX
pos
PPX are reflexive personal pronouns.
NNL2 is used for descriptive heads of place-name in plural, e.g. "buildings", "montains".
NNL2
pos
These are singular proper names.
pos
NP1
PPGh2
PPGh2 is used for "theirs".
pos
CSk
pos
CSk is used for "as if", "as though".
DD2a is used for "those".
DD2a
pos
RRT is used for a superlative adverb other than "most", e.g. "best", "brightest", "closest", "earliest", "fastest" and so on.
RRT
pos
PPHS2 is used for "they".
pos
PPHS2
NN1n
pos
This is a noun which can be used in a countable singular or as uncountable/ mass noun.
JBo is used for "other" in all uses.
JBo
pos
RTn is used for "then".
pos
RTn
PN1z is used for "so" as a pro-form, for instance in the "do so"- construction.
(Sampson 1995, p. 114)
pos
PN1z
JJj
JJj is the tag for abbreviated adjectives appended to organization names to identify it legal status e.g. "Ltd Inc Pty" and counterparts in other jurisdictions.
pos
pos
This tag is used for "no matter whom", "whomever".
PNQVO
DAR
pos
DAR is used for "less" or "more".
CC is used for coordinating conjunctions "and", "and/or", "as well as", "plus"
pos
CC
VVZv describes the third person singular form of verbs which have transitive and intransitive uses.
VVZv
pos
YTL
pos
YTL is used for a begin bold.
MC2r decribes the plural of roman numerals.
MC2r
pos
RRs describes "otherwise", "yet".
RRs
pos
FOqc
FOqc is the tag fora chemical equation, when it is analysed as a single word.
pos
NNJ1 is used for organization nouns in singular.
added by Johannes Bubenzer (2006-09-09) in accordance to a
TnT-tagged data set provided by SFB 538 (Hamburg), project K4
pos
NNJ1
ND1 is used for direction e.g. "north", "N", "southeast".
pos
ND1
NP2p is the SUSANNE tag used for plural province names, e.g. "Carolinas".
pos
NP2p
MCy
MCy stands for the year name written digitally (full or with apostrophe) e.g. "1987", " `99".
pos
NNL1c
pos
NNL1c is used for descriptive heads of place-name in countable singular, e.g. "road", "island".
PPHO2
pos
PPHO2 is used for "them".
pos
The SUSANNE tag VD0 is used for "do" as infinitive or as indicative.
VD0
IIp is used for "per".
pos
IIp
NNU1 is used for unit nouns in singular.
NNU1
pos
PN
PN is the SUSANNE tag used for the negative substitutive pronoun "none" in all uses.
(Sampson 1995, p. 114)
pos
pos
PPIS1
This is the tag for ' I ' as personal pronoun.
DD2i
DD2i is used for "these".
pos
RAh is used "am" (for 'ante meridiem'), "pm" o`clock.
RAh
pos
FWg is used for a biological Latin name of genus or other rank higherthan species e.g. "Equus Umbelliferae".
FWg
pos
pos
APPG is used for attributive possessive pronouns.
APPG
NP2c stands for the plural of country names, e.g. "the Philippines".
NP2c
pos
pos
VV0v
VV0v is used for the base form of verbs which have transitive and intransitive uses.
pos
IIg is the tag for "aged" as a pseudo-preposition.
IIg
APPGf
APPGf is used for "her" as possessive.
pos
JBR
JBR is used for e.g. "inner", "lesser", "nether", "outer", "upper".
pos
PPX2y
pos
PPX2y tags "yourselves".
XX
XX-Tag
pos
XX is the SUSANNE tag used for "not".
pos
RGQ
RGQ is used for question qualifiers.
NNLc
pos
NNLc is used for descriptive heads of place-name in countable singular or plural, e.g. "barracks", "links".
pos
PPGi2 is used "ours".
PPGi2
pos
The SUSANNE tag VHN is used for the past participle form "had".
VHN
IF is the SUSANNE tag used for "for" as preposition.
(Sampson 1995, p. 109)
pos
IF
RG is the SUSANNE tag used for a qualifier which having no other adverbial use e.g. "very", "jolly", "mighty", "stark".
RG
pos
RGA is the SUSANNE tag for "indeed" as qualifier.
pos
RGA
pos
DD1e is used for "either" as determiner or pronoun.
DD1e
DBa
pos
DBa is used for "all" as determiner or pronoun.
pos
PPX1i is used for "myself".
PPX1i
RGr
RGr is used for "rather" as qualifier or before an article. RGz for "so" as qualifier.
pos
This tag is used for "you" in singular or plural.
PPY
pos
pos
YP
YP is the SUSANNE tag used for the silent pause. It is used only in speech analysis.
pos
DD is used for "yon", "yonder" as determiner, "somesuch", "the rest".
DD
pos
NP2x are miscellaneous plural proper names.
NP2x
CST is the tag for "that" as subordinating conjunction, including in its use in introducing relative clauses.
CST
pos
The SUSANNE tag VBZ is used for "is".
pos
VBZ
IIt is used for "to" as preposition.
IIt
pos
pos
RAp is used for "per annum", "pa", "per diem" etc. .
RAp
pos
YIR is the SUSANNE tag used for closing (single or double) inverted commas.
YIR
These are the singular forms of the reflexive personal pronouns.
pos
PPX1
NNUb is used for a unit symbol which precedes numerals, e.g. "$".
NNUb
pos
pos
THis is an attributive common noun, e.g. "scissor", "trouser"
NNb
MCd is the tag for numerals including decimal points.
pos
MCd
VBDZ is used for "was".
pos
VBDZ
JJT
JJT is used for superlative adjectives, including "eldest" and "furthest".
pos
pos
NP1s
NP1s is used for surnames.
PNQSr
pos
This tag is used for the relative "who".
(Sampson 1995, p. 115)
This is a common noun.
pos
NN
ZZ is used for letters of the (roman or other) alphabet.
ZZ
pos
MC1 is used for "one" as numeral.
pos
MC1
JJs is used for adjectives in -most orther than "utmost/ uttermost" e.g. "backmost", "inmost", "lowermost", "midmost".
JJs
pos
MC is the SUSANNE tag used for cardinal numbers from "zero" upwards, spelled out. This includes "umpteen".
pos
MC
RAh
DDQq for "which" in interrogative uses.
pos
pos
ZZ2 which is used for a letter of the alphabet with plural inflexion.
ZZ2
This is a common noun in plural.
NN2
pos
JJ
pos
JJ is used for general adjectives e.g. "blue", "English".
NNS1c
pos
NNS1c is used for nouns of style or title in countable singular "doctor", "miss", "madam".
pos
REX
REX is the SUSANNE tag used for an apposition-introducing element e.g. "for example", "for instance", "namely".
This is a noun which beginns with "mid-" other than time noun, e.g. "midstream", "midfield".
pos
NN1m
pos
FO is the SUSANNE tag used for a determinate formula.
FO
pos
VB0 is used for "be".
VB0
VVNt used for the past participle of transitive verbs.
pos
VVNt
This tag is used for the interrogative "whom".
(Sampson 1995, p. 115)
pos
PNQOq
pos
NP1j stands for organization names, e.g. "Unilever", "Kiwanis".
NP1j
pos
IO
IO is the SUSANNE tag used for "of".
(Sampson 1995, p. 109)
DDy
pos
DDy is used for "any" as determiner or pronoun.
FOqx is used for a algebraic equation, when it is analysed as a single word.
pos
FOqx
pos
This is a noun which can appear in a countable singular or in plural, e.g. "sheep", "species", "people".
NNc
VVGt
The present participle form of transitive verbs is tagged with VVGt.
pos
FB
pos
FB is the SUSANNE tag used for a prefix which is seperated by a hyphen.
RRf
RRf is used for "far" as adverb.
pos
IIa
IIa is the SUSANNE tag used for "as".
pos
NNJ2
NNJ2 is used for plural descriptive heads of an organisation name, e.g. "associates".
pos
YF
pos
YF is the SUSANNE tag used for the full stop, American "period", ".".
pos
RAx
RAx is used for a mathematical postfix operator e.g. "!" for factorial.
MCo
pos
MCo is the tag for "0" written as digit.
pos
NNL is used for descriptive head of place-name items in singular or plural, e.g. "Rd", "Is".
NNL
pos
The third person singular form of transitive verbs is tagged with VVZt.
VVZt
pos
FOp
FOp is used for a London postal district, the British post-code, teh American 'Zip-code' e.g. "W.C.2", "LA6 3AN".
NP1x
NP1x is used for miscellaneous singular proper name, e.g. "Parthenon", "Persil".
pos
NNT2 stands for the plural of NNT1.. nouns.
NNT2
pos
pos
YS
YS is the SUSANNE tag used for a semicolon.
pos
APPGh2 for "their".
APPGh2
RAc stands for a cocordinationg element e.g. "etc", "f" or "ff" (means 'following').
RAc
pos
YTR
pos
YTR is used for an end bold.
pos
DDQGq is the tag for "whose" in interrogative uses.
DDQGq
YB
pos
YB is the SUSANNE tag used for text devisions of a paragraph or a higher rank which do not have a heading: <majbrk> <minbrk>.
PPHS1
PPHS1 ia used for subjective third person pronouns in singular.
pos
IIx
IIx is used for mathematical infix operators e.g. "+", ">", "=", etc. .
pos
NNJ1n is used for nouns which can be uncountable ("mass"), a countable singular and a descriptive head of an organisation name, e.g. "company", "organization".
pos
NNJ1n
NP1c
NP1c is the used special tag for country names.
pos
PPHO1f is the SUSANNE tag used for "her" as pronoun.
PPHO1f
pos
PPIO1
pos
PPIO1 is used for "me".
IW
pos
IW is the SUSANNE tag used for "with" in all uses, "without" as preposition, "what with".
(Sampson 1995, p. 109)
YO
YO is the SUSANNE tag used for bullet, pilcrom, or other special symbol visually marking a text division.
pos
pos
CSf
CSf is used for "for" as conjunction.
DDi is used for "some" as determiner or pronoun.
DDi
pos
pos
NNn
NNn is used for nouns which can be uncountable ("mass"), countable singular or plural, e.g "fish" (a fish, three fish, a piece of fish), "Chinese (a Chinese, three Chinese, Chinese is tonal)" .
RGi
pos
The SUSANNE tag RGi is used for "about", "around", "circa", "getting on for" etc..
pos
PPHO1 ia used for objective third person pronouns in singular.
PPHO1
RR is used for general adverbs.
RR
pos
added by Johannes Bubenzer (2006-09-09) in accordance to a
TnT-tagged data set provided by SFB 538 (Hamburg), project K4
pos
NNS1
NNS1 is used for title nouns in singular.
NP2t is used for plural town names.
NP2t
pos
YN
pos
YN is the SUSANNE tag used for colon.