2006 created by Christian Chiarcos 2006-2008 maintained by Christian Chiarcos, Johannes Bubenzer and Angelika Adam 2010-04-07 updated, 0ld-comments removed, partially proof-read, Christian Chiarcos Christian Chiarcos, chiarcos@uni-potsdam.de TODO: syntax Annotation Model of the morphosyntactic component of the SUSANNE scheme as applied to the British English SUSANNE corpus (Sampson, 1995), also covering the simplified SUSANNE tag set as used by the TnT Tagger (Brants 2000). <br/> Brants, T. (2000), TnT--a statistical part-of-speech tagger, In Proc. ANLP 2000 <br/> Sampson, G. (1995), English for the computer: The SUSANNE corpus and analytic scheme, Oxford University Press SUSANNE tags beginning with PP... represent substitutive personal pronouns SUSANNE tags beginning with V... represent verbs. Note that the subclassification follows the following EAGLES recommendation: Since it is impractical, however, given the current capabilities of tagging software, to resolve automatically the ambiguity of these six morphological functions, it is a common practice to assign a single value to the base form, or else to assign two values, one for the finite and one for the non-finite functions. Because of this, the tables below show two tagsets: one tagset representing the 6 attribute-values above, and a reduced tagset (`RTags'), which resembles most tagsets so far used for the English language in reducing the six values to two. http://www.ilc.cnr.it/EAGLES96/morphsyn/node150.html#SECTION00054000000000000000 SUSANNE tags begining with J... represent predicative adjectives, attributive adjectives and general adjectives. Tags which begin with VV... are used for non-modal verbs or non-auxiliary verbs. LexicalVerbPresentParticiple is a class for present participle verbforms. (Sampson 1995: 118f) SUSANNE tags beginning with Y represent punctuation, text-structuring symbols, font instructions and pauses. Common nouns where the meaning of the word is a topographical feature and the noun is used as the head word in a place-name. (Sampson 1995, p. 94) XX is the SUSANNE tag for the word "not". (Sampson 1995, p. 119) TODO: check linking SUSANNE tags beginning with ICS apply to prepositions (I) that can also occur as subordinating conjunctions (CS), e.g., "after", "before", "since" (ICSt), "considering" (ICS), "save" (ICSx). This ambiguity is not resolved in SUSANNE. (Sampson 1995, p.108f.) SUSANNE tags beginning with V and containing 0 represent base forms of verbs, i.e., infinitive and identical word forms (indicative, imperative). TODO: check linking This is used for the indefinite article "a", "an". (Sampson 1995, p. 105) SUSANNE tags beginning with R... represent adverbs. SUSANNE tags beginning with VV... are used for non-modal verbs or non-auxiliary verbs. LexicalVerbPastForm is a class for past tense verbforms. (Sampson 1995, p. 118f) SUSANNE tags beginning with CC represent coordinating conjunctions ("and"/CC, "as"/CC31, "but"/CCB). (Sampson 1995, p. 105) FA is the SUSANNE tag used for a suffix which is seperated by a hyphen. (Sampson 1995, p. 107) SUSANNE tags beginning with JB... represent attributively used adjectives, e.g., "inner", "interior", "joint". (Sampson 1995, p. 109) SUSANNE tags beginning with APP represent attributive pronouns. This is the negative substitutive pronoun "none". (Sampson 1995, p. 114) SUSANNE tags beginning with RRQ represent interrogative or relative adverbs, e.g., "where", "how", "when". (Sampson 1995, p. 117) SUSANNE tags beginning with NN represent common nouns and direction nouns. Also, ND has been included here as it is comparable with temporal situating as in NNa, NNp, NNT.... SUSANNE tags beginning with PPI... represent first person pronouns. (Sampson 1995, p. 115) The class Be contains the SUSANNE tags for the forms of "be". (Sampson 1995, p. 118) This is the second person personal pronoun "you". (Sampson 1995, p. 115) These are general adjectives. (Sampson 1995, p. 109) SUSANNE tags beginning with V and containing N (VDN, VHN and VVN) represent past participles. The tag VHD is used for the past tense form "had". (Sampson 1995, p. 119) The tag VBM is used for "am". (Sampson 1995, p. 118) FD is used for a distorted word, for spoken language only. (Sampson 1995, p. 107) MFn is the tag used for fractons written digitally e.g. "2/3". (Sampson 1995, p. 111) The SubstitutiveIndefinitePronoun class subsumes the SUSANNE tags PN1 and PNX1. The SUSANNE VH0 is used for "have". (Sampson 1995, p. 119) SUSANNE tags for forms of "to have". (Sampson 1995, p. 118) SUSANNE tags beginning with RT... are temporal adverbs. (Sampson 1995, p.118) Present participle of "to do". Unit nouns, i.e., units of measurement, wether written in full ("inch", "kilogramm") or abbreviated either alphabetically or symbolically, as head of a noun phrase denoting a measured quantity. (Sampson 1995, p. 97) SUSANNE tag AT pertains to articles and other determiners, i.e., "every", the indefinite article, the negative determiner and the definite article. (Sampson 1995, p.105) Tags which begin with VV... are used for non-modal verbs or non-auxiliary verbs. LexicalVerbThirdPerson is used for the third person form of full verbs. (Sampson 1995, p. 119) SUSANNE tags beginning with RP... represent prepositions used as adverb. (Sampson 1995, p.116f) The NonPOS class subsumes separated morphological elements, formulae, equations, foreign words etc. Also, GG (~ FA) and ZZ (single Letter) are added here FB is the SUSANNE tag used for a prefix which is seperated by a hyphen. (Sampson 1995, p. 107) MC is the SUSANNE tag used for cardinal numbers from "zero" upwards, spelled out. This includes "umpteen". (Sampson 1995, p. 111) These are substitutive possessive pronouns. The SUSANNE tags begin with PPG... . (G is "genitive", as English possessives are derived from Genitive forms of personal pronouns) (Sampson 1995, p. 115) Postmodifying adverbs and adverb-like phrases. (Sampson 1995, p.115f.) The SUSANNE tag VD0 is used for "do" as infinitive or as indicative. (Sampson 1995, p. 119) SUSANNE tag for "in order" introducing infinitive (cf. TO); according to http://www.ilc.cnr.it/EAGLES96/morphsyn/node623.html#SECTION000124300000000000000 this is a subordinating conjunction The PresentParticiple class contains the SUSANNE tags VBG, VDD, VHG and VVG which stand for present participle forms. VDZ is used for "does". (Sampson 1995, p. 118) The class of ModalVerb contains the SUSANNE tags VMd, VMo and VMK. MD is the SUSANNE tag used for ordinal numbers, whether used as ordinal adjective or adverb or as fraction e.g. "third", "fourth". (Sampson 1995, p. 111) The Numeral class subsumes cardinal numbers, ordinal numbers and fraction numbers. The SUSANNE tags begin with M.. . This used for the past participle of "be". This is used for "no" as determiner or qualifier. (Sampson 1995, p. 105) SUSANNE tags beginning with APPG... are used for attributive possessive pronouns ("her", "his", "its", "my", "our", "your", "their") (Sampson 1995, p.105) The SUSANNE tag VMo is used for modals in present form e.g. "can", "dare", "may", "must", "need", "shall", "will". (Sampson 1995, p. 118) "o" (Sampson 1995, p. 118) is possibly misspelled for "0". SUSANNE TagSet (Sampson 1995) augmented with English examples from the Susanne corpus and from "Morphosyntactic Phenomena Encoded in Lexicons and Corpora A Common Proposal and Applications to European Languages EAG---CLWG---MORPHSYN/R Version of 31st Aug, 1996" (http://www.ilc.cnr.it/EAGLES96/morphsyn/) The hierarchical structure is inferred from the structure of SUSANNE tags: Letters on the left tend to represent superconcepts, letters to the right define their subclassification. There are, however, some deviations to this rule, e.g., APPGf for "her" as a determiner, where the A is a prefix ("attributive") attached to a tag of the PPG family ("possessive pronoun") The FO class subsumes formulas, mathematical expressions, adresses etc.. (Sampson 1995, p. 107f.) PNX1 is the SUSANNE tag used for the singular indefinite reflexive "oneself". (Sampson 1995, p. 115) SUSANNE tag used for "every" (Sampson 1995, p. 105), according to http://www.ilc.cnr.it/EAGLES96/morphsyn/node454.html#SECTION00084100000000000000, an indefinite determiner The SUSANNE tag VB0 is used for "be". (Sampson 1995, p. 119) The SUSANNE tag VBG is used for the present participle of "be", i.e., "being". (Sampson 1995, p. 118) The SUSANNE tag VHN is used for the past participle form "had". (Sampson 1995, p. 119) SUSANNE tags beginning with V and containing D,d or K represent past tense verbs (VBD, VHD, VMd, VMK and VVD). These are lexical verbs. These are apposition-introducing elements e.g. "for example", "for instance", "namely". (Sampson 1995, p.116) The SUSANNE tag VHG is used for "having". (Sampson 1995, p. 119) SUSANNE tags beginning with RR... are general adverbs. (Sampson 1995, p.117) The class DoType contains the SUSANNE tags for the forms of "do". (Sampson 1995, p. 118) The SUSANNE tag VBZ is used for "is". (Sampson 1995, p. 118) These are the reflexive personal pronouns. Their tags begin with PPX.. . (Sampson 1995, p. 115) The SUSANNE tag ND1 is used for direction e.g. "north", "N", "southeast". (Sampson 1995, p. 111) Common nouns of style and title. An S term is a status-indicating item which either accompanies one or more individual names within the full title of a person or is used to addres a person of appropriate status, or both. (Sampson 1995, p. 95, 112-113) The VDN tag describes "done". (Sampson 1995, p. 118) This is a single-word comparative adverb other than "more" and "less". (Sampson 1995, p. 117) The SUSANNE tag VHZ is used for "has". (Sampson 1995, p. 119) This is a superlative adverb other than "most" and "least". (Sampson 1995, p. 117) Past forms of "be". SUSANNE tags beginning with N... represent nouns (proper nouns and common nouns). The Conjunction class tags conjunctions and subsumes the classes BTO, coordinating conjunctions, subordinating conjunctions and pre-coordinators. The SUSANNE tags begin with C... . for logical reasons, pre-coordinator (LE) is added here. Demonstratives These are predicatively used adjectives ("akin", "alike", "asleep"). (Sampson 1995, p. 109) The class QuantifyingPronoun subsumes quantifying determiners or pronouns ("all", "both", "half"). (Sampson 1995, p. 106) SUSANNE tags begin with DA... represent quantifiers ("few", "little", "less", "many", "several"). (Sampson 1995, p. 106) SUSANNE tags beginning with RL... represent adverbs of place or direction. (Sampson 1995, p.116) The SubstitutiveRelativePronoun class contains the SUSANNE tags PNQOr and PNQSr. These are interjections tagged UH in SUSANNE. In the CHRISTINE corpus replaced with a range of different U-tags. ZZ is used for letters and letter names of the (roman or other) alphabet. (Sampson 1995, p. 120) Following the EAGLES recommendations, the Unique class subsumes different 'particles'. These are isolate forms, uninflectable, which have not been included anwhere else (see http://www.ilc.cnr.it/EAGLES96/morphsyn/node700.html#SECTION000153100000000000000 for a list). Includes "to" (+ Infin.) (UI), negative "not", "n't" (UN), and existential "there" (UX) SUSANNE tags beginning with CS... represent subcoordinating conjunctions ("although", "how", "if"). (Sampson 1995, p.106) The tags BTO and TO are also included here, according to van Valin and Lapolla (1997), English "to" (+infinitive) is a subordinating conjunction. VDD is used for "did". (Sampson 1995, p. 118) SUSANNE tags beginning with I... represent prepositions, but also PrepositionOrSubordinatingConjunction. The SUSANNE tag LE applies to pre-coordinators, i.e., the first element of paired coordination markers (cf. http://www.ilc.cnr.it/EAGLES96/morphsyn/node622.html#SECTION000124200000000000000, §3.71). In the Susanne Corpus "pre-co-ordinator" means the first part of paired co-ordinating markers, e.g. in "both... and", "neither... nor". (Sampson 1995, p.110) The SubstitutiveInterrogativePronoun class contains the SUSANNE tags PNQOq and PNQSq. SUSANNE tags beginning with P... represent substitutive pronouns (EAGLES "pronouns" as opposed to "determiner"). These are interrogative or relative pronouns in attributive use. (Sampson 1995, p. 107) SUSANNE tags beginningwith VVN represent past participles of lexical verbs. SUSANNE tags beginning with NP... apply to several subtypes of proper nouns. (Sampson 1995, p. 113f.) In SUSANNE terminology, a "qualifier" is an adverb-modifying adjective or adverb. Tags beginning with RG... apply to qualifiers which having no other adverbial use. (Sampson 1995, p.116) SUSANNE tag beginning with FW... apply to foreign words which are not capable of being allocated a more specific tag by reference to their English context. (Sampson 1995, p. 108) These are nouns for time. (Sampson 1995, p. 113) The SUSANNE tag VMK is used for "ought" and "used" as modal catenatives. (Sampson 1995, p. 118) SUSANNE tags which begin with VV... are used for non-modal/non-auxiliar verbs. (Sampson 1995, p. 118f) added by Johannes Bubenzer (09.09.2006) for a TnT tag used in a data sample from Collaborative Research Center (SFB) 538 "Multilingualism" (Hamburg), project K4 The SUSANNE tag VBR is used for "are". (Sampson 1995, p. 118) SUSANNE tags beginning with D... form a very heterogeneous class of expressions include quantifiers, DB and determiners. According to the examples, the prototypical forms which are collected here can be used as determiners. Some have ambiguity with pronouns. The Present class contains the SUSANNE tags VBM, VBZ, VDZ, VHZ and VVZ. SUSANNE V...-tags which contain "M" represent first person singular of "be", those which contain "Z" represent third person singular verbs. Other present verbs are tagged as BaseForm. This is the definite article "the". (Sampson 1995, p. 105) These are third person personal pronouns. (Sampson 1995, p. 115) Organization nouns are common nouns that denote is an organization and that are used as the head word in the name of organizations of that kind. (Sampson 1995, p. 94) SUSANNE tags which begin with PNQ... represent SubstitutiveWHPronouns. This class contains PNQV ("whosever", "whomever", "whoever"), SubstitutiveInterrogativePronoun and SubstitutiveRelativePronoun. EX is the SUSANNE tag used for existential "there". (Sampson 1995, p. 107) The SUSANNE tag VMd is used for modals in past form e.g. "could", "might", "should", "would". (Sampson 1995, p. 118) "d" in the tag name is possibly misspelled for "D". SUSANNE tag beginning with DD... represent a very heterogeneous group including demonstratives, quantifiers and wh-determiner. (Sampson 1995, p. 106) GG is the SUSANNE tag used for the Germanic genitive inflection +´s, or + ´ after a plural stem and certain other stems ending in -s. (Sampson 1995, p. 108) Germanic genitive 's is modelled as NonPOS here because of its diachronic origin as a morphological element, cf. FA. (Chiarcos) Alternatively, 's could be treated as postposition (http://www.ilc.cnr.it/EAGLES96/morphsyn/node56.html#SECTION00044000000000000000, http://www.ilc.cnr.it/EAGLES96/morphsyn/node586.html#SECTION000114100000000000000) DA2R is the SUSANNE tag used for "fewer". DA2R pos pos MDo MDo is the tag for "first", "second". VVDi is used for the past tense form of intransitive verbs. VVDi pos
DA2 is the SUSANNE tag used for "few" and "many". DA2 pos pos MDt MDt is used for "next", "last". AT1e AT1e is used for "every". pos pos NNU1n NNU1n is used for unit nouns which can be used in countable singular or as uncountable mass noun e.g. "metre". NP1f pos NP1f tags a feminine forename. ICSt pos ISCt describes "after", "before", "ere", "since", "until", "til" as prepositions (with a complement) or subordinating conjunctions. NPM1 NPM1 is the SUSANNE tag used for a month name, e.g. "October" pos pos PPGm is used for "his". PPGm DD1a DD1a is used for "that" as determiner, demonstrative pronoun, or qualifier (e.g. that slowly). pos NNU1c is the tag for unit countable nouns in singular e.g. "inch", "kilogram". NNU1c pos pos FA is the SUSANNE tag used for a suffix which is seperated by a hyphen. FA CCB is used for "but" as coordinating conjunction. CCB pos pos These are the plural forms of the reflexive personal pronouns. PPX2 pos AT The SUSANNE tag AT is used for "the". DA1 pos DA1 is used for "much", "little". PPGy is used for "yours"in singular or plural. NPM pos APPGi1 pos APPGi1 is the tag for "my" as possessive. BTO BTO stands for " in order" and occurs within a infinitival clause as a left sister of an infinitival verb/ verb group. (Sampson 1995, p.105, 268) pos NP2f pos NP2f is used for plural feminine forenames. RGQV RGQV is the tag for "however", "no matter how" as qualifier. pos pos PN1 is used for e.g. "anybody", "anyone", "anything", "everybody", "everything", "naught", "nobody", "no one", "nothing". (Sampson 1995, p. 114) PN1 pos NP2j is the SUSANNE tag used for plural organization names, e.g. "ICP's", "Unilevers". NP2j pos NNO MF pos MF is used for fraction numerals. VVG pos VVG is used for present participle verbforms. DD2 DD2 is used for "a few", "a good few", "a good many", "a good great". pos PPH pos These are third person pronouns. <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNSA is used for an item of style or title fllowing name, with or without a countable singular use, e.g. "Jr", "Sr".</rdfs:comment> <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNSA</olia_system:hasTag> These are adverbs. pos RA RRQV is used for wh-...-ever adverbs, e.g. "however", "no matter how", "whenever", "wherever", "wheresoever". RRQV pos MCs MCs is the desciption for integers written digitally with a leading zero e.g. "007". pos NNT1h pos NNT1h is the tag for holiday or season, e.g. "Christmas", "Boxing Day". NPD1 is the SUSANNE tag used for a day of the week e.g. "Monday". pos NPD1 NP1m NP1f tags a masculine forename. pos NP1m MCr pos MCr is used for roman numerals from " I " upwards. pos DA2q is used for "several". DA2q EX pos EX is the SUSANNE tag used for the existential "there". pos PPIO2 is the SUSANNE tag used for "us". PPIO2 DAr is used for "former"/ " latter" in all uses. DAr pos PPHO1m for "him". pos PPHO1m VVNi used for the past participle of intransitive verbs pos VVNi DDQ is the SUSANNE tag used for "what". pos DDQ pos RAq RAq is the SUSANNE tag used for "apiece", distributive uses for "each". NNmm stands for the plural of NNm abbreviation, e.g. "Figs.", "Nos.". NNmm pos RRR RRR is used for a single-word comparative adverb other than "more", e.g. " better", "closer", "deeper", "earlier" and so on. pos "Whose" without a specification of its use. pos DDQG DBh is used for "half" as determiner or pronoun. pos DBh <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNSS</olia_system:hasTag> <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <hasNumber rdf:resource="#plural"/> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNSS is used for style or title-only adjectival items in plural, e.g. "Messrs", "Mmes".</rdfs:comment> VBD VBD is used for the past forms of be. pos MC2n is used for the plural of cardinals written digitally. MC2n pos PPGf PPGf is used for "hers". pos pos DAg DAg is used for "own" as part of a genitive construction. pos JBy JBy is used for "only" as an adjective like in the example "the only thing". YC is the SUSANNE tag used for comma. pos YC FD is the SUSANNE tag used for a distorted word. It is only used in speech analysis. pos FD NNLb are descriptive heads of place-name nouns with tendency to precede the specific name "Costa". pos NNLb NNJ pos NNJ is used for a noun which appears only as a desciptive head of an organisation name, e.g. "Corp", "Bros". Singular or plural is possible. pos LE LE is used for "both", "not only" as pre-co-ordinator. YBL describes the beginning of heading: <bmajhd> <bminhd>. YBL pos pos PPX1h PPX1h tags "itself". VH0 The SUSANNE tag VH0 is used for "have". pos pos NPM is the tag for an abbreviated month name, e.g. "Oct". NPM CSN is used for "than" in all uses. CSN pos pos RRg is used for "long" as adverb. RRg PPX1f PPX1f is the SUSANNE tag used for "herself". pos DDo DDo is used for "a lot". pos pos This is the time of the day written digitally, e.g. "10:30", "10.30". NNa PPGi1 pos PPGi1 is used for "mine". PNQV is used for substitutive wh pronouns without a specification of case. PNQV pos pos YPR is the SUSANNE tag used for the closing bracket ")". YPR DB2 DB2 is used for "both" as determiner or pronoun. pos This is a noun which appears as a countable singular and is a desciptive head of an organisation name NNJ1c pos YBR pos YBR tags the end of heading <emajhd> <eminhd>. IIb IIb is used for "by". pos PNQOr pos This tag is used for the relative "whom". (Sampson 1995, p. 115) RRe pos RRe is the tag for "enough" as clause adverb. The SUSANNE tag VBM is used for "am". VBM pos pos CSg CSg is used for "though" as subordinating conjunction. pos ICSk ICSk is the tag for "like" as preposition, subordinating conjunction, or in a relative use. UH is the SUSANNE tag used for an interjection e.g. "blimey", "hello", "please", "well", "yes". UH pos pos RLh is used for "here", "there" as adverbs of place RLh MC1n is the tag for 1 written digitally, including uses as ordinal e.g. "February 1". MC1n pos FOx pos FOx is the tag for an algebraic expression with nominal as opposed to equative function e.g. "a", "pi". DDQGr for "whose" in relative uses. pos DDQGr RLw is used for "somewhere", "someplace", "anywhere", "anyplace", "everywhere", "nowhere" are adverbs of place or direction. RLw pos YPL YPL is the SUSANNE tag used for the opening bracket "(". pos AT1 is used for the indefinite article "a", "an". pos AT1 RTo RTo is used for "now". pos NP2z is used for the pluralized form of a 'code name' functioning as countable noun, e.g. "1-11´s". NP2z pos pos NNOc is used for e.g. "dozen", "score", "gross" etc. . NNOc VBR VBR is used for "are". pos added by Johannes Bubenzer (2006-09-09) according to a TnT-tagged dataset provided by SFB 538 (Hamburg), project K4. NNL1 is used for topographical nouns in singular. pos NNL1 The SUSANNE tag VMd is used for modals in past form e.g. "could", "might", "should", "would". pos VMd pos NNu is the tag for plural mass nouns e.g. "data", "measles". NNu NP1g NP1g is used for miscellaneous singular geographical proper names, e.g. "Adriatic", "America", "Sherwood". pos The SUSANNE tag VHD is used for the past tense form "had". VHD pos pos APPGi2 APPGi2 is used for "our". DD1t DD1t is used for "a little". pos RAe pos RAe is the tag for "else" in all uses. YX YX is the SUSANNE tag used for the exclamation mark "!". pos NP1z is used for a 'code name', an acronym or arbitrary alphanumeric expression functioning as countable noun. NP1z pos PPX2h PPX2h is the SUSANNE tag used for "themselves", "each other" or "one another". pos PPI pos PPI is used for substitutive first person pronouns. added by Johannes Bubenzer (2006-09-09) in accordance to a TnT-tagged data set provided by SFB 538 (Hamburg), project K4 NNT1 is sued for temporal nouns in singular. NNT1 pos DD1 is used for determiner in singular. added by Johannes Bubenzer (2006-09-09) in accordance to a TnT-tagged data set provided by SFB 538 (Hamburg), project K4 pos DD1 pos The SUSANNE tag VHG is used for "having". VHG pos These are wh-adverbs. RRQ DD1q pos DD1q is used for "another", "each", "one and the same" as determiner or pronoun. YR is the SUSANNE tag used for an interruption point. It is only used in speech analysis. YR pos pos YG YG is the SUSANNE tag used for a logical position of a transformationally moved/deleted item. VVDt pos The past tense form of transitive verbs is tagged with VVDt. pos CSW is used for "whether" in all uses. CSW PPHS1m id used for "he". PPHS1m pos pos FW FW is the SUSANNE tag used for a foreign word which is not capable of being allocated a more specific tag by reference to its English context. pos The SUSANNE tag VMo is used for modals in present form e.g. "can", "dare", "may", "must", "need", "shall", "will". VMo NNux NNux is used for plural mass nouns in "-ics", e.g. "mechanics". pos The SUSANNE tag VHZ is used for "has". VHZ pos RLe RLe is used for "elsewhere". pos pos VBG VBG is used for "being". DDQr pos DDQr for "which" in relative uses. NP1i is the tag for intitials of personal names. pos NP1i RL is the SUSANNE tag used for an adverb of place or direction e.g. "along", "back", "outside". RL pos NP2m tags plural masculine forenames. pos NP2m MCn is used for cardinal numerals written in digits, e.g "February 28". MCn pos RAj is used for postnomianl adjectives e.g. "destinate", "centigrade", "elect", "galore". RAj pos pos PPH1 is used for "it". PPH1 pos DAz DAz is used for "such". RTt RTt is used for "today", "tomorrow", "tonight", "yesterday". pos NNp is the tag for the time of a day written digitally in 24-hour notation from 13:00 on, e.g. "13:30". pos NNp These are plural proper names. NP2 pos PNQVS pos This tag is used for "no matter who", "whoever", "whosever". RRx stands for "only" as adverb. pos RRx NP2s pos NP2s describes plural surnames, as in the "Robinsons". pos The VDN tag describes "done". VDN pos The base form of transitive verbs is tagged with VV0t. VV0t pos JA is used for predicatively used adjectives e.g. "ablaze", "alone". JA PNX1 is the SUSANNE tag used for the singular indefinite reflexive "oneself". pos PNX1 pos II is the SUSANNE tag used for general prepositions, including prepositional use of words which can function either as preposition or as adverb. e.g. "into", "near", "off", "on", "onto", "opposite", "outside", "over", "past", "through". (Sampson 1995, p. 109) II VVZ is used for the third person singular verbforms. pos VVZ MFn MFn is the SUSANNE tag used for fractions written digitally e.g. "2/3". pos VVGi is used for present participle intransitive verbforms. pos VVGi ICS ICS is used for "considering", "notwithstanding". pos pos VVNv for the past participle of verbs having transitive and intransitive uses. VVNv RT is the SUSANNE tag used for "again", "hereafter", "overnight". RT pos LEe is used for "either" as pre-co-ordinator. LEe pos MDn MDn are ordinal numbers written digitally e.g. "1 st", "2 nd". pos pos NNUp NNUp is used for "%", "percent", "per cent". pos This tag is used for "no matter whose", "whosever". PNQVG CSn CSn is used for "when" as subordinating conjunction. pos YQ is the SUSANNE tag used for the question mark "?". pos YQ NNT1c is for a singular time noun that can head a noun phrase functioning adverbially, e.g. "hour", "day", "evening". pos NNT1c pos NP These are proper nouns. RP pos RP is used for "across", "down", "in", "off", "on", "out", "over", "through", "up" in adverberbial uses. pos RGQq RGQq for "how" as qualifier. ND is used for direction nouns. ND pos DD1i pos DD1i is used for "this" in al uses including as qualifier. pos DAT DAT is used for "least" and "most" in all uses. YND for the colon-dash ":". YND pos FWs pos FWs is the tag for a biological Latin species (or lower rank) name e.g. "sapiens officinale". PNQS is used for "who". This tag is taken from the reduced TnT tagset. pos PNQS RGa Rga is used for "as" as qualifier. RGb is the tag for "quite" as qualifier or before an article e.g. in "quite a good idea". RGf stands for "too" as qualifier. pos pos NPM2 NPM2 is the SUSANNE tag used for plural of month names, e.g. "Octobers". pos DDQV The SUSANNE tag DDQV describes e.g. "whatever", "whichever", "whichsever", "no matter which". JB pos JB is used for attributively used adjectives e.g. "chief", "entire", "future", "inverse", "major". pos NPD2 is the SUSANNE tag used for a day of the week in plural, e.g. "Tuesdays". NPD2 TO is the SUSANNE tag used for the infinitival "to". It could be analyzed as a sub-type of subordination. (Sampson 1995, p. 118) TO pos YD is the SUSANNE tag used for dash (em-dash or larger). YD pos pos VDD is used for "did". VDD RAz pos RAz tags "or so" as in "fifty or so". pos ATn ATn stands for "no" as determiner or qualifier. pos ZZ1 is the SUSANNE tag used for a singular letter of the (roman or other) alphabet. ZZ1 RPK pos RPK is used for "about" in adverbial and catenative use. pos NNUc NNUc is used for unit nouns which can stand in countable singular or in plural, e.g. "hertz", "yen". VBN This is the SUSANNE tag used for "been". pos JJR JJRis used for comparative adjectives, including "elder" and "further". pos NNL1n is used for descriptive heads of place-name in countable singular or uncountable mass nouns, e.g. "drive", "water", "green". NNL1n pos VDZ is used for "does". VDZ pos PPG These are substitutive possessive pronouns. pos pos LEn is used for "neither" as pre-co-ordinator. LEn <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS</olia_system:hasTag> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS is used for nouns of style or title in singular, e.g. "Mr", "Mrs", "Dr".</rdfs:comment> <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <hasNumber rdf:resource="#singular"/> YH is the SUSANNE tag used for a hyphen, "-" dash. YH pos pos NNOn NNOn is used for abbreviated NNOc words and is ambiguous between singular and plural, e.g. "m" for million(s). pos RGAf for "enough" as qualifier. RGAf DB pos DB is used for "all", "both" or "half" as pronoun or determiner. JJh JJh is used for pseudo-adjectives formed by suffixing "-ed" to the last word of a nominal compound. pos RAa is the SUSANNE tag used for "ago", "since" as synonym for age. RAa pos pos RGz for "so" as qualifier. RGz VBDR is used for "were". pos VBDR NN1 This is a common noun in singular. pos APPGh1 is the tag for "its". APPGh1 pos pos FOc tags a formula or an acronym for chemical substances, molecules or subatomic particles e.g. "TNT", "DDT". FOc RAn pos RAn is used for "whatever", "whatsever" after a negative or non-assertive nominal head. YE pos YE is the SUSANNE tag used for the ellipsis mark "...". RLn is used for "downstairs", "upstairs" . RLn pos RRz is used for "so" introducing a main clause or a clause of purpose or result, or as an adverb of manner or degree. pos RRz PPX2i describes "ourselves"/ "ourself". PPX2i pos VMK pos The SUSANNE tag VMK is used for "ought used" as modal catenative. MD is the SUSANNE tag used for ordinal numbers, whether used as ordinal adjective or adverb or as fraction e.g. "third", "fourth". pos MD MCe is for numbers containing a non-decimal separator, other than a time-of-day expression, e.g. "4:11", "4.11"(stands for four years and 11 months), "3.4.93". pos MCe RAi pos RAi is the tag for "inst", "ult" etc. . NNUn pos NNUn is used unit nouns which can be used as countable singular, plural or as uncountable mass noun. NNm NNm is the tag for a singular abbreviation normally followed by a numeral functioning as a proper name e.g. "Fig.", "No.". pos pos DA DA is used for quantifiers. pos added by Johannes Bubenzer (2006-09-09) in accordance to a TnT-tagged data set provided by SFB 538 (Hamburg), project K4 VM is used for modal verbforms. VM RRy RRy is used for "any" as qualifier with comparative. pos NNL1cb pos NNL1cb is used for descriptive heads of place-name in countable singular with tendency to precede the specific name, e.g. "camp", "lake", "mount". pos NP2g NP2g is used for miscellaneous plural geographical proper names, e.g. "Alps", "Americas", "Antilles". NNT1m pos NNT1m is used for a point-of-time noun, which cannot head a noun phrase functioning adverbially, e.g. "noon", "midnight", "midyear". VVGv describes the present participle form of verbs which have transitive and intransitive uses. VVGv pos pos FOt is used for a telephone number. FOt pos MC2 MC2 tags the plural of cardinals. APPGy is used for "your". APPGy pos CSi pos CSi is used for "if". DD1n is used for "neither" as determiner or pronoun. DD1n pos pos CCn is used for "nor". CCn RGf pos RGf stands for "too" as qualifier. CS pos These are subordinating conjunctions. <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS2</olia_system:hasTag> <hasNumber rdf:resource="#plural"/> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS2 is used for nouns of style or title in plural, e.g. "doctors".</rdfs:comment> pos PPX1m is used for "himself". PPX1m pos FOr is the tag for a road name e.g. "M6 B6480 I-95". FOr <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNSj is used for style or title-only adjectival items, e.g "Rev", "Very Rev".</rdfs:comment> <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNSj</olia_system:hasTag> <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> PPX1y PPX1y for "yourselves". pos pos NP1t is the tag for a town name. NP1t pos RGb RGb is the tag for "quite" as qualifier or before an article e.g. in "quite a good idea". PNQSq This tag is used for the interrogative "who". (Sampson 1995, p. 115) pos YT pos YT is used for bolds. DD1b is used for "a bit". DD1b pos APPGm is used for "his" except as pronoun. APPGm pos pos NNU NNU is used for unit-only nouns in singular, plural, or neutral. pos These are interrogative (RRQq) wh- adverbs e.g. "how", "when", "where". RRQq pos CSA is the SUSANNE tag for "as" as subordinating conjunction or as preposition in comparative sense. CSA pos VVN iS used for the past participle verbforms. VVN pos DDf is used for "enough" as pronoun or pre- or postmodifying a noun. DDf VVDv VVDv describes the past tense form of verbs which have transitive and intransitive uses. pos <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS1n</olia_system:hasTag> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS1n is used for nouns of style or title in countable singular or uncountable mass noun, e.g. "justice".</rdfs:comment> <hasNumber rdf:resource="#singular"/> DAy DAy is the tag for "same" / "selfsame". pos VVZi is used for the third person singular form of intransitive verbs. pos VVZi pos NPD is the SUSANNE tag for an abbreviated day of the week, e.g. "Tues". NPD This is a countable singular noun, e.g. "clock", "crowd", "Buddhist". NN1c pos NP1p NP1p is the SUSANNE tag used for 'province' name, name of an US state, an English county etc. . pos CCr pos CCr is the tag for "or". pos RAb RAb is used for "A.D./ Anno Domini" and for counterparts used in the pre-date position in other calendars. pos MC2y MC2y is used for the plural of year names e.g. "1960s". This is the tag for "we". PPIS2 pos pos VVD VVD is used for past tense verbforms. pos VDG This is the SUSANNE tag used for "doing". CSr pos CSr is used for "where" as subordinating conjunction. pos JBT is the SUSANNE tag used for e.g. "utmost", "uttermost". JBT PPHS1f PPHS1f describes "she". pos pos YIL YIL is the SUSANNE tag used for left quotes, opening (single or double) inverted commas. MCb is used for 'labels' for cross-reference within and between texts, comprising combinations of digits, letters and/ or non-alphanumeric characters. vMCb pos pos GG GG is the SUSANNE tag used for the Germanic genitive inflection +´s, or + ´ after a plural stem and certain other stems ending in -s. NNU2 pos NNU2 is the tag for unit nouns in plural, e.g. "inches". NN1u pos This is a uncountable/ mass noun, e.g. "snow", "Buddhism". pos VV0i is used for the base form of intransitive verbs. VV0i RRQr RRQr is used for relative wh- adverbs (having overt antecedent - not "fused" relative) e.g. "when", "where", "whereat", "whereby". pos pos DA2T is used for "fewest". DA2T VVGK VVGK is used for "going" as catenative (in BE "going to"). pos These are base form of verbs. VV0 pos pos PN1o PN1o is the SUSANNE tag usedfor the singular indefinite pronoun ("one"). (Sampson 1995, p. 114) ICSx ISCx is the SUSANNE tag used for "but", "except", "save" as prepositions or subordinating conjunctions. pos pos NN1ux This is a uncountable/ mass noun whch is ending on "-ics", e.g. "athletics", "thermodynamics". RAy pos RAy describes "B.C." and counterparts following dates in other calendars. pos FOs FOs is the tag for registrtation/ reference/ serial/ model number. YM is the SUSANNE tag used for a filled pause. Its only used in speech analysis. YM pos PPX pos PPX are reflexive personal pronouns. NNL2 is used for descriptive heads of place-name in plural, e.g. "buildings", "montains". NNL2 pos These are singular proper names. pos NP1 PPGh2 PPGh2 is used for "theirs". pos CSk pos CSk is used for "as if", "as though". DD2a is used for "those". DD2a pos RRT is used for a superlative adverb other than "most", e.g. "best", "brightest", "closest", "earliest", "fastest" and so on. RRT pos PPHS2 is used for "they". pos PPHS2 NN1n pos This is a noun which can be used in a countable singular or as uncountable/ mass noun. JBo is used for "other" in all uses. JBo pos RTn is used for "then". pos RTn PN1z is used for "so" as a pro-form, for instance in the "do so"- construction. (Sampson 1995, p. 114) pos PN1z JJj JJj is the tag for abbreviated adjectives appended to organization names to identify it legal status e.g. "Ltd Inc Pty" and counterparts in other jurisdictions. pos pos This tag is used for "no matter whom", "whomever". PNQVO DAR pos DAR is used for "less" or "more". CC is used for coordinating conjunctions "and", "and/or", "as well as", "plus" pos CC VVZv describes the third person singular form of verbs which have transitive and intransitive uses. VVZv pos YTL pos YTL is used for a begin bold. MC2r decribes the plural of roman numerals. MC2r pos RRs describes "otherwise", "yet". RRs pos FOqc FOqc is the tag fora chemical equation, when it is analysed as a single word. pos NNJ1 is used for organization nouns in singular. added by Johannes Bubenzer (2006-09-09) in accordance to a TnT-tagged data set provided by SFB 538 (Hamburg), project K4 pos NNJ1 ND1 is used for direction e.g. "north", "N", "southeast". pos ND1 NP2p is the SUSANNE tag used for plural province names, e.g. "Carolinas". pos NP2p MCy MCy stands for the year name written digitally (full or with apostrophe) e.g. "1987", " `99". pos NNL1c pos NNL1c is used for descriptive heads of place-name in countable singular, e.g. "road", "island". PPHO2 pos PPHO2 is used for "them". pos The SUSANNE tag VD0 is used for "do" as infinitive or as indicative. VD0 IIp is used for "per". pos IIp NNU1 is used for unit nouns in singular. NNU1 pos PN PN is the SUSANNE tag used for the negative substitutive pronoun "none" in all uses. (Sampson 1995, p. 114) pos pos PPIS1 This is the tag for ' I ' as personal pronoun. DD2i DD2i is used for "these". pos RAh is used "am" (for 'ante meridiem'), "pm" o`clock. RAh pos FWg is used for a biological Latin name of genus or other rank higherthan species e.g. "Equus Umbelliferae". FWg pos pos APPG is used for attributive possessive pronouns. APPG NP2c stands for the plural of country names, e.g. "the Philippines". NP2c pos pos VV0v VV0v is used for the base form of verbs which have transitive and intransitive uses. pos IIg is the tag for "aged" as a pseudo-preposition. IIg APPGf APPGf is used for "her" as possessive. pos JBR JBR is used for e.g. "inner", "lesser", "nether", "outer", "upper". pos PPX2y pos PPX2y tags "yourselves". XX XX-Tag pos XX is the SUSANNE tag used for "not". pos RGQ RGQ is used for question qualifiers. NNLc pos NNLc is used for descriptive heads of place-name in countable singular or plural, e.g. "barracks", "links". pos PPGi2 is used "ours". PPGi2 pos The SUSANNE tag VHN is used for the past participle form "had". VHN IF is the SUSANNE tag used for "for" as preposition. (Sampson 1995, p. 109) pos IF RG is the SUSANNE tag used for a qualifier which having no other adverbial use e.g. "very", "jolly", "mighty", "stark". RG pos RGA is the SUSANNE tag for "indeed" as qualifier. pos RGA pos DD1e is used for "either" as determiner or pronoun. DD1e DBa pos DBa is used for "all" as determiner or pronoun. pos PPX1i is used for "myself". PPX1i RGr RGr is used for "rather" as qualifier or before an article. RGz for "so" as qualifier. pos This tag is used for "you" in singular or plural. PPY pos pos YP YP is the SUSANNE tag used for the silent pause. It is used only in speech analysis. pos DD is used for "yon", "yonder" as determiner, "somesuch", "the rest". DD pos NP2x are miscellaneous plural proper names. NP2x CST is the tag for "that" as subordinating conjunction, including in its use in introducing relative clauses. CST pos The SUSANNE tag VBZ is used for "is". pos VBZ IIt is used for "to" as preposition. IIt pos pos RAp is used for "per annum", "pa", "per diem" etc. . RAp pos YIR is the SUSANNE tag used for closing (single or double) inverted commas. YIR These are the singular forms of the reflexive personal pronouns. pos PPX1 NNUb is used for a unit symbol which precedes numerals, e.g. "$". NNUb pos pos THis is an attributive common noun, e.g. "scissor", "trouser" NNb MCd is the tag for numerals including decimal points. pos MCd VBDZ is used for "was". pos VBDZ JJT JJT is used for superlative adjectives, including "eldest" and "furthest". pos pos NP1s NP1s is used for surnames. PNQSr pos This tag is used for the relative "who". (Sampson 1995, p. 115) This is a common noun. pos NN ZZ is used for letters of the (roman or other) alphabet. ZZ pos MC1 is used for "one" as numeral. pos MC1 JJs is used for adjectives in -most orther than "utmost/ uttermost" e.g. "backmost", "inmost", "lowermost", "midmost". JJs pos MC is the SUSANNE tag used for cardinal numbers from "zero" upwards, spelled out. This includes "umpteen". pos MC RAh DDQq for "which" in interrogative uses. pos pos ZZ2 which is used for a letter of the alphabet with plural inflexion. ZZ2 This is a common noun in plural. NN2 pos JJ pos JJ is used for general adjectives e.g. "blue", "English". <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS1c</olia_system:hasTag> <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <hasCountability rdf:resource="#countable"/> <hasNumber rdf:resource="#singular"/> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS1c is used for nouns of style or title in countable singular "doctor", "miss", "madam".</rdfs:comment> pos REX REX is the SUSANNE tag used for an apposition-introducing element e.g. "for example", "for instance", "namely". This is a noun which beginns with "mid-" other than time noun, e.g. "midstream", "midfield". pos NN1m pos FO is the SUSANNE tag used for a determinate formula. FO pos VB0 is used for "be". VB0 VVNt used for the past participle of transitive verbs. pos VVNt This tag is used for the interrogative "whom". (Sampson 1995, p. 115) pos PNQOq pos NP1j stands for organization names, e.g. "Unilever", "Kiwanis". NP1j pos IO IO is the SUSANNE tag used for "of". (Sampson 1995, p. 109) DDy pos DDy is used for "any" as determiner or pronoun. FOqx is used for a algebraic equation, when it is analysed as a single word. pos FOqx pos This is a noun which can appear in a countable singular or in plural, e.g. "sheep", "species", "people". NNc VVGt The present participle form of transitive verbs is tagged with VVGt. pos FB pos FB is the SUSANNE tag used for a prefix which is seperated by a hyphen. RRf RRf is used for "far" as adverb. pos IIa IIa is the SUSANNE tag used for "as". pos NNJ2 NNJ2 is used for plural descriptive heads of an organisation name, e.g. "associates". pos YF pos YF is the SUSANNE tag used for the full stop, American "period", ".". pos RAx RAx is used for a mathematical postfix operator e.g. "!" for factorial. MCo pos MCo is the tag for "0" written as digit. pos NNL is used for descriptive head of place-name items in singular or plural, e.g. "Rd", "Is". NNL pos The third person singular form of transitive verbs is tagged with VVZt. VVZt pos FOp FOp is used for a London postal district, the British post-code, teh American 'Zip-code' e.g. "W.C.2", "LA6 3AN". NP1x NP1x is used for miscellaneous singular proper name, e.g. "Parthenon", "Persil". pos NNT2 stands for the plural of NNT1.. nouns. NNT2 pos pos YS YS is the SUSANNE tag used for a semicolon. pos APPGh2 for "their". APPGh2 RAc stands for a cocordinationg element e.g. "etc", "f" or "ff" (means 'following'). RAc pos YTR pos YTR is used for an end bold. pos DDQGq is the tag for "whose" in interrogative uses. DDQGq YB pos YB is the SUSANNE tag used for text devisions of a paragraph or a higher rank which do not have a heading: <majbrk> <minbrk>. PPHS1 PPHS1 ia used for subjective third person pronouns in singular. pos IIx IIx is used for mathematical infix operators e.g. "+", ">", "=", etc. . pos NNJ1n is used for nouns which can be uncountable ("mass"), a countable singular and a descriptive head of an organisation name, e.g. "company", "organization". pos NNJ1n NP1c NP1c is the used special tag for country names. pos PPHO1f is the SUSANNE tag used for "her" as pronoun. PPHO1f pos PPIO1 pos PPIO1 is used for "me". IW pos IW is the SUSANNE tag used for "with" in all uses, "without" as preposition, "what with". (Sampson 1995, p. 109) YO YO is the SUSANNE tag used for bullet, pilcrom, or other special symbol visually marking a text division. pos pos CSf CSf is used for "for" as conjunction. DDi is used for "some" as determiner or pronoun. DDi pos pos NNn NNn is used for nouns which can be uncountable ("mass"), countable singular or plural, e.g "fish" (a fish, three fish, a piece of fish), "Chinese (a Chinese, three Chinese, Chinese is tonal)" . RGi pos The SUSANNE tag RGi is used for "about", "around", "circa", "getting on for" etc.. pos PPHO1 ia used for objective third person pronouns in singular. PPHO1 RR is used for general adverbs. RR pos added by Johannes Bubenzer (2006-09-09) in accordance to a TnT-tagged data set provided by SFB 538 (Hamburg), project K4 <olia_system:hasTier rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >pos</olia_system:hasTier> <hasNumber rdf:resource="#singular"/> <olia_system:hasTag rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS1</olia_system:hasTag> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >NNS1 is used for title nouns in singular.</rdfs:comment> NP2t is used for plural town names. NP2t pos YN pos YN is the SUSANNE tag used for colon.