A Quick Guide to the TOSCA/ICE Grammar used in our parsed corpora.

The TOSCA/ICE Grammar is the annotation scheme used in the parsed ICE-GB and DCPSE corpora.
It is a phrase structure grammar based on Quirk et al. (1985). Each node in the tree consists of three elements: a function, a category and a set of features. By default, ICECUP displays these in the following locations.

Both corpus tree nodes and Fuzzy Tree Fragment nodes are arranged in the same way.
The figure above shows a single-node FTF drawn left-to-right. Here is an example tree drawn in the same way.

What do the different labels mean? What is a ‘PU’ or an ‘OD’? What does ‘prd’ stand for? This page is a quick reference.
Much more information is available in the online Help file supplied with the corpus. Below is only a very brief outline.
Our ICECUP software is designed from the ground up around the problem that all users have to ‘learn the grammar in order to explore the grammar’. So don’t be put off by the apparent complexity of the grammar. The software is very forgiving and you will learn by trying things out for yourself!
Outline
Individual terms form phrase structure trees. The scheme can be thought of as operating at two levels.
1. Part of speech tagging
Consider the sentence from ICE-GB/DCPSE shown above, ‘I think that's fascinating’ (S1A-002/DI-A02 #28). It is tagged as follows:
I | think | that | 's | fascinating |
PRON(pers, sing) | V(montr, pres) | PRON(dem, sing) | V(cop, pres, encl) | ADJ(ingp) |
Part of speech tags, or ‘wordclass tags’, classify words into types. Here there are three types: pronoun, verb, and adjective. They may also include features. Here we have the following: personal singular pronoun, monotransitive present tense verb, demonstrative singular pronoun, copular present tense enclitic verb and -ing participle adjective. Note that ’s (= is) is treated as a separate word for the purposes of analysis.
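As a rough illustration of how a tag splits into a wordclass and its features, here is a minimal Python sketch. It assumes tags are written exactly as in the table above, i.e. a wordclass label optionally followed by a comma-separated feature list in brackets. The names `Tag` and `parse_tag` are hypothetical and are not part of ICECUP.

```python
import re
from typing import NamedTuple, Tuple


class Tag(NamedTuple):
    wordclass: str             # e.g. "PRON", "V", "ADJ"
    features: Tuple[str, ...]  # e.g. ("pers", "sing"); may be empty


# A wordclass label, optionally followed by "(feature, feature, ...)".
TAG_PATTERN = re.compile(r"^\s*([^\s()]+)\s*(?:\((.*)\))?\s*$")


def parse_tag(text: str) -> Tag:
    """Split a tag such as 'V(cop, pres, encl)' into wordclass and features."""
    match = TAG_PATTERN.match(text)
    if match is None:
        raise ValueError(f"not a wordclass tag: {text!r}")
    wordclass, feature_text = match.groups()
    features = tuple(
        part.strip() for part in (feature_text or "").split(",") if part.strip()
    )
    return Tag(wordclass, features)


# The example sentence, word by word, with the tags given above.
words = ["I", "think", "that", "'s", "fascinating"]
tags = ["PRON(pers, sing)", "V(montr, pres)", "PRON(dem, sing)",
        "V(cop, pres, encl)", "ADJ(ingp)"]

for word, tag in zip(words, tags):
    print(word, parse_tag(tag))
```

Keeping the features as a tuple preserves the order in which they are written in the tag, which makes it easy to compare a parsed tag against the original string.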
2. Parsing
Part of speech tagging on its own doesn’t record the structure of the sentence.
We rectify this by fully parsing the sentence. The part of speech tags sit at the ‘leaves’ of the phrase structure tree.

The illustration shows the phrase structure of this particular sentence. Leaf nodes annotating words contain the wordclass information. So, from bottom to top (right to left in the figure):
- I (PRON(pers, sing)) is the head (NPHD) of the first noun phrase (NP). This is the subject of the main clause, which is also the parsing unit (PU).
- think (V(montr, pres)) is the main verb (MVB) in the verb phrase (VP). This has the function label ‘verbal’ (VB), and is also found in the main clause. The features ‘monotransitive’ (montr) and ‘present’ (pres) are copied to (‘percolate up to’) the verb phrase and host clause.
- that (PRON(dem, sing)) is the head of a second noun phrase (NP). This forms the subject of a dependent clause (CL(depend)) that is the direct object (OD) of the main clause.
- ’s (= is, V(cop, pres, encl)) is the main verb (MVB) in the second, copular, verb phrase (VP). The features ‘copular’ and ‘present’ percolate up to the verb phrase (VP) and host clause, i.e. the dependent clause.
- fascinating (ADJ(ingp)) is the head of an adjective phrase (AJP), which is a ‘predicative’ (prd) subject complement, i.e. it adds a predicate to the subject that.
- The short pause ‘<,>’ is also tagged and attached to the tree.
Note that some of the features are hidden in this view for reasons of space. The small triangle in the feature sector indicates that some are not shown.
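The structure described in the list above can be sketched as a small Python data structure in which every node holds a function, a category and a list of features. This is only an illustrative reconstruction from the prose, not the corpus file format: `Node` and `leaf` are hypothetical names, the labels SU, CS and AJHD and the feature ‘main’ are read off the label tables rather than stated for this tree, hidden features are left out, and the short pause is omitted.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass
class Node:
    function: str                       # e.g. "SU", "OD", "MVB"
    category: str                       # e.g. "NP", "CL", "V"
    features: List[str] = field(default_factory=list)
    children: List["Node"] = field(default_factory=list)
    word: Optional[str] = None          # set on leaf nodes only

    def label(self) -> str:
        """Render the node as 'FUNCTION,CATEGORY(features)'."""
        feats = "(" + ", ".join(self.features) + ")" if self.features else ""
        return f"{self.function},{self.category}{feats}"

    def leaves(self) -> Iterator["Node"]:
        """Yield the word-bearing leaf nodes in sentence order."""
        if self.word is not None:
            yield self
        for child in self.children:
            yield from child.leaves()

    def outline(self, depth: int = 0) -> str:
        """Return an indented, one-node-per-line view of the tree."""
        text = "  " * depth + self.label()
        if self.word is not None:
            text += f"  [{self.word}]"
        return "\n".join([text] + [c.outline(depth + 1) for c in self.children])


def leaf(function: str, category: str, features: List[str], word: str) -> Node:
    return Node(function, category, features, word=word)


# 'I think that's fascinating', reconstructed from the description above.
# Verb features (montr/pres, cop/pres) are repeated on the VP and host clause.
tree = Node("PU", "CL", ["main", "montr", "pres"], [
    Node("SU", "NP", [], [leaf("NPHD", "PRON", ["pers", "sing"], "I")]),
    Node("VB", "VP", ["montr", "pres"],
         [leaf("MVB", "V", ["montr", "pres"], "think")]),
    Node("OD", "CL", ["depend", "cop", "pres"], [
        Node("SU", "NP", [], [leaf("NPHD", "PRON", ["dem", "sing"], "that")]),
        Node("VB", "VP", ["cop", "pres"],
             [leaf("MVB", "V", ["cop", "pres", "encl"], "'s")]),
        Node("CS", "AJP", ["prd"], [leaf("AJHD", "ADJ", ["ingp"], "fascinating")]),
    ]),
])

print(tree.outline())
```

Reading the leaves back with `[n.word for n in tree.leaves()]` recovers the words in their original order, which is a quick sanity check that the tree has been built correctly.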
Functions, Categories and Wordclass labels
Type | Label | Meaning |
F | A | Adverbial |
WC | ADJ | Adjective |
WC | ADV | Adverb |
F | AJHD | Adjective Phrase Head |
C | AJP | Adjective Phrase |
F | AJPO | Adjective Phrase Postmodifier |
F | AJPR | Adjective Phrase Premodifier |
WC | ART | Article |
WC | AUX | Auxiliary Verb |
F | AVB | Auxiliary Verb (function) |
F | AVHD | Adverb Phrase Head |
C | AVP | Adverb Phrase |
F | AVPO | Adverb Phrase Postmodifier |
F | AVPR | Adverb Phrase Premodifier |
F | CF | Focus Complement (in a cleft construction) |
F | CJ | Conjoin |
C | CL | Clause |
WC | CLEFTIT | Cleft it |
F | CLOP | Cleft Operator |
F | CO | Object Complement |
WC | CONNEC | Connective |
F | COOR | Coordinator |
F | CS | Subject Complement |
F | CT | Transitive Complement |
F | DEFUNC | Detached Function |
F | DISMK | Discourse Marker |
C | DISP | Disparate (categories) |
F | DT | Determiner |
F | DTCE | Central Determiner |
C | DTP | Determiner Phrase |
F | DTPE | Predeterminer |
F | DTPO | Determiner Postmodifier |
F | DTPR | Determiner Premodifier |
F | DTPS | Postdeterminer |
F | ELE | Element (of a non-clause) |
C | EMPTY | Empty text unit |
F | EXOP | Existential operator |
WC | EXTHERE | Existential there |
F | FNPPO | Floating noun phrase postmodifier |
F | FOC | Focus (of a cleft construction) |
C | FRM | Formulaic Expression |
F | GENF | Genitive Function |
WC | GENM | Genitive Marker |
F | IMPOP | Imperative Operator |
F | INDET | Indeterminate |
WC | INTERJEC | Interjection |
F | INTOP | Interrogative Operator |
F | INVOP | Inverted Operator |
F | MVB | Main Verb |
WC | N | Noun |
WC | NADJ | Nominal Adjective |
C | NONCL | Non-clause |
F | NOOD | Notional Direct Object |
F | NOSU | Notional Subject |
C | NP | Noun Phrase |
F | NPHD | Noun Phrase Head |
F | NPPO | Noun Phrase Postmodifier |
F | NPPR | Noun Phrase Premodifier |
WC | NUM | Numeral |
F | OD | Direct Object |
F | OI | Indirect Object |
F | OP | Operator |
F | P | Prepositional (function) |
F | PARA | Parataxis |
F,WC | PAUSE | Pause |
F | PC | Prepositional Complement |
F | PMOD | Prepositional Modifier |
C | PP | Prepositional Phrase |
F | PREDEL | Predicate Element |
C | PREDGP | Predicate Group |
WC | PREP | Preposition |
F | PROD | Provisional Direct Object |
WC | PROFM | Proform |
WC | PRON | Pronoun |
F | PRSU | Provisional Subject |
WC | PRTCL | Particle |
F | PS | Stranded Preposition |
F | PU | Parsing Unit |
F,WC | PUNC | Punctuation |
WC | REACT | Reaction Signal |
F | SBHD | Subordinator Phrase Head |
F | SBMO | Subordinator Phrase Modifier |
F | SU | Subject |
F | SUB | Subordinator |
C | SUBP | Subordinator Phrase |
F | TAGQ | Tag Question |
F | TO | Particle to |
WC | UNTAG | Unassignable Wordclass Tag |
WC | V | Verb |
F | VB | Verbal (function) |
C | VP | Verb Phrase |
WC | ? | Untranscribable |
Key: ‘F’ = Function, ‘C’ = Category, and ‘WC’ = Wordclass Category.
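For quick look-ups, the table can be held as a simple mapping from label to type and meaning. The Python sketch below copies only a handful of rows from the table, and the names `KINDS`, `LABELS` and `describe` are hypothetical; extend the mapping with further rows as needed.

```python
# Expansion of the type codes used in the key.
KINDS = {"F": "Function", "C": "Category", "WC": "Wordclass Category"}

# A small excerpt of the table: label -> (type code(s), meaning).
LABELS = {
    "PU": ("F", "Parsing Unit"),
    "SU": ("F", "Subject"),
    "OD": ("F", "Direct Object"),
    "VB": ("F", "Verbal (function)"),
    "MVB": ("F", "Main Verb"),
    "NPHD": ("F", "Noun Phrase Head"),
    "CL": ("C", "Clause"),
    "NP": ("C", "Noun Phrase"),
    "VP": ("C", "Verb Phrase"),
    "PRON": ("WC", "Pronoun"),
    "V": ("WC", "Verb"),
    "ADJ": ("WC", "Adjective"),
    "PAUSE": ("F,WC", "Pause"),   # listed as both a function and a wordclass
}


def describe(label: str) -> str:
    """Return a one-line gloss for a label, e.g. 'OD: Direct Object (Function)'."""
    if label not in LABELS:
        return f"{label}: not in this excerpt"
    kinds, meaning = LABELS[label]
    kind_names = " / ".join(KINDS[kind] for kind in kinds.split(","))
    return f"{label}: {meaning} ({kind_names})"


for label in ["PU", "OD", "CL", "PAUSE"]:
    print(describe(label))
```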
Features
Feature | Meaning |
add | additive (adverb) |
antit | anticipatory it |
appos | appositive |
ass | assertive (pronoun) |
attrd | deferred attributive (adjective) |
attribute | attribute |
attru | unmarked attributive (adjective) |
card | cardinal (numeral) |
cbrack | closing bracket |
cleft | cleft construction |
col | colon |
com | common (noun) |
comma | comma |
comment | comment |
comp | comparative (adjective, adverb) |
conjoin | proform conjoin |
coord | coordinating (conjunction) |
coordn | coordination |
cop | copular |
cquo | closing quotation mark |
cxtr | complex transitive |
dash | dash |
def | definite (article) |
dem | demonstrative (pronoun) |
depend | dependent (clause) |
dimontr | dimonotransitive |
ditr | ditransitive |
do | auxiliary do |
edp | -ed participle |
ellip | ellipsis mark (punctuation) |
ellipt | elliptical |
encl | enclitic |
excl | exclusive (adverb) |
exclam | exclamative |
exist | existential |
exm | exclamation mark |
extod | extraposed direct object |
extsu | extraposed subject |
for | particle for |
frac | fraction |
ge | general (adjective, adverb) |
genv | genitive |
hyph | hyphenated (numeral) |
imp | imperative |
incomp | incomplete |
indef | indefinite (article) |
indrel | independent relative |
infin | infinitive |
ingp | -ing participle |
inten | intensifying (adverb) |
inter | interrogative |
intr | intransitive |
inv | inverted |
laugh | laughter |
let | let auxiliary |
long | long pause |
main | main (clause) |
modal | modal (auxiliary) |
montr | monotransitive |
mult | multiplier (numeral) |
neg | negative (pronoun) |
nom | nominal relative (pronoun) |
nonass | nonassertive (pronoun) |
obrack | opening bracket |
one | pronoun one |
-op | without operator |
oquo | opening quotation mark |
ord | ordinal (numeral) |
other | other punctuation |
partic | particularizer (adverb) |
pass | passive |
past | past (tense) |
per | period (full stop) |
perf | perfective auxiliary |
pers | personal (pronoun) |
phras | phrasal (adverb, preposition) |
plu | plural |
poss | possessive (pronoun) |
prd | predicative |
preco | preposed object complement |
precs | preposed subject complement |
preod | preposed direct object |
preoi | preposed indirect object |
prepc | preposed prepositional complement |
pres | present (tense) |
presu | preposed subject |
procl | proclitic |
prog | progressive (auxiliary) |
prop | proper (noun) |
pushdn | pushdown |
qm | question mark |
quant | quantifier (pronoun) |
recip | reciprocal (pronoun) |
red | reduced (clause) |
ref | reflexive (pronoun) |
reference | reference |
rel | relative (adverb, clause, pronoun) |
scol | semi-colon |
semi | semi-auxiliary |
semip | semi-auxiliary followed by an -ing participle |
short | short pause |
sing | singular |
so | proform so |
-su | without subject |
sub | subordinate (clause) |
subjun | subjunctive |
subord | subordinating (conjunction) |
sup | superlative (adjective, adverb) |
to | particle to |
trans | transitive |
univ | universal (pronoun) |
-v | without verb |
voc | vocative |
vocal | vocalising |
wh- | wh- (adverb) |
with | particle with |
zrel | zero relative |
zsub | zero subordinate |
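The feature list can be used in the same way to spell out the features on a tag. The following Python sketch covers only the features that occur in the example sentence I think that's fascinating; `FEATURES` and `gloss_features` are hypothetical names, and any code missing from the excerpt is flagged rather than guessed.

```python
# Excerpt of the feature list: feature code -> meaning.
FEATURES = {
    "cop": "copular",
    "dem": "demonstrative (pronoun)",
    "encl": "enclitic",
    "ingp": "-ing participle",
    "montr": "monotransitive",
    "pers": "personal (pronoun)",
    "pres": "present (tense)",
    "sing": "singular",
}


def gloss_features(codes):
    """Expand feature codes in order, marking any that are not in the excerpt."""
    return [FEATURES.get(code, f"{code} (not in this excerpt)") for code in codes]


# The features of V(cop, pres, encl), the tag on the enclitic 's.
print(gloss_features(["cop", "pres", "encl"]))
```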