The ICE-GB Corpus
The British Component of the International Corpus of English

ICE-GB is the British Component of the International Corpus of English. Published by the Survey of English Usage, it contains over one million words of fully-parsed written and spoken English c.1990-9

The DCPSE Corpus
The Diachronic Corpus of Present Day Spoken English

DCPSE is the Diachronic Corpus of Present-day Spoken English. Published by the Survey of English Usage, it contains over 800,000 words of fully-parsed spoken English from 1957 to 1993.

ICECUP 3.1

ICECUP 3.1 is a state-of-the-art corpus exploration program designed for parsed corpora such as ICE-GB and DCPSE. It is distributed free of charge with our corpora and includes extensive online help.

The TOSCA/ICE Grammar

The TOSCA/ICE Grammar is the detailed grammatical framework used throughout our parsed corpora. It is based on Quirk et al.’s (1985) A Comprehensive Grammar of the English Language.

Fuzzy Tree Fragments

Fuzzy Tree Fragments (FTFs) are structured grammatical queries designed for searching parsed corpora. FTFs were initially developed in the Corpus Query project and are implemented in ICECUP.

Statistics Resources

A range of statistics resources developed by Sean Wallis in the course of his research. See also his corp.ling.stats blog.