More information


The ICE-GB R2 Sample Corpus is available for download NOW.

It comes complete with 10 texts selected by Gerry Nelson from the ICE-GB Corpus, and the state-of-the-art ICECUP 3.1.1 software written by Sean Wallis.

The Sample Corpus comes in two flavours:

  • The Text sampler, equivalent to the old 'complete' sampler.
  • The Text+Audio sampler.

WHAT IS IN THE SAMPLE CORPUS PACKAGE?

  • Ten texts (over 20,000 words), fully parsed and annotated, exactly as they are in ICE-GB.
  • The latest release of ICECUP 3.1. This is a full working version of the software (see below) complete with help.
  • Example Fuzzy Tree Fragments.
  • Optional: audio for the five spoken texts.

The sample contains the following ten texts, shown in the last column. You can view these texts and their classification when you download and install the software. The complete ICE structure is visible from ICECUP’s Corpus Map.

Category                                        Sample texts

Spoken Texts (300)
  Dialogues (180)
    Private (100)                               S1A-010, S1A-094
      face-to-face conversations (90)
      phonecalls (10)
    Public (80)
      classroom lessons (20)
      broadcast discussions (20)
      broadcast interviews (10)
      parliamentary debates (10)
      legal cross-examinations (10)
      business transactions (10)
  Monologues (100)
    Unscripted (70)                             S2A-011
      spontaneous commentaries (20)
      unscripted speeches (30)
      demonstrations (10)
      legal presentations (10)
    Scripted (30)                               S2B-026
      broadcast talks (20)
      non-broadcast speeches (10)
  Mixed (20)
    broadcast news (20)                         S2B-002

Written Texts (200)
  Non-printed (50)
    Non-professional writing (20)               W1A-001
      untimed student essays (10)
      student examination scripts (10)
    Correspondence (30)                         W1B-001
      social letters (15)
      business letters (15)
  Printed (150)
    Academic writing (40)                       W2A-005
      humanities (10)
      social sciences (10)
      natural sciences (10)
      technology (10)
    Non-academic writing (40)
      humanities (10)
      social sciences (10)
      natural sciences (10)
      technology (10)
    Reportage (20)
      press news reports (20)                   W2C-009
    Instructional writing (20)                  W2D-018
      administrative / regulatory (10)
      skills / hobbies (10)
    Persuasive writing (10)
      press editorials (10)
    Creative writing (20)
      novels / stories (20)

WHAT IS NOT INCLUDED?

Release 2 of ICE-GB is supplied on CD-ROM. ICE-GB contains five hundred texts of spoken and written contemporary British English. To obtain the other 490 texts, you must order the CD-ROM.

WHAT DO I NEED TO RUN ICECUP 3.1? 

The latest version of ICECUP, ICECUP 3.1.1, runs on 32-bit and 64-bit Windows, from Windows XP to Windows 10. You can upgrade to the most recent version for free.

Older versions of ICECUP run on 32-bit Windows, from Windows 3.1 upwards.

Sampler system requirements

There are two install packages, with hard disk capacity requirements as follows:

Process         Text      Audio
To download:    3.5 MB    96 MB
To install:     21 MB     210 MB
To run:         8.1 MB    104 MB
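
The figures above can be checked against the free space on a drive before downloading. The following is only a minimal sketch in Python and is not part of the ICECUP distribution: the names `REQUIREMENTS_MB`, `required_mb` and `has_enough_space` are hypothetical, and it assumes that 1 MB means 1024 × 1024 bytes and that each figure is checked on its own rather than added together.

```python
import shutil

# Sizes in megabytes, copied from the sampler requirements table above.
REQUIREMENTS_MB = {
    "text": {"download": 3.5, "install": 21, "run": 8.1},
    "audio": {"download": 96, "install": 210, "run": 104},
}


def required_mb(package: str, stage: str) -> float:
    """Return the disk space in MB listed for a package and stage."""
    return REQUIREMENTS_MB[package][stage]


def has_enough_space(free_bytes: int, package: str, stage: str) -> bool:
    """Check whether free_bytes covers the listed requirement.

    Assumption: 1 MB = 1024 * 1024 bytes (the page does not say which
    convention it uses).
    """
    return free_bytes >= required_mb(package, stage) * 1024 * 1024


if __name__ == "__main__":
    # Free space on the drive that holds the current directory.
    free = shutil.disk_usage(".").free
    for package in REQUIREMENTS_MB:
        ok = has_enough_space(free, package, "install")
        print(f"{package}: {'enough' if ok else 'not enough'} space to install")
```

Run as a script, this reports whether the current drive has room to install each sampler. To check a different drive, pass its path to `shutil.disk_usage`.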

We have tested the software extensively on the platforms we have access to, and we have seen it working correctly on others. The most up-to-date list is shown below.

System requirements for the ICE-GB corpus (CD-ROM)

These differ from the above only in terms of hard disk space. You need 96 MB to install the entire corpus. Note that you can also run searches directly from the CD without installing anything.

The software is identical to that supplied with the sample corpus. You can therefore ‘try before you buy’. If in doubt, install the sample corpus and software before ordering the CD.

email: s.wallis@ucl.ac.uk


WILL ICECUP CONTINUE TO BE DEVELOPED?

Yes. ICECUP 3.1.1 is simply the latest release of the software. We have developed and maintained ICECUP for two decades.


COMMENTS, SUGGESTIONS, QUERIES...

The Resources section of this website has sections on Fuzzy Tree Fragments and an explanation of how to carry out experiments with parsed corpora.

If you want to ask the author a question, you can email Sean Wallis directly.

This page last modified 25 August, 2017 by Survey Web Administrator.