Download the ICE-GB R1 Sample Corpus

This Sample is from Release 1 of ICE-GB and is supplied with Version 3.0 of ICECUP.

Note that this package has been superseded by ICECUP 3.1 and ICE-GB Release 2. The new download package is available here.

Other Frequently Asked Questions (including solutions to many download problems)

The ICE-GB Sample Corpus is available for download NOW.

It comes complete with 10 texts selected by Gerry Nelson from the ICE-GB Corpus, and the state-of-the-art ICECUP III software written by Sean Wallis.

The Sample Corpus comes in two flavours:

  • The Minimum sampler without Help (for easy downloading).
  • The Complete sampler, with the main Help file and Getting Started tutorial.

WHAT IS IN THE SAMPLE CORPUS PACKAGE? 

  • Ten texts (over 20,000 words), fully parsed and annotated, exactly as they are in ICE-GB.
  • The latest release of ICECUP III. This is a full working version of the software (see below).
  • Example Fuzzy Tree Fragments.
  • Option: Full help files (around 3MB extra).

The sample contains the following ten texts, shown in the last column. You can view these texts and their classification when you download and install the software. The complete ICE structure is visible from ICECUP’s Corpus Map.

Text category (number of texts)             Sample text(s)

Spoken Texts (300)
  Dialogues (180)
    Private (100)                           S1A-010, S1A-094
      face-to-face conversations (90)
      phonecalls (10)
    Public (80)
      classroom lessons (20)
      broadcast discussions (20)
      broadcast interviews (10)
      parliamentary debates (10)
      legal cross-examinations (10)
      business transactions (10)
  Monologues (100)
    Unscripted (70)                         S2A-011
      spontaneous commentaries (20)
      unscripted speeches (30)
      demonstrations (10)
      legal presentations (10)
    Scripted (30)                           S2B-026
      broadcast talks (20)
      non-broadcast speeches (10)
  Mixed (20)                                S2B-002
      broadcast news (20)
Written Texts (200)
  Non-printed (50)
    Non-professional writing (20)           W1A-001
      untimed student essays (10)
      student examination scripts (10)
    Correspondence (30)                     W1B-001
      social letters (15)
      business letters (15)
  Printed (150)
    Academic writing (40)                   W2A-005
      humanities (10)
      social sciences (10)
      natural sciences (10)
      technology (10)
    Non-academic writing (40)
      humanities (10)
      social sciences (10)
      natural sciences (10)
      technology (10)
    Reportage (20)                          W2C-009
      press news reports (20)
    Instructional writing (20)              W2D-018
      administrative / regulatory (10)
      skills / hobbies (10)
    Persuasive writing (10)
      press editorials (10)
    Creative writing (20)
      novels / stories (20)
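
If you want to keep track of the sample texts outside ICECUP, the listing above can be written down as a small lookup table. The following Python sketch is only an illustration and is not part of the ICECUP package: the category paths are transcribed from the listing, while the names SAMPLE_TEXTS, medium_of and count_by_medium are hypothetical, and the reading of the first letter of a text code as S = spoken, W = written is an assumption drawn from the codes shown.

```python
from collections import Counter

# (text code, category path) pairs transcribed from the listing above.
SAMPLE_TEXTS = [
    ("S1A-010", "Spoken/Dialogues/Private"),
    ("S1A-094", "Spoken/Dialogues/Private"),
    ("S2A-011", "Spoken/Monologues/Unscripted"),
    ("S2B-026", "Spoken/Monologues/Scripted"),
    ("S2B-002", "Spoken/Mixed"),
    ("W1A-001", "Written/Non-printed/Non-professional writing"),
    ("W1B-001", "Written/Non-printed/Correspondence"),
    ("W2A-005", "Written/Printed/Academic writing"),
    ("W2C-009", "Written/Printed/Reportage"),
    ("W2D-018", "Written/Printed/Instructional writing"),
]


def medium_of(code):
    """Return 'Spoken' or 'Written' from the first letter of a text code.

    Assumption: codes beginning with 'S' are spoken, 'W' written.
    """
    return {"S": "Spoken", "W": "Written"}[code[0]]


def count_by_medium(texts):
    """Count how many sample texts fall under each medium."""
    return Counter(medium_of(code) for code, _ in texts)


if __name__ == "__main__":
    for medium, n in sorted(count_by_medium(SAMPLE_TEXTS).items()):
        print(medium, n)
```

The sampler is evenly split between the two media, whereas the full corpus has 300 spoken and 200 written texts.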

WHAT IS NOT INCLUDED?

Release 1.0 of ICE-GB is supplied on CD-ROM. ICE-GB contains five hundred texts of spoken and written contemporary British English. To obtain the other 490 texts, you must order the CD-ROM. If you want to do this, click here.

WHAT DO I NEED TO RUN ICECUP III? 

ICECUP runs on PCs under Windows 3.1 and above. It has been tested exhaustively on Windows 3.1, 95 and 98. Owing to the nature of the program, we recommend a fast processor and a fast hard disk, although these are not essential. ICECUP will run on any stand-alone or networked PC, from a 386 running Windows 3.1 with 8MB of memory upwards.

Sampler system requirements

There are two install packages, with hard disk capacity requirements as follows:

Process         Minimum                   Complete (incl. help)
To download:    1.2MB (<1 HD floppy)      3.5MB
To install:     11MB                      20MB
To run:         4.8MB                     8.1MB
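
If you would like to check the install figures against your own disk before downloading, a rough test can be scripted. The Python sketch below is an illustration only and is not supplied with the sampler: the names INSTALL_MB and has_room are hypothetical, and it assumes that the figures above are megabytes of 1024 × 1024 bytes.

```python
import shutil

MB = 1024 * 1024  # assumption: one megabyte = 1024 * 1024 bytes

# Hard disk space needed to install each package, from the table above.
INSTALL_MB = {"minimum": 11, "complete": 20}


def has_room(free_bytes, package):
    """Return True if free_bytes is enough to install the named package."""
    return free_bytes >= INSTALL_MB[package] * MB


if __name__ == "__main__":
    # Free space on the drive holding the current directory.
    free = shutil.disk_usage(".").free
    for package in INSTALL_MB:
        verdict = "fits" if has_room(free, package) else "does not fit"
        print(package, verdict)
```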

We have tested the software extensively on the platforms that we have access to, and we have seen it run successfully on others. The most up-to-date list is shown below.

Software platform           Status
Windows 3.1/WfW3.11         Heavily tested, in constant use
Windows 95/98               Heavily tested, in constant use
Windows NT4/WTS/2000        Tested OK (note Font problem on NT4)
Windows Vista               Tested OK
OS/2                        Not tested
Mac/PPC Windows emulator    Tested OK

Feedback: Please tell us if you try to run ICECUP on platforms other than PCs running Windows 95/98/ME or Windows 3.1. We want to know about your software problems, to help you and other end users. Since we are supplying software free, and “as is” (see the licence agreement) with a Sample Corpus taken from ICE-GB, we think that it is only fair that you give us some feedback.

If ICECUP runs brilliantly on Windows XP or crashes dismally on a Mac, tell us. Email us at the Survey (s.wallis@ucl.ac.uk) so that we can (a) try to solve any outstanding problems and (b) tell others about them.

System requirements for the ICE-GB corpus (CD-ROM)

These differ from the above only in terms of hard disk space. You need 83MB to install the entire corpus. Note that you can also run searches directly from the CD without installing anything.

The software is identical to that supplied with the sample corpus. You can therefore ‘try before you buy’. If in doubt, install the sample corpus and software before ordering the CD.

email: s.wallis@ucl.ac.uk

WHAT CAN ICECUP DO?

ICECUP is a corpus exploration tool designed for syntactically parsed corpora. It allows you to experiment with, and explore, the corpus. ICECUP has a number of facilities to enable you to do this.

  • The corpus map depicts the structure of the corpus and its texts from the top down.
  • The text browser allows you to browse the text and the results of queries, reveal and hide annotation, and perform concordancing.
  • The tree viewer allows you to see the full parse analysis of the corpus.

But that’s only the beginning...

There are a number of sophisticated query systems for searching the corpus, including

  • Markup queries
  • Exact and inexact grammatical node queries
  • Text fragment queries
  • Fuzzy Tree Fragment queries
  • Sociolinguistic variable queries
  • Random sampling

These queries may be combined using Drag and Drop logic.
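
To give a feel for what combining queries with logic means, the idea can be sketched with ordinary set operations. This is an illustration only: it does not describe how ICECUP stores or evaluates queries, and the result sets, the identifiers in them and the function names are all invented for the example. Each query is assumed to return the set of text units that it matches.

```python
# Hypothetical results of two queries: sets of invented text-unit identifiers.
fragment_hits = {"unit-01", "unit-02", "unit-05"}
variable_hits = {"unit-02", "unit-03", "unit-05", "unit-08"}


def both(a, b):
    """AND: units matched by both queries."""
    return a & b


def either(a, b):
    """OR: units matched by at least one query."""
    return a | b


def excluding(a, b):
    """AND NOT: units matched by the first query but not the second."""
    return a - b


if __name__ == "__main__":
    print(sorted(both(fragment_hits, variable_hits)))
    print(sorted(either(fragment_hits, variable_hits)))
    print(sorted(excluding(fragment_hits, variable_hits)))
```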

WILL ICECUP CONTINUE TO BE DEVELOPED?

Yes. ICECUP is being developed under the auspices of the ESRC Corpus Queries project. We will continue to develop ICECUP at least until the end of January 1999 and new versions of ICECUP, with the sampler, will be freely available from this site. We suggest that you bookmark this page and watch this space.

This means that if you buy the ICE-GB CD-ROM now, you will be able to upgrade to later versions of ICECUP at cost price. We will email users of the CD-ROM to let them know when a new version is available.

Version 3.0 of ICECUP is available to download from this section of the website. The new ICECUP 3.1 is available in beta form from here.

FEEDBACK

As mentioned above, we would like feedback on technical problems and successes encountered running ICECUP on platforms that we have not been able to test ourselves.

PREPARE TO DOWNLOAD!

Download ICECUP 3.0 and the ICE-GB Sample Corpus by clicking here

This page last modified 12 June, 2013 by Survey Web Administrator.