Progress Report No. 3: 25 June - 4 November 2002

Anna Sexton

25 October 2002

1. Staffing

From 27 May 2002 to 6 October 2002 Chris Turner was employed on a part-time (60%) contract. On 7 October 2002 Chris's contract was changed to full-time, and he will continue to be employed on this basis until the end of the project.

2. Research and Development

2.1 Categorisation of archive users

In the draft paper "Towards an Analysis of User Needs" (Version 14, 20 December 2001), the team put forward a model for categorising or segmenting archive users. This model forms the basis of a questionnaire that the team are using in a survey of archive users across different repositories. The questionnaire survey will provide a means of establishing a profile of the typical archive user. This profile will then act as a basis for selecting a sample of users who can provide more detailed information about their needs and feedback on our work.

Initially the team felt that the in-depth surveying of archive users should be carried out across three archive repositories in the UK. However, the team sought advice from experts within the market research field, who suggested that the surveying needed to be conducted in at least six different repositories. These repositories should be representative of the diversity in types of archives and should be spread across the UK.

The team contacted a number of repositories to ask for participation in the user survey. In doing so, the team were looking to encourage participation from the following repository types:

* National Archives

The final list of participants enlisted is:

* The Public Record Office (National Archive)

It has proved difficult to find any business archives that are willing to participate in the survey. Those that were invited to participate felt that their external user numbers were so insignificant that surveying them would prove useless.
In a business archive the users are mainly internal members of staff, and much of the archival research is conducted by the archivists on the users' behalf and reported back by telephone or email. The closest the team have come to enlisting a business archive is the involvement of the University of Glasgow Archive Service, who maintain and administer their own institutional archive as well as pro-actively collecting business archives from other institutions and corporations.

The team had planned to carry out all the surveying over a two-month period running from 7 October 2002 to 6 December 2002. However, this time frame clashed with the annual Survey of Visitors to British Archives run by the Public Services Quality Group (PSQG). Many of the repositories that expressed an interest in participating in the LEADERS survey were also involved with the PSQG survey, and some felt unable to run two user surveys at the same time. As a result, the LEADERS survey has had to be divided into two phases. In the first phase, which is running from October to December, the following repositories are being surveyed:

* The Public Record Office

The Public Record Office (PRO) have incorporated the LEADERS questions into the survey that they give to new readers. The PRO have offered to input and analyse the answers to the LEADERS questions as part of their larger survey. LEADERS is extremely grateful to the PRO for their enthusiasm, support and practical assistance.

The other repositories that have been enlisted will be surveyed in a two-month period from January 2003 (exact dates still to be arranged).

2.2 Building the demonstrator application

LEADERS are developing a model system that will serve as a demonstrator to show what can be produced from the encoded materials and the LEADERS toolset. The demonstrator will incorporate basic search and retrieval functions and alternative presentations to show the possibilities of TEI/EAD encoded resources.
The application will be used to gather feedback from users, which will guide further design and development. The demonstrator is being developed on the Microsoft .NET framework using ASP.NET and, if necessary, C#. We are in discussion with Bookmarc, an organisation based in Portugal which has developed bibliographic applications using similar techniques derived from XML files. We have a direct contact with Bookmarc via Maira Ines Cordeiro, a PhD student at SLAIS.

Digitisation of the selected samples for the demonstrator is now complete. The team opted to employ UCL's Photographic Service to carry out the work. The originals have been copied on a high-resolution camera, which has produced a file of under 18 MB per image. The photography complies with the Association of Photographers' guidelines for the supply of digital images, which stipulate that the images are saved in an uncompressed TIFF format with an embedded [Adobe (1998)] colour profile. All the pictures are neutralised to a Kodak greyscale and no editing at all has been carried out on the original high-resolution files. The high-resolution files will act as the archive from which all surrogate images for the prototype will derive. The images have been supplied to us on a CD in ISO-9660 format. We will be using the NISO Metadata for Images in XML (NISO MIX) Schema to record image metadata, as recommended by VADS and HEDS. Rosamund Cummings has been given a CD of the UCL material and Gill Furlong has been given a copy of the Orwell material for their own use.

2.3 The use of TEI markup for textual representations of archival documents

The LEADERS team are working towards producing a subset of the TEI which can be used to encode a wide range of archival material.
In the research conducted so far, we have found that the TEI contains a number of tags that can deal with many of the commonly occurring features of archival material, such as complex additions, deletions and gaps in the text, and changes in the hand, style or character of the writing. Most of these tags can be invoked through the use of the additional TEI tagset for the Transcription of Primary Sources. Much valuable work has been done by others to set out the encoding options available for dealing with such features, and this work will undoubtedly act as a foundation for some of the models and rules developed by LEADERS.

However, we have also identified the need to build models and rules that can deal with structures and features such as overlaid data, textual and numerical data presented in complex tables, and the presence of formulae and mathematical expressions within the text. Data within archive documents can be described as overlaid when an underlying layer of data is used as the basic structure onto which further data (one or more other layers) is applied. The underlying layer of data is usually printed, and the overlaying layers are usually handwritten on top of the printed structure. Such structures and features are often found in archival material (particularly administrative archives), but the TEI's current encoding scheme will need to be developed if they are to be comprehensively dealt with. The team are looking at a variety of examples of these structures and features across a range of documents from the UCL Archive. Rules and models for encoding are currently being formulated and tested by the team.

2.4 Overlaps between TEI and EAD

The team have spent time identifying areas of overlap between the metadata provided by the EAD encoding framework and that provided by TEI. This research has shown that overlaps occur in relation to metadata that:

* Identifies, locates and gives details about the creation of the original object

The team have identified that solutions to these overlaps, and our final integration method, must be capable of:

* Avoiding repetition of information.

3. Text Encoding Initiative (TEI) Consortium Membership

In August/September 2002, SLAIS supported the project in becoming a paid member of the TEI Consortium. As the deliverables from the project involve adaptation and development of TEI, consortium membership is vital as it entitles the project to:
4. Dissemination

4.1 Conferences and meetings

Susan Hockey and Anna Sexton attended the Association for Literary and Linguistic Computing and the Association for Computing in the Humanities (ALLC/ACH) Conference, which was held from 24-28 July 2002 at Tübingen University, Germany.

Chris Turner and Anna Sexton delivered the team's paper, entitled "TEI, EAD and Integrated User Access to Archives: Towards a Generic Toolset", at the Digital Resources in the Humanities (DRH) Conference, which took place at Edinburgh University from 8-11 September 2002. The full paper has been submitted to be considered for publication in the forthcoming conference proceedings.

Chris Turner attended the Society of Archivists Annual Conference in Jersey from 1-4 October 2002 and delivered a paper entitled "Love Match or Shotgun Wedding?: Archivists and IT Vendors". Chris's paper was not officially delivered in his capacity as LEADERS Project Manager, but he did make references to our work within his talk, which have proved useful in the promotion and dissemination of the project.

Susan Hockey and Chris Turner attended the TEI Consortium's Annual Meeting on 12 October 2002. Susan was invited to deliver the opening keynote paper, which she entitled "Markup, TEI, Digital Libraries & Humanities Scholarship", and Chris gave an introduction to the LEADERS Project in the Reports from Members section of the meeting.

The team have submitted a proposal for running a session (three related papers) at the Society of Archivists Annual Conference 2003, which will be held next September in Southampton, as well as a proposal to run a special focus session at the Society of American Archivists Annual Conference 2003, which will be held on 18-24 August in Los Angeles, USA. The team are also working on a submission for the ALLC/ACH 2003 Conference, which will take place from 29 May to 2 June in Athens, Georgia, USA.

4.2 Talks

Chris Turner and Anna Sexton have been invited to give a talk on LEADERS at the Society of Archivists EAD/Data Exchange Group Meeting on 14 November 2002.

The team have also been invited by Mark Greengrass to speak
at the Humanities Research Institute at the University of Sheffield. The exact date is still to be arranged.

4.3 Website

The LEADERS website (http://www.ucl.ac.uk/leaders-project) was officially launched on 22 July 2002. The website was last updated on 9 October 2002, when the references page was substantially updated and PowerPoint slides from the talk delivered at DRH were added to the site. Some general statistics indicating website usage (analysed from 22 July to 20 October) are given below:

Table 1: Monthly breakdown of requests for pages
Table 2: Domains accessing website (domains with at least 100 requests are listed)
Table 3: List of top 20 organisations accessing website (ordered by number of requests, highest first)
4.4 Leaflets

The team have had 1000 leaflets printed for the project. The leaflets provide general information about the project with an introduction to our aims, objectives, deliverables and research questions alongside contact details for the project team.

5. Training