UCLDH Seminar with Jean Bauer



The UCL Centre for Digital Humanities contributes to and holds a variety of events.

Recurring events include the UCLDH Seminar and the Susan Hockey Lecture series. Our events are primarily advertised right here on this page, which is syndicated in an RSS feed, but also on our DH Blog, on Twitter, and via our mailing list


Freedom to constrain: Introducing TEI customisation

Start: Oct 12, 2016 5:30:00 PM
End: Oct 12, 2016 6:30:00 PM

The Guidelines of the Text Encoding Initiative have become a de facto standard for those undertaking historical textual research projects in Digital Humanities. These Guidelines provide in-depth recommendations for standard methods of the representation of text in digital form. The international membership consortium that supports the maintenance of the Guidelines and related software includes institutions, projects, and individuals from many different disciplines inside and outside the traditional humanities fields. As a freely-available, open source, community-based standard, the TEI tries to cater for as many needs as possible through the generalisation of the representation of textual phenomena and through a comprehensive method of customisation of the schemas users rely on to validate their texts.  This literate programming method of writing a meta-schema, a TEI ODD customisation, containing both documentation and machine-readable descriptions of how one’s project differs from the TEI as a whole, is one of the great strengths of the TEI. It enables projects not only to produce a schema that is right for their project, and provide localised project-specific documentation, but also to record their use of the TEI framework for archival documentation for long-term preservation.

In this talk, Dr James Cummings will introduce the customisation mechanisms of the TEI on a general level (no previous TEI experience is required), before investigating some of the benefits and drawbacks of these methods. He will investigate issues with community development, interoperability, software development, analysis, and collaboration with respect to the use of the TEI. The investigation will be supported by examples drawn from Cummings’ extensive experience with research and institutional projects using the the TEI for a wide variety of purposes.


Dr James Cummings is the Senior Academic Research Technology Specialist at the University of Oxford’s IT Services. Dr Cummings helps academics plan research projects with digital aspects, and is the founding Director of the annual week-long Digital Humanities at Oxford Summer School. He has been an elected member of the TEI Technical Council for over a decade and often manages projects related to text encoding. He completed a PhD in medieval drama at the University of Leeds, and also holds an MA in Medieval Studies from the University of Leeds, and a BA in Medieval Studies from the University of Toronto. His PhD investigates the archival survival of information concerning the performance of drama in medieval culture. Dr Cummings acts a liaison between developers and researchers, sometimes taking on principal investigator or project management roles within digital humanities projects. Occasionally he still gets to do some active coding and development on work-packages related to XML, markup, schemas and digital editions.

All welcome and there will be drinks and discussion after the talk. Please note that registration is required.

Information Extraction and Semantic Annotation

Start: Oct 19, 2016 5:30:00 PM
End: Oct 19, 2016 6:30:00 PM

Archaeological reports contain a great deal of information that conveys facts and findings in different ways. This kind of information is highly relevant to the research and analysis of archaeological evidence but at the same time can be a hindrance for the accurate indexing of documents. Information Extraction as a Natural Language Processing can unlock and surface such information by analysing a textual input and producing a structured textual output that is suitable for further manipulation. In this process Semantic Annotation links ontological definitions to natural language text   by   providing   class   information   for   textual   instances. Described as a mediator platform between concepts and their worded representations, semantic annotation as metadata can automate the identification of concepts and their relationships in documents. It is proposed as a mechanism for connecting natural language and formal conceptual structures to enable new information access methods and to enhance existing ones.   The annotation process enriches documents and enables access on the basis of a conceptual structure. This aids information retrieval from heterogeneous data sources, empowering users to search across resources for entities and relations instead of words.