From Archive to Researcher: a Generic Tool Set

Project start date: 2001-10 Project end date: 2005-02
There were two major aims in the LEADERS (Linking EAD to Electronically Retrievable Sources) project: 1.To carry out research on the needs of users of archive and records collections 2.To develop a toolset to deliver finding aids, transcriptions and digital images of archives over the Internet The Internet makes it possible for finding aids (metadata about archives) to be accessed electronically, most often using the Encoded Archival Description (EAD), a standard for electronic finding aids. However, few archives at present deliver electronic images or transcriptions of the material and researchers must therefore travel to the collections to use them. Within the humanities computing community more advanced XML-based methodologies for encoding and delivering full text representations of historical documents have been developed in the Text Encoding Initiative (TEI) project, but the TEI is relatively weak on metadata structures and at the start of our project had not been used together with the EAD in an integrated fashion. A toolset must obviously provide the functions that users need, but we found little research on these needs. We aimed to investigate the needs of users in different kinds of archives and to build up a picture of what searches they wanted to carry out. These functions could then be built into the LEADERS toolkit. This aspect of the project was addressed by questionnaires in six archive and record offices and by three focus groups trialling the pilot software
Methods usedCategory
2d Scanning and photographyData capture
Coding and standardisationData structuring and enhancement
Text encoding - descriptiveData structuring and enhancement
Text encoding - referentialData structuring and enhancement
Record linkagesData analysis
Searching and queryingData analysis
Manual input and transcriptionData capture
Funding sources: 
Arts and Humanities Research Council (AHRC)
Content types created: 
Dataset/structured data
Software tools used: 
Java, Apache Tomcat, Cocoon, Lucene, Web services
Source material used:  
As the principal output of this project is software, this section is not relevant. A small number of records from two collections at University College London, the George Orwell Archive and the College Archive, are used in the demonstrator software. These are intended to show the functionality of the software and encoding scheme and are not sufficient to function as a digital resource on their own.
Digital resource created:  
The digital resource in this project is software for delivering archival material, not the material itself. The basic screens in the LEADERS software show the documents which have been retrieved via a search request accompanied by the relevant portions of the accompanying metadata. The user can choose to view a transcription or image of the document and there are various options for metadata display. A demonstrator available at http://ptolemy.bookmarc.pt:8083/leaderswsclient and using records from two collections at University College London illustrates some of the functions, but the toolset can be reconfigured to suit the needs of different users. For example, it may be used where there are only images and no transcriptions, or to derive standalone documents with TEI headers. The implementer can choose which elements are indexed and therefore searchable and which options are provided for the metadata display. LEADERS is fully documented and is available as open source software via http://leaders.sourceforge.net/. The LEADERS toolset builds on existing standards which all use the Extensible Markup Language (XML). These are the Encoded Archival Description (EAD) for finding aids, the Text Encoding Initiative (TEI) for transcriptions of textual material, the Encoded Archival Context (EAC) for contextual information and NISO Mix for the technical characteristics of digital images. These standards are integrated within the LEADERS schema which uses the EAD as the overall shell
Data Formats created: 
Hypertext Markup Language (HTML), Extensible Markup Language (XML) TEI-compliant
Generation of HTML from XML. LEADERS makes extensive use of XSLT for displays.
Metadata standards employed: 
Encoded Archival Description (EAD), Text Encoding Initiative (TEI), NISO Metadata for Images in XML (MIX), Encoded Archival Context (EAC)
Publications:  
Sexton, Anna and Chris Turner. "Expanding the Role of EAD: Providing Adequate Metadata for Digital As Well As Original Archive Documents". Vine 32:4 (2002): 71-80.

Sexton, Anna, Chris Turner, Geoffrey Yeo and Susan Hockey. “TEI, EAD, and Integrated User Access to Archives: Towards a Generic Toolset”. In DRH 2001 and 2002 Selected Papers from the Digital Resources for the Humanities Conferences 2001 and 2002, edited by Jean Anderson, Alistair Dunning and Michael Fraser. London: Office for Humanities Communication, 2003.

Sexton, Anna, Chris Turner, Geoffrey Yeo and Susan Hockey. “Understanding Users: a Prerequisite for Developing New Technologies”. Journal of the Society of Archivists 25 (2004): 33-49.

Sexton, Anna, Chris Turner, Geoffrey Yeo and Susan Hockey. “User Feedback: Testing the LEADERS Demonstrator Application”. Journal of the Society of Archivists 25 (2004): 189-208.

Institutions affiliated with this project: 

UK HE institutions involved:
University College London

Project staff and expertise: 

Principal staff member:Professor Susan Hockey
Other staff:Postdoctoral researcher(s) / Research assistant(s)
External expertise:


Metadata on this arts-humanities.net record
Author(s) of recordSusan Hockey
TitleFrom Archive to Researcher: a Generic Tool Set
Record created2005-11-07
Record updated2011-05-11 16:09
URL of recordhttp://www.arts-humanities.net/node/2053
Citation of recordSusan Hockey: From Archive to Researcher: a Generic Tool Set.
<http://www.arts-humanities.net/node/2053>
created: 2005-11-07, last updated 2011-05-11 16:09