Apache

tool: Solr

Purpose: 

Solr is an open source enterprise search platform from the Apache Lucene project. It operates as a standalone full-text search server within an appropriate servlet container, such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language.

Features: 

• May be tailored to many types of application with minimal programming knowledge
• Extensive plug-in support
• Full-text indexing and search

A&H use case 1 description: 
The “British Cartoon Archive Digitisation (BCAD)” project has used Solr to deliver the search results and metadata.
Creator: 
CNET Networks
Publisher: 
Apache Software Foundation
Software/programming languages used: 
Suite: 
Data structuring and enhancement: 
Alternate tool(s): 

Sphynx

Licence: 
lifecycleStage: 
Platform: 

tool: Lucene

Purpose: 

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.

Features: 

• Scalable, high-performance indexing
• Powerful, accurate and efficient search algorithms
• Cross-platform solution

A&H use case 1 description: 
The “Freeze Frame – Historic Polar Images 1845-1960” project has used Lucene for advanced search of photographs from both Arctic and Antarctic expeditions.
Creator: 
Doug Cutting
Publisher: 
Apache Software Foundation
Software/programming languages used: 
Suite: 
Data structuring and enhancement: 
Alternate tool(s): 

InQuira, Verity, dtSearch, ISYS

Licence: 
lifecycleStage: 
Platform: 

tool: MMax2

Purpose: 

MMax2 is a text annotation tool for creating and visualizing annotations. It has advanced and customizable methods for information and relation visualization.

Features: 

• Determination of the word class / part of speech (POS) for words in a text
• Determination of word senses, including the disambiguation of homonymous and polysemous words
• Detection of anaphoric expressions and identification of their antecedents

A&H use case 1 description: 
The MMax2 annotation tool has been used by the Embedded GeoCrossWalk project to perform annotation of the Stormont Papers resource.
Publisher: 
EML Research gGmbH
Creator: 
Christoph Muller
Software/programming languages used: 
Data capture: 
Communication and collaboration: 
Suite: 
Data publishing and dissemination: 
Strategy and project management: 
Practice-led research: 
Alternate tool(s): 

NXT (NITE XML), Annotate

Licence: 
lifecycleStage: 
Purpose: 

An open source version control system that is used to manage multiple versions of data objects in a controlled environment. Access and revision to objects are carefully controlled, to prevent unauthorised access and alteration.

Features: 

• Version support for all files and directories in repository, enabling administrator to track changes that have occurred and obtain earlier versions of files
• Free-form versioned metadata
• Atomic commits
• Merge tracking
• File locking
• Preservation of file attributes, such as executable and read/write status
• Apache network server support with WebDAV/DeltaV protocol
• Write-through proxy using WebDAV
• Interactive conflict resolution
• Repository read-only mirroring

A&H use case 1 description: 
The Citation and Allusion project has used Subversion to document different versions of files as the work on the project proceeded.
Creator: 
CollabNet
Publisher: 
Apache Software Foundation
Strategy and project management: 
Software/programming languages used: 
Alternate tool(s): 

CVS

Licence: 
lifecycleStage: 

tool: Xalan

Purpose: 

Xalan is an XSLT processor for transforming XML documents into HTML, text, or other XML document types. It implements XSL Transformations (XSLT) Version 1.0 and XML Path Language (XPath) Version 1.0.

Features: 
  • Conversion between structured markup formats
  • Stylesheet validation
A&H use case 1 description: 
The Tibetan text project used Xalan to convert Classical Tibetan texts stored as XML into other formats for publication.
A&H use case 2 description: 
The Anglo-Norman On-line Hub has used Xalan to convert XML tagged copies of the revised Anglo-Norman Dictionary between XML and HTML.
Creator: 
Apache Software Foundation
Publisher: 
Apache Software Foundation
Data publishing and dissemination: 
Data analysis: 
Data structuring and enhancement: 
Software/programming languages used: 
Licence: 
Purpose: 

An architectural framework upon which a digital asset management (DAM) may be built. Fedora Commons (Flexible Extensible Digital Object Repository Architecture) is widely used in the academic sector as a basis for the creation of an digital library, archive, or repository system. Fedora provides a set of software tools to ingest, manage and deliver digital objects. Fedora Objects may contain many types of information, including descriptive, technical, preservation and relationship metadata.

Features: 
  • Granular object management via implementer-defined content models
  • Data available via Web APIs (REST/SOAP)
  • RDF search (SPARQL)
  • Rebuilder Utility (for disaster recovery and data migration)
  • OAI-PMH Provider Service
  • GSearch (fulltext) Search Service
A&H use case 1 description: 
The East London Theatre Archive (ELTA) implemented a Fedora-based digital repository to store and deliver digitised copies of playbills, press cuttings, photographs and other material held by the V&A Theatre Collection and various East London theatres.
Creator: 
Cornell University, University of Virginia and several third-parties
Publisher: 
Duraspace
Data analysis: 
Data structuring and enhancement: 
Data publishing and dissemination: 
lifecycleStage: 
Strategy and project management: 
Alternate tool(s): 

EPrints.org, DSpace

Software/programming languages used: 
Platform: 
Licence: