

Ace

ACE (Auditing Control Environment) is a system that incorporates a new methodology to address the integrity of long-term archives using rigorous cryptographic techniques. ACE continuously audits the contents of the various objects according to the policy set by the archive, and provides mechanisms for an independent third-party auditor to certify the integrity of any object. ACE consists of two components. The first, the Audit Manager (AM), checks files locally to ensure they have not been compromised. The second, the Integrity Management Service (IMS), issues tokens that the AM can use to verify that its local store of file digests has not been tampered with.
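
The local audit described above can be illustrated with a minimal Python sketch. It is not ACE's actual implementation or API: the function names (`file_digest`, `build_manifest`, `manifest_token`, `audit`) and the choice of SHA-256 are assumptions made for illustration, and the single manifest digest only stands in for the tokens an IMS would issue.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            h.update(chunk)
    return h.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Record a digest for every file under root (relative path -> digest)."""
    return {
        p.relative_to(root).as_posix(): file_digest(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def manifest_token(manifest: dict[str, str]) -> str:
    """Digest over the whole manifest (hypothetical stand-in for an IMS token)."""
    h = hashlib.sha256()
    for name in sorted(manifest):
        h.update(f"{name}\0{manifest[name]}\n".encode("utf-8"))
    return h.hexdigest()


def audit(root: Path, manifest: dict[str, str]) -> list[str]:
    """Return the relative paths that are missing or whose digest has changed."""
    failures = []
    for name, expected in manifest.items():
        target = root / name
        if not target.is_file() or file_digest(target) != expected:
            failures.append(name)
    return failures
```

In this sketch, a periodic job would call `audit` against the stored manifest, while a copy of `manifest_token` kept by an independent party makes it possible to notice if the manifest itself was altered.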

Preserving

— University of Maryland Institute for Advanced Computer Studies —

https://wiki.umiacs.umd.edu/adapt/index.php/Ace:Main
https://wiki.umiacs.umd.edu/adapt/index.php/Ace:Audit_Manager_Installation_Guide

Digital Preservation



License
Open

Last activity
2014




Active_fedora

Rubydora and ActiveFedora provide a set of Ruby gems for creating and managing objects in the Fedora Repository Architecture (http://fedora-commons.org). ActiveFedora is loosely based on “ActiveRecord” in Rails. The 3.x series of ActiveFedora depends on Rails 3, specifically activemodel and activesupport.

Preserving

— Project Hydra —

Collection Management



License
Apache 2.0

Last release
5/2/2015




ActiveTriples

An ActiveModel-like interface for RDF data. Models graphs as Resources with property/attribute configuration, accessors, and other methods to support Linked Data in a Ruby/Rails environment.

This library was extracted from work on ActiveFedora. It is closely related to (and borrows some syntax from) Spira, but does some important things differently.

Modifying, Categorizing, Browsing

— ActiveTriples —

Linked Open Data



License
Apache 2.0

Last release
22/8/2014




Amara

Amara gives individuals, communities, and larger organizations the power to overcome accessibility and language barriers for online video. Amara is composed of three main parts: a subtitle creation and viewing tool (aka the widget), a collaborative subtitling website, and an open protocol for subtitle search/delivery.

Producing, Transcribing, Collaborating, Retrieving, Subtitling

— Participatory Culture Foundation —

http://www.amara.org/en/
https://github.com/pculture/unisubs

Media Applications



License
GNU GPL

Last release
2/6/4/2014




Amplifon: Sounds of Street View

Sounds of Street View is a digital explorative sound experiment by Amplifon which gives users a 3-dimensional sound experience in a Street View environment. Utilising the Web Audio API platform, sounds are designated as though they were ordinary Google Maps markers, but instead of an image and information being assigned to the marker, a sound is

Producing, Recording, Presenting

— Amplifon —

http://www.amplifon.co.uk/sounds-of-street-view/index.html
https://github.com/Amplifon/Sounds-of-Street-View-Framework

Exhibition Management, Geo-spatial applications, Media applications


Examples
http://www.amplifon.co.uk/sounds-of-street-view/index.html


License
MIT License






Annotator

The Annotator is an open-source JavaScript library and tool that can be added to any webpage to make it annotatable.

Annotating

— Open Knowledge Foundation —

http://okfnlabs.org/annotator/
https://github.com/okfn/annotator/

Data Annotation/Curation


Quality of Documentation
There is a well-structured GitHub repository with well-written documentation, including API docs. There is also a dedicated website that describes the tool and links to all the important resources, such as a tutorial, a live demonstration page where you can test the tool, and a plugin development page.

Ease of Adaptation
The extensibility of the tool is promoted on the main website. There is a link to a plugin development page containing a well written comprehensive tutorial.

Code Quality
The code is maintained in GitHub. CoffeeScript is used as the main language and all files are packaged in a logical structure. In general, the code includes extensive comments. The code is published for download as a minified JavaScript file. A file with the authors of the code is available, listing 20 contributors to the code.

Examples
http://okfnlabs.org/annotator/demo/


License
either MIT or GNU GPLv3

Last release
12/3/2013




Annotorious

Annotorious is a JavaScript annotation library for images and zoomable images. Link the Annotorious source files (CSS and JavaScript) into an existing HTML page, and images inside this page will be enhanced with an interactive drawing and commenting tool.

Annotating, Commenting, Visualizing

— Rainer Simon, Peter Pilgerstorfer, Paul Weichhart —

http://annotorious.github.io
https://github.com/annotorious/annotorious

Media Annotation/Tagging


Quality of Documentation
The main website is very informative, and its main menu includes links to a demo page, a getting started page, a plugin page, an API documentation page, and an about page.

Ease of Adaptation
There is a plugin development page available. In combination with a comprehensive GitHub page, it shouldn’t be too hard to develop plugins.

Code Quality
The code is maintained in GitHub. The JavaScript (using Google Closure) code seems clearly structured and includes comments. There is a useful page with information on how the code was built and can be adapted.

Examples
http://annotorious.github.io
http://crowdsourced.micropasts.org/app/phototaggingHorsfield/
http://ait-ngcms.github.io/annotorious-openskos-demo/


License
LGPL

Last release
v0.6 (14/08/2013)

Last activity
actively ongoing




Apache PDFBox

The Apache PDFBox™ library is an open source Java tool for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents.

Producing, Organizing, Data recognition, Annotating

— The Apache Software Foundation —

http://pdfbox.apache.org/
http://pdfbox.apache.org/downloads.html

Digital Asset Management, Collection Management


Quality of Documentation
The main website clearly lists a wealth of useful information about the tool, e.g. software dependencies, the architecture and its underlying ideas, API docs, and information on individually available command line tools.

Ease of Adaptation
The software does not offer an easy-to-use plugin structure.

Code Quality
The code is hosted in an SVN repository. The website explains how to build the software and how to create new branches for further development. Moreover, a page on coding conventions is linked from the main site.


License
Apache License, Version 2.0




Apache Stanbol

Apache Stanbol (currently in incubation) is an open source modular software stack and reusable set of components for semantic content management.

Organizing, Content Analysis, Data recognition

— Apache Software Foundation —

http://incubator.apache.org/stanbol/

Semantic Enrichment, Collection Management, Digital Asset Management


Quality of Documentation
The main website clearly lists various links with useful information, such as the ideas and architecture of the software, usage scenarios, and a comprehensive ‘getting started’ section.

Ease of Adaptation
There seems to be no direct plugin architecture; however, there is extensive documentation on development practices, which includes an overview of how to contribute your (Java) code to the SVN repository by following the Apache guidelines.

Code Quality
When browsing the source code, it seems well structured, and it incorporates Maven (pom.xml), which ensures the neat handling of dependencies and versioning. Moreover, it includes a benchmark directory, which indicates the system is well tested.

Examples
http://dev.iks-project.eu:8081/enhancer/chain/austrian-national-library


License
Apache License, Version 2.0

Last release
5/31/2012




Apache Tika

The Apache Tika toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.

Parsing, Data recognition

— Apache Software Foundation —

http://tika.apache.org/
http://tika.apache.org/download.html

Metadata Retrieval Services, Semantic Enrichment


Quality of Documentation
On the home page there are links to a dedicated wiki, an issue tracker, mailing lists and for each version of Tika there is documentation on how to get started (you have to download the sources and build them with Maven), API docs and more.

Ease of Adaptation
Tika has a plugin structure for parsers, which enables developers to extend Tika’s supported formats with their own. Developers are encouraged to use existing libraries to implement these custom parsers.

Code Quality
Given that this is an Apache project, and judging by the structure of the Java classes in the API docs, the quality of the code seems fine.


License
Apache License, Version 2.0

Last release
7/3/2013

Last activity
actively ongoing




Archivematica

Archivematica is a free and open-source digital preservation system that is designed to maintain standards-based, long-term access to collections of digital objects.

Preserving

— Artefactual Systems Inc —

https://www.archivematica.org/
https://github.com/artefactual/archivematica

Digital Preservation


Quality of Documentation
Excellent documentation, arranged by versions, divided between user and administration manual, well structured and informative, illustrated with screenshots that are aligned with the documentation text. Professionally created screencast gives a high level overview. Documentation hosted on Wiki.

Ease of Adaptation
Microservices allow adapting workflows and easily integrating new services. Scale out is possible by adding new processing nodes. Service oriented, separation between ingest, storage, and access services, all have well defined APIs.

Code Quality
Code is hosted on GitHub. High code quality (Python 89.7%, JavaScript 6.3%, Shell 3.2%, Other 0.9%), well structured, not many comments, no continuous integration, tests are available.

Examples
http://sankofa.archivematica.org (user name: demo@example.com, password: demodemo)


License
AGPL3 license

Last release
v1.5.0 (June 2016)

Last activity
actively ongoing




ArchivesSpace

A next-generation archives management application that will incorporate the best features of Archivist’s Toolkit (AT) and Archon. The project team is developing a technical platform, governance structure, and service model that will provide the archival community with a cutting-edge, extensible, and sustainable platform for describing analog and born-digital archival materials. The ArchivesSpace product is being developed using an Agile scrum process, guided by a Product Vision for ArchivesSpace.

Managing, Adding metainformation, Preserving

— Hudson Molonglo —

www.archivesspace.org
https://github.com/archivesspace/archivesspace/wiki

Collection Management


Quality of Documentation
Excellent documentation, well structured, providing many additional documentation resources. Extensive documentation index.

Ease of Adaptation
ArchivesSpace has a backend for the major workflows, a REST API, and many interfaces for adapting the system.

Code Quality
Code is hosted on GitHub. High code quality (Ruby 76.6%, XSLT 8.9%, JavaScript 7.7%, CSS 6.0%, Shell 0.8%), well structured, not many comments, no continuous integration, tests are available.


License
ECL 2.0

Last release
ArchivesSpace 1.0.7.1 (March 2014)

Last activity
on-going




Artoo.js

artoo.js is a piece of JavaScript code meant to be run in your browser’s console to provide you with some scraping utilities.

This nice droid is loaded into the JavaScript context of any webpage through a handy bookmarklet you can instantly install by dropping the above icon onto your bookmark bar.

Archiving, Extracting data

— Guillaume Plique —

http://medialab.github.io/artoo/
https://github.com/medialab/artoo

Metadata Retrieval Services



License
MIT License




ASV Toolbox

ASV Toolbox is a modular collection of tools for the exploration of written language data. They work either on word lists or text and solve several linguistic classification and clustering tasks. The topics covered contain language detection, POS-tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern based and statistical approaches. The collection can be used to work on large real world data sets as well as for studying the underlying algorithms. The ASV Toolbox can work on plain text files and connect to a MySQL database. While it is especially designed to work with corpora of the Leipzig Corpora Collection, it can easily be adapted to other sources.

Named entity recognition, POS Tagging, Extracting data

— University of Leipzig —

http://wortschatz.uni-leipzig.de/~cbiemann/software/toolbox/
http://wortschatz.uni-leipzig.de/~cbiemann/software/toolbox/

Collection Management, Ontology/Vocabulary Building and Management



License
MIT




Avalon Media System

The Avalon Media System is an open source system for managing large collections of digital audio and video files

Managing, Archiving

— Indiana University and Northwestern University —

http://www.avalonmediasystem.org
https://github.com/avalonmediasystem

Collection Management


Examples
http://www.avalonmediasystem.org/try-out-avalon


License
Apache 2.0

Last release
11/2013

Last activity
ongoing




BIBFRAME

Two tools designed to help you evaluate MARC Bibliographic data in the BIBFRAME model: a comparison service and a transformation service

Comparing, Modifying

— Library of Congress Network Development and MARC Standards Office —

http://bibframe.org/
https://github.com/lcnetdev/marc2bibframe

Metadata Checker/Validation, Metadata Mapping/Conversion/normalisation


Quality of Documentation
Documentation on the concepts behind BIBFRAME is quite extensive; however, documentation for developers seems to be limited to a not very descriptive README on GitHub.

Ease of Adaptation
There is no plugin structure mentioned, but the tool’s functionality (“convert MARC/XML Bibliographic records to BIBFRAME resources”) is fairly straightforward (there is an XQuery file for each format), so extending the tool for other types of bibliographic records should not be too hard.

Code Quality
The most important bits of code are written in XQuery. In addition, there is an exhibit app in HTML. Both parts seem clear and are not hard to comprehend. However, there is not much in the way of comments.

Examples
http://bibframe.org/demos/





Binarization and Conversion Tool

The BCT tool can be used to produce JPEG2000 or JPEG files of newspaper master images for presentation on the web. This tool calls two other tools, a binarization method from Basilis Gatos which is optimised for OCR, and Kakadu, a software development kit for creating JPEG2000 images. Therefore, in order to fully use the features of BCT, both tools must be installed and licensed. However, feel free to call your own tools from BCT.

Imaging, Compressing

— University of Innsbruck —

https://github.com/dea-uibk/BCT
https://github.com/dea-uibk/BCT

Publication, Image Search, Media Checker/Validation



License
MIT

Last release
10/12/2014




BitCurator Access

BitCurator Access software tools will assist collecting institutions (libraries, archives, and museums) in providing web-based and local access to born-digital materials held on disk images. BitCurator Access will focus on software that simplifies access to raw and forensically-packaged disk images, allowing collecting institutions to incorporate these objects into access environments in a manner that reflects the original order and relevant environmental context. The use of open source digital forensics software will allow for detailed analysis of file and file system provenance, quality and accessibility of files, metadata in files and the file system, and residual (non-file system) data contained within disk images.

Direct accessing, Analyzing, Access Management, Adding metainformation

— University of North Carolina at Chapel Hill —

http://access.bitcurator.net/index.php?title=Main_Page
https://github.com/kamwoods/bca-webtools

Collection Management



License
GPL v 3

Last release
30/12/2014




Blacklight

A Ruby on Rails (RoR) gem that provides a discovery interface for any Solr index.

Browsing

http://projectblacklight.org/
https://github.com/projectblacklight/blacklight

Search and Browsing


Quality of Documentation
The home page quickly and clearly gives an introduction to Blacklight and shows all the links needed to get started as a developer, e.g. to a quickstart guide and the GitHub repository. The overall documentation is available in a wiki and seems mostly well written and clear.

Ease of Adaptation
There is a dedicated section on how to contribute to Blacklight as well as a page that lists the (un)official add-ons that are available. With several examples and clear instructions, as well as a developer mailing list, it seems fairly straightforward to create custom extensions.

Code Quality
The Ruby code contains useful comments. The main app is nicely structured according to MVC and includes YAML config files. The main JavaScript function clearly explains the structure and rationale of the code; the aim is to use or extend jQuery plugins as much as possible. Overall the code seems to be of good quality.

Examples
https://github.com/projectblacklight/blacklight/wiki/Examples


License
Creative Commons Attribution-Share Alike 3.0 United States License.

Last release
6/25/2012




Bolt

Bolt is a tool for Content Management, which strives to be as simple and straightforward as possible. With this general user manual you will be able to use Bolt to produce and edit content for your website.

Managing

— Bob den Otter —

http://bolt.cm/
https://github.com/bolt/bolt

Collection Management



License
MIT

Last activity
15/5/2014




BRAT Rapid Annotation Tool

An online environment for collaborative text annotation, focused on structured annotation of text, e.g., tagging named entities such as persons and organizations, and their relationships.

Annotation

— University of Tokyo —

http://brat.nlplab.org/index.html
https://github.com/nlplab/brat

Data Annotation/Curation


Quality of Documentation
Extensive


License
MIT

Last release
22/11/2012

Last activity
1/7/2016




BWF MetaEdit Tool

This tool permits embedding, editing, and exporting of metadata in Broadcast WAVE Format (BWF) files. This tool can also enforce metadata guidelines developed by the Federal Agencies Audio-Visual Working Group, as well as recommendations and specifications from the European Broadcasting Union (EBU), Microsoft, and IBM.

Adding metainformation, Extracting metadata

— FADGI —

http://bwfmetaedit.sourceforge.net/
http://bwfmetaedit.sourceforge.net/Download/Source/

Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation, Digital Asset Management


Quality of Documentation
The documentation is quite technical and not very attractive, but just enough to make use of the tool. There does not seem to be a way to download the source code. The tool itself does work well (used by the reviewer).

Ease of Adaptation
There does not seem to be any way of getting or contributing to the source code.

Code Quality
Nothing can be said about the code quality because the code is unavailable.


Last release
3/1/2012




Carto

CARTO is an open, powerful, and intuitive platform for discovering and predicting the key insights underlying the location data in our world.

Imaging, Georeferencing

— CARTO —

https://carto.com/
https://github.com/CartoDB/cartodb

Geo-spatial Applications


Quality of Documentation
Extensive

Examples
https://carto.com/gallery/


License
https://github.com/CartoDB/cartodb/blob/master/LICENSE

Last release
6/6/2016




Catmandu

Catmandu provides a suite of Perl modules to ease the import, storage, retrieval, export and transformation of metadata records.

Preserving

— Nicolas Steenlant, Patrick Hochstenbach —

http://librecat.org/index.html
https://github.com/librecat/catmandu/

Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation, Collection Management


Quality of Documentation
The main site offers a brief introduction to the capabilities of Catmandu and an extensive tutorial. The GitHub code repository also contains developer documentation, but it needs to be generated after downloading (or you can find it by browsing the code repository). The developer documentation seems well written and quite extensive.

Ease of Adaptation
The code repository on GitHub also includes a page directed at contributors, but does not seem to describe any plugin architecture. Since the code seems professionally maintained and fairly well documented, adding customizations to the code seems worth considering.

Code Quality
The Perl code looks well structured, well written, and contains comments in the form of perldoc. There is a long list of tests available. Also the code is included in CPAN (a repository of Perl libraries).


License
GPL-2, or later

Last release
11/2013

Last activity
ongoing




Cesium

Cesium is a JavaScript library for creating 3D globes and 2D maps in a web browser without a plugin. It uses WebGL for hardware-accelerated graphics, and is cross-platform, cross-browser, and tuned for dynamic-data visualization.

Imaging

— AGI —

http://cesiumjs.org/index.html
https://github.com/AnalyticalGraphicsInc/cesium

Geo-spatial Applications


Examples
http://cesiumjs.org/demos.html


License
Apache 2.0

Last release
2/3/2015




ChicagoCodeX

An authoring and publishing environment for online catalogues with full scholarly apparatus

Authoring, Publishing

— The Art Institute of Chicago and the Indianapolis Museum of Art. —

http://www.oscitoolkit.org/
https://github.com/IMAmuseum/ChicagoCodeX

Exhibition Management, Collection Management


Examples
http://www.getty.edu/foundation/initiatives/current/osci/osci_browse_catalogues.html


License
GNU GPL




CKAN

CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data.

Managing, Publishing, Resource Sharing

— Open Knowledge Foundation —

http://ckan.org/
https://github.com/ckan/ckan

Collection Management


Quality of Documentation
The quality of the website and the available documentation is very good. The purpose and features, including the API, of CKAN are all documented very well.

Ease of Adaptation
There is dedicated documentation on the writing of extensions. Moreover there are 60 external extensions available, which can serve as an example. The code base is large, but the documentation is good, so writing these extensions shouldn’t be too hard.

Code Quality
The Python code looks good. There are not many comments, but for the important objects the comments give references to the API docs. The API docs are very detailed.

Examples
http://ckan.org/case-studies/


License
Affero GNU GPL v3.0

Last release
13/08/2013

Last activity
ongoing






Clipper Prototype 3

Clipper is a free, open-source web application enabling researchers to create and share virtual clips without altering the original media files. Clipper enables you to mark the start and end of interesting events while playing audio or video data files through a standard web browser. You can add rich text annotations to each clip, and combine clips into playlists (cliplists).
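
To make the idea of a virtual clip concrete, here is a minimal Python sketch of one possible data model. It is not Clipper's actual code or data format: the class and field names (`Clip`, `Cliplist`, `media_url`, `start`, `end`, `annotation`) are hypothetical, chosen only to show that a clip can be stored as a reference plus time offsets, leaving the media file untouched.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Clip:
    """A virtual clip: a pointer into a media file, not a copy of it."""
    media_url: str
    start: float  # offset in seconds where the clip begins
    end: float    # offset in seconds where the clip ends
    annotation: str = ""

    def __post_init__(self):
        # Reject clips with negative or inverted time ranges.
        if not 0 <= self.start < self.end:
            raise ValueError("clip needs 0 <= start < end")

    @property
    def duration(self) -> float:
        return self.end - self.start


@dataclass
class Cliplist:
    """An ordered playlist of virtual clips."""
    title: str
    clips: list[Clip] = field(default_factory=list)

    def total_duration(self) -> float:
        return sum(c.duration for c in self.clips)
```

Because each clip holds only a URL and two offsets, sharing a cliplist in this model means sharing a small piece of metadata rather than re-encoded media.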

Annotating

— The City of Glasgow College, The Open University and Reachwill Ltd —

http://blog.clippertube.com/index.php/clipper-prototype-3/
https://github.com/reachwill/clipper

Media Annotation/Tagging



Last release
18/11/2015

Last activity
ongoing




CMUSphinx

CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

Speech recognition

— Carnegie Mellon University —

http://cmusphinx.sourceforge.net/
http://sourceforge.net/projects/cmusphinx/

Enriching metadata, Speech Recognition



License
BSD license




CollectionSpace

CollectionSpace is an open-source collections management application that meets the needs of museums, historical societies, and other collection-holding organizations. CollectionSpace is designed to be configurable to each organization’s needs, serving as a gateway to digital and physical assets across an institution. The software is freely distributed via open-source licensing, and an active developer community ensures that CollectionSpace is continually improving.

Managing

— January 2014: Lyrasis is now the organizational home of CollectionSpace. (The project was initiated and led by Museum of the Moving Image. Transition team includes developers based at University of California, Berkeley and Fluid Project at Ontario College of Art and Design; and Jesse Martinez, Freelance developer and Service Provider.) Project Partners 2008-2013 included: University of California Berkeley, Fluid Project at Ontario College of Art and Design, and Centre for Applied Research in Educational Technologies (CARET) University of Cambridge. —

www.collectionspace.org
https://github.com/collectionspace/

Collection Management


Quality of Documentation
The home page clearly provides a link to the documentation, which is very thorough and covers the most relevant topics such as: system requirements, how to configure & install it, how to use and maintain it and also how to develop customizations. The overall quality of this documentation is fairly good.

Ease of Adaptation
CollectionSpace describes having a so-called hook system in place (as used in e.g. Drupal or WordPress), a design pattern for conveniently extending a system. Hook systems in general take some time to learn, and in this case specific documentation on the hooks is yet to be added (some limited examples are available though).

Code Quality
The different parts, namely the UI, services, application, tools, etc., are in different sections on GitHub, making the distinction clear. Each of these sections is sparsely documented; however, the code looks good, with JavaDoc and Maven pom.xml files.

Examples
www.demo.collectionspace.org


License
ECL 2.0

Last release
10/2013

Last activity
on-going




Collective Access

This is one of the more powerful open source CMS (collection management systems). It allows you to manage metadata in various formats, using metadata profile descriptions. Available profiles include VRAcore, EBUcore, LIDO (contributed by LIBIS), etc.

Managing, Presenting, Adding meta-information

— Whirl-i-Gig —

http://www.collectiveaccess.org/
https://github.com/collectiveaccess

Collection Management, Content Retrieval Services, Exhibition Management


Quality of Documentation
Extensive documentation available on Wiki, well structured. Demo, Installation instructions, Upgrade instructions

Ease of Adaptation
PHP web application, easily extensible. No standards. Documentation for adaptation and API documentation available.

Code Quality
Mainly PHP-based (PHP 86.7%, JavaScript 10.8%, CSS 1.9%, Other 0.6%). Code hosted on GitHub. Highly active, last commit 2 days old. Code is well written and documented. Separation into modules, clean code layout. Bugtracker available.

Examples
http://collectiveaccess.org/about/users


License
GNU GPL v.2

Last release
1/2012






ConservationSpace

The goal of ConservationSpace is to develop an open-source software application that will address a core need of the conservation community for a shared solution to the problem of documentation management. The conservation community has long recognized that a digital approach to managing its documentation would improve continuity in procedures, increase access, expand research opportunities, and better ensure the preservation of its documents.

Managing, Preserving

— Sirma Group (Sirma ITT and Ontotext). The project is led by the National Gallery of Art (Washington, D.C., USA) with participation from Yale University Special Collections Library, Courtauld Institute, British Museum, Indianapolis Museum of Art, Metropolitan Museum of Art, Statens Museum for Kunst, and the Denver Art Museum. —

www.conservationspace.org

Collection Management, Ontology/Vocabulary Building and Management


Quality of Documentation
Extensive project docs at https://www.wiztechno.com (user: guest, pwd: conservationspace)

Ease of Adaptation
See https://www.wiztechno.com/wiki/display/CSR2/System+Customization+and+Configuration+Requirements for specific requirements of project partners

Code Quality
The code is not yet publicly available.

Examples
https://v9tsig.dm2303.livefilestore.com/y2m3Y4yon2CsUhV3hBrzdgUB2DroiygcRNR_6t0g3DJmZdGUNiW8klxe0JDoukJZkLmGHuvKj_pFYZOyuneWBCNkgFBwQcgLVIwSbTMJPl4LifVaOIBK2PHDTkQVVVtyBescrMibeY5OYIccnGXZ2VyEA/1Sirma%20ITT%20-%20Conservation%20Space%20PR%2003-2015%20-%20screenshot.png





Cross-Platform Authentication - Authorization Provider

Hybrid media devices, which can deliver audio, video and interactive content over both broadcast and broadband, create new opportunities and challenges for broadcasters. Augmenting the broadcast experience with interactive content delivered over the Internet changes the classic one-to-many paradigm bringing it closer to a one-to-one relationship.
Cross-Platform Authentication (CPA) offers an open standard for associating any media device with an online identity, which facilitates delivery of personalized services to these devices.

Organizing

— EBU —

http://ebu.io/project/cpa
https://github.com/ebu/cpa-auth-provider

Media Applications



Last release
31/7/2014




CubicWeb

CubicWeb is a semantic web application framework, licensed under the LGPL, that empowers developers to efficiently build web applications by reusing components (called cubes) and following the well-known object-oriented design principles. Its main features are: an engine driven by the explicit data model of the application; a query language named RQL, similar to W3C’s SPARQL; a selection+view mechanism for semi-automatic XHTML/XML/JSON/text generation; a library of reusable components (data model and views) that fulfill common needs; the power and flexibility of the Python programming language; and the reliability of SQL databases, LDAP directories, Subversion and Mercurial for storage backends.

Web-developing

— LogiLab —

http://www.cubicweb.org/
https://docs.cubicweb.org/book/admin/setup

Collection Management, Semantic Extraction, Digital Asset Management






Cultural Enrichment Mashificator

Collaboration tools.

Collaborating

— Jeremy Ottevanger —

http://ccgi.ottevanger.plus.com/cgi-bin/ez/index.php

Content Retrieval Services, Metadata Retrieval Services


Quality of Documentation
Nice presentation of the idea together with a demo. There does not seem to be the intention to disseminate the code. No documentation available. Broken links.

Ease of Adaptation
PHP integration documented.

Code Quality
No code available


Last release
Unreleased




Dédalo: Intangible Heritage management and Oral History

Semantic RDF data sources

Annotating, Managing

— Juan Franciso Onielfa, Alejandro Peña —

http://www.fmomo.org
http://www.fmomo.org/dedalo/pg/download.php

Collection Management, Content Retrieval Services


Quality of Documentation
Documentation only available in Spanish, PDF documents.

Ease of Adaptation
No documentation available.

Code Quality
Web application (HTML, CSS, JavaScript, PHP, MySQL). Download only possible after registration, code not available.

Examples
http://dedalo.antropolis.net/dedalo/home/


License
GNU GPL v3

Last release
3.5.56

Last activity
July 2012




D3-cartogram

Continuous area cartograms with d3 and TopoJSON

Imaging

— Shawn Allen —

http://prag.ma/code/d3-cartogram/
https://github.com/shawnbot/d3-cartogram/

Geo-spatial Applications, Data Visualisation


Quality of Documentation
The main site briefly describes what the tool is and what dependencies it has. A prominent, but somewhat easy to overlook, link to GitHub is available. On GitHub there is a short readme, which mainly points to an example index.html.

Ease of Adaptation
The example HTML file includes quite a number of JavaScript functions that need to be examined in order to be able to understand how it all works. Besides this, the code is small and it shouldn’t be too difficult to adapt the code to use different (geographical) input.

Code Quality
There is not much code and it does look readable, but there are not many comments to help you. Because the code uses d3.js and geographical algorithms, it takes a while to get into.


License
MIT license




D3

D3.js is a JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG and CSS. D3’s emphasis on web standards gives you the full capabilities of modern browsers without tying yourself to a proprietary framework, combining powerful visualization components and a data-driven approach to DOM manipulation.

Managing, Visualizing

— Dustin Ewers —

http://d3js.org/
https://github.com/mbostock/d3/wiki

Data Visualization


Examples
http://bl.ocks.org/mbostock/4063318


License
BSD License

Last release
7/18/2014




dat

dat is an open source tool that enables the sharing of large datasets, allowing for a decentralized collaboration flow

Resource sharing

— Max Ogden —

http://dat-data.com/
https://github.com/maxogden/dat

Metadata Mapping/Conversion/normalisation, Collection Management


Quality of Documentation
The main page embeds a very informative YouTube video on a talk the creator gives about the tool. The documentation on GitHub is extensive, well written and even illustrated. It gets you started and enthuses you to contribute.

Ease of Adaptation
The code base is well documented and quite small. Adapting the system, e.g. to support new formats, should not be too hard.

Code Quality
The code is concise and straightforward to understand, and the project around it is well documented. The code itself, however, appears to contain no comments.


License
BSD License

Last release
still pre-alpha




Data Pipes

Data Pipes is a service to provide streaming, “pipe-like” data transformations on the web – things like deleting rows or columns, find and replace, head, grep etc.

Modifying, Categorizing, Browsing

— Open Knowledge Foundation —

http://datapipes.okfnlabs.org/
https://github.com/okfn/datapipes

Metadata Mapping/Conversion/normalisation


Quality of Documentation
The site starts off with a useful example of what the tool can do. Following the example, the API is detailed. At the bottom you can find a link to GitHub, which contains basic practical documentation on how to install and run the tool.

Ease of Adaptation
The documentation mentions that contributing is “super easy” and points you to a list of issues. Other than that, there does not seem to be a specific plugin architecture or obvious way to extend the tool.

Code Quality
The code base is small. The code looks well written. Occasionally there are comments.

Examples
http://datapipes.okfnlabs.org/csv/head%20-n%2050/cut%200/delete%201:7/grep%20-i%20London/html?url=https://raw.github.com/okfn/datapipes/master/test/data/gla.csv


License
MIT License
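
The example URL listed for Data Pipes suggests that each transformation becomes one path segment, followed by the output format and a `url` query parameter naming the source file. A minimal sketch of assembling such a URL, assuming that pattern holds in general (the helper name `build_pipeline_url` is hypothetical and not part of the service):

```python
from urllib.parse import quote

# Service address taken from the Data Pipes entry; everything else here
# is an illustrative assumption derived from the single example URL.
BASE_URL = "http://datapipes.okfnlabs.org"


def build_pipeline_url(steps, source_url, base_url=BASE_URL):
    """Join pipeline steps into a Data Pipes style URL.

    Each step (for example "head -n 50") becomes one path segment.
    Spaces are percent-encoded; ":" is kept so ranges such as "1:7"
    stay readable, as in the example URL.
    """
    path = "/".join(quote(step, safe=":") for step in steps)
    return f"{base_url}/{path}?url={source_url}"


steps = ["csv", "head -n 50", "cut 0", "delete 1:7", "grep -i London", "html"]
source = "https://raw.github.com/okfn/datapipes/master/test/data/gla.csv"
url = build_pipeline_url(steps, source)
print(url)
```

The source URL is appended unencoded because that is how it appears in the listed example; whether the service also accepts an encoded value is not stated in this entry.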




DBpedia Spotlight

DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text, providing a solution for linking unstructured information sources to the Linked Open Data cloud through DBpedia.

Annotating, Linking

— Pablo Mendes (Freie Universität Berlin), Jun 2010-present. Jo Daiber (Charles University in Prague), Mar 2011-present. Prof. Dr. Chris Bizer (Freie Universität Berlin), supervisor, Jun 2010-present. —

https://github.com/dbpedia-spotlight/dbpedia-spotlight
https://github.com/dbpedia-spotlight/dbpedia-spotlight

Linked Open Data, Data Annotation/Curation


Quality of Documentation
Well documented, short documentation on Github, further documentation on Wiki, well structured and written.

Ease of Adaptation
REST Webservices allow integration and adaption.

Code Quality
Java/Scala based project (Java 48.7% Scala 48.1% Shell 1.7% Python 1.5%), actively maintained, last commit days ago, continuous integration, well-structured, Maven-based build, tests available. Many branches and contributors.


License
Apache License, 2.0, LingPipe

Last release
10/2012




Dedupe

A Python library for accurate and scalable data deduplication and entity resolution.

Curating

— Gregg, Forest, and Derek Eder —

https://github.com/datamade/dedupe
https://github.com/datamade/dedupe

Collection Management


Quality of Documentation
Well documented and structured documentation.

Ease of Adaptation
Extensible.

Code Quality
Python based, hosted on GitHub, actively maintained, tests available, code documented, well written and structured.


License
MIT License

Last activity
2013




Delving Platform: CultureHub and SIP-Creator

Delving has developed and refined an open source platform with tools specifically tailored to the needs of the cultural heritage domain.

Preserving

— Manuel Bernhardt, Gerald de Jong, Eric van der Meulen, Sjoerd Siebinga, Thomas Wikman, Juliane Stiller —

http://delving.eu
https://github.com/delving

Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation, Metadata Retrieval Services


Quality of Documentation
Well documented and structured documentation.

Ease of Adaptation
Extensible.

Code Quality
Java/Scala based (Java 79.7% Scala 11.7% XSLT 5.1% JavaScript 3.1% Groovy 0.5%), hosted on GitHub, actively maintained, tests available, code documented, well written and structured.

Examples
http://www.digitalecollectienederland.nl/ http://samlinger.kulturnett.no/


License
EUPL, Apache 2.0

Last release
Monthly cycle

Last activity
Active now




digilib

digilib is a web-based client/server technology for images. The image content is processed on the fly by a Java Servlet on the server side so that only the visible portion of the image is sent to the web browser on the client side. It supports a wide range of image formats and viewing options on the server side while requiring only an internet browser with JavaScript and a low-bandwidth internet connection on the client side. digilib enables very detailed work on an image, as required by scientists, with elaborate zoom features such as an option to show images on the screen in their original size. digilib facilitates cooperation of scientists over the internet and novel uses of source material through image annotations and stable references that can be embedded in URLs. digilib is used in the ALCATRAZ framework of image, text and annotation tools and in many other projects.

Editing, Visualizing, Annotating

— Max Planck Institute for the History of Science · University of Bern —

http://digilib.sourceforge.net/
https://sourceforge.net/p/digilib/code/ci/default/tree https://github.com/robcast/digilib

Media Annotation/Tagging



License
GNU GPL

Last release
4/9/2009

Last activity
4/5/2015




Digital Preservation Software Platform (DPSP)

The DPSP is a collection of software applications which support the goal of digital preservation.

Preserving

— National Archives of Australia —

http://dpsp.sourceforge.net/
http://dpsp.sourceforge.net/download.php

Digital Preservation


Quality of Documentation
Clean presentation, documentation short and concise, documentation hosted on sourceforge.

Ease of Adaptation
Unknown

Code Quality
Builds hosted on sourceforge, no source code available.


License
GPLv3

Last release
7/31/2013




DigitalLabels

Django app to generate an interactive gallery label interface, optimised for touch screens and tablets.

Annotating

— V&A Digital Media team —

http://vanda.github.io/DigitalLabels/
http://vanda.github.io/DigitalLabels/

Exhibition Management


Quality of Documentation
Nice presentation, basic documentation, well written and structured, documentation hosted on GitHub Pages.

Ease of Adaptation
No APIs or documentation about interfaces available.

Code Quality
Python/JavaScript based (Python 65.2% JavaScript 34.8%), hosted on GitHub, not being actively maintained, not many comments in code, but readable.


License
BSD License




Digitate

Digitate is a free application designed for use on the iOS platform, specifically on iPad devices. The application allows scholars and enthusiasts with an interest in the visual and material elements of a cultural artefact to make notes and annotations directly on an image of such an artefact. For example, a literary scholar might use it to annotate the material or bibliographic elements of a rare text or first edition, while an art historian might do the same on an image of a painting.

Annotating

— OpenDHAT —

Media Annotation/Tagging


Examples
https://www.youtube.com/watch?v=69CKhuiak0Q


License
CC BY-SA

Last activity
1/2/2015




Diva.js

Diva.js (Document Image Viewer with AJAX) is a JavaScript frontend for viewing documents, designed to work with digital libraries to present multi-page documents as a single, continuous item. Only the pages that are being viewed at any given time are actually present in the document, with the rest appended as necessary, ensuring efficient memory usage and high loading speeds. Written as a jQuery plugin, diva.js requires the jQuery JavaScript library. Diva’s back end is the IIPImage server.

Modifying, Browsing

— Distributed Digital Music Archives and Libraries —

http://ddmal.github.io/diva.js/about/
https://github.com/DDMAL/diva.js

Publication, Exhibition Management



License
Open w/ attribution

Last release
3/6/2015

Last activity
10/6/2015




Doc Viewer

HTML5 document viewer

Browsing

— The New York Times —

https://github.com/documentcloud/document-viewer
https://github.com/documentcloud/document-viewer

Media Applications


Quality of Documentation
Clean documentation, well written and structured, easily comprehensible.

Ease of Adaptation
The project fulfills its specific purpose well; adaptation is easily possible by adapting the Ruby scripts.

Code Quality
Mainly a JavaScript project with some Ruby (JavaScript 73.0% CSS 26.9% Ruby 0.1%). Clean code, many comments support readability of code. Actively maintained, many contributors, code hosted on GitHub.


License
Apache License, v2.0

Last release
4/6/2010

Last activity
27/5/2015




DocSplit

Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages…)

Modifying

— Jeremy Ashkenas, DocumentCloud —

http://documentcloud.github.io/docsplit/
https://github.com/documentcloud/docsplit/

Collection Management


Quality of Documentation
Short and concise documentation, well written and structured, easily comprehensible.

Ease of Adaptation
The project fulfills its specific purpose well; adaptation is easily possible by adapting the Ruby scripts.

Code Quality
Pure Ruby project. Clean code, tests available, many comments support readability of code. Actively maintained, many contributors, code hosted on GitHub.


License
LGPL

Last release
17/11/2014

Last activity
5/2/2015




DSpace

DSpace open source software is a turnkey repository application

Preserving

— DSpace Foundation —

http://www.dspace.org/
https://github.com/DSpace/DSpace

Collection Management, Metadata Retrieval Services, Repository Software


Quality of Documentation
Very high quality software documentation, commercial support.

Ease of Adaptation
Complex product ecosystem.

Code Quality
Maven-based, highly modularized, tests available, clean code, mainly java, many comments explaining concepts and supporting readability. Good object oriented design.

Examples
demo.dspace.org


License
Open Source: http://www.dspace.org/license/

Last release
20/5/2015

Last activity
23/6/2015




EADitor

EADitor is an EAD (Encoded Archival Description) editor based on Orbeon XForms. It uses various external services (e.g. Getty AAT, TGN, ULAN) to access LOD for use in the descriptions. A companion tool is xEAC, for creating and managing EAC-CPF records (corporate bodies, persons, families).

Managing

— Ethan Gruber, American Numismatic Society —

http://eaditor.blogspot.com/
http://code.google.com/p/eaditor/

Collection Management, Metadata Retrieval Services


Quality of Documentation
The documentation available on GitHub reads a bit like an unstructured copy-and-paste text block. But there is a lot of information available, and blog posts explain the concepts of the software and give further examples.

Ease of Adaptation
REST or SOAP interfaces allow easy integration. No plugin mechanism.

Code Quality
CSS/XSLT/JavaScript-based project hosted on GitHub (CSS 62.3% XSLT 17.3% JavaScript 15.0% XProc 5.4%). One contributor (a highly active GitHub committer); the code is being actively maintained. Much of the XSLT is not commented but is readable.

Examples
http://numismatics.org/archives/


License
Apache License 2.0

Last release
.1112 beta (December 2011)

Last activity
June 2012




easyLOD

Simple framework for exposing Linked Open Data from arbitrary sources.

Presenting

— Mark Jordan —

https://github.com/mjordan/easyLOD
https://github.com/mjordan/easyLOD

Linked Open Data, Metadata Retrieval Services


Quality of Documentation
Information is available on the GitHub page: documentation and instructions for applying data source plugins, resource URIs, and mapping namespaces in resource URIs to data source plugins. It also provides information on installing and testing easyLOD. Instructions are also available within the code repository. Easy to access, read, and follow.

Ease of Adaptation
The GitHub page describes a plugin structure for converting different data sources (to LOD), so creating a plugin for custom content sources should be fairly straightforward. Moreover, the codebase is small and clear, so further adapting the tool should not be too difficult.

Code Quality
The Slim PHP micro framework is used, making the code and all configurations not hard to grasp. The code itself contains several useful comments.


License
MIT License






Echoprint

Internet design and authoring tools.

Web-developing, Authoring

— The Echonest in partnership with MusicBrainz —

http://echoprint.me/
https://github.com/echonest/

Content Retrieval Services, Metadata Retrieval Services


Quality of Documentation
The website is easy to navigate, although it is possible to get linked back to the massive Echonest webpage. Plenty of resources and tutorials help users get started and along the way. The link listed for the code repository on GitHub is wrong: it takes you to all the Echonest apps. This is the more specific link: https://github.com/echonest/echoprint-codegen.

Ease of Adaptation
Echoprint offers several useful services which can be used to build your own applications. Extending these services for your own purposes is most likely not needed, except for maybe solving bugs.

Code Quality
The code of most of the tools/services seems to be fairly well organized and commented.


License
The code generator is MIT and the server is Apache 2

Last activity
9/1/2015




ElasticSearch

ElasticSearch is a distributed RESTful search engine built for the cloud.

Seeking

— http://elasticsearch.com/about/team/ —

http://www.elasticsearch.org/
https://github.com/elasticsearch/elasticsearch

Search and Browsing, Content Retrieval Services, Metadata Retrieval Services, Collection Management


Quality of Documentation
Very user-friendly website with extremely thorough documentation. Training, development support, and production support are all available. GitHub page with notes and directions. The blog is updated regularly and is up to date. There are also very interesting and helpful case studies showing how groups have made use of ElasticSearch. One downside of the documentation is that it is sometimes hard to find concrete examples of everyday use cases.

Ease of Adaptation
ElasticSearch has many configuration options for building the search engine needed for your use case. Extending the tool itself is possible by contributing on GitHub, but is probably not something you would need to do.

Code Quality
The (mainly Java) code seems well documented and commented.


License
Apache License, Version 2.0

Last release
16/7/2015

Last activity
24/7/2015




ERMes

Electronic Resource Management (ERM) Software

Managing

— William Doering, Galadriel Chilton —

http://murphylibrary.uwlax.edu/erm/
murphylibrary.uwlax.edu/erm/

Metadata Retrieval Services


Quality of Documentation
Website not very clear and requires a lot of downloading. There is a semi-active forum for discussion and help and some user generated instructional videos have been provided. No readily visible documentation.

Ease of Adaptation
The code is only available as an Access 2007 file, which makes adapting the code not very easy for most contemporary developers.

Code Quality
Without Access 2007 the code could not be inspected by the reviewer.


License
Attribution Assurance License

Last release
5/1/2014








Europeananp-dbpedia-disambiguation

A simple Python library and web service that allows named entity disambiguation against a label database. The idea is to use a Solr query to filter possible candidates and then apply a more detailed analysis of string similarity, number of inlinks and entity type to select the “best” candidate. It contains code to handle (multilingual) DBpedia dumps and load them into a Solr backend. It also contains helper code for the annotation of ALTO 2.1 files that are used in the context of the Europeana Newspapers project.

Named Entity Recognition

— KBNL Research —

https://github.com/KBNLresearch/europeananp-dbpedia-disambiguation
https://github.com/KBNLresearch/europeananp-dbpedia-disambiguation

Metadata Retrieval Services, Media Annotation/Tagging



Last release
January 2015
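
The selection step described for this disambiguation library (score pre-filtered candidates by string similarity, number of inlinks and entity type) can be sketched roughly as follows. This is not the library's own code: the Solr query is replaced by an in-memory list, and the weights, inlink counts and function names are illustrative assumptions.

```python
import math
from difflib import SequenceMatcher


def score(mention, expected_type, candidate):
    """Combine the three signals named in the description into one number.

    The weights (2.0, 0.1, 1.0) are arbitrary illustrative choices.
    """
    similarity = SequenceMatcher(
        None, mention.lower(), candidate["label"].lower()
    ).ratio()
    popularity = math.log1p(candidate["inlinks"])  # dampen very large counts
    type_bonus = 1.0 if candidate["type"] == expected_type else 0.0
    return 2.0 * similarity + 0.1 * popularity + type_bonus


def best_candidate(mention, expected_type, candidates):
    """Return the highest-scoring candidate, or None if there are none."""
    if not candidates:
        return None
    return max(candidates, key=lambda c: score(mention, expected_type, c))


# Stand-in for the candidates a Solr query would return; the inlink
# counts are made up for the example.
candidates = [
    {"uri": "http://dbpedia.org/resource/Paris", "label": "Paris",
     "inlinks": 50000, "type": "Location"},
    {"uri": "http://dbpedia.org/resource/Paris_Hilton", "label": "Paris Hilton",
     "inlinks": 3000, "type": "Person"},
]
best = best_candidate("Paris", "Location", candidates)
print(best["uri"])
```

Taking the logarithm of the inlink count keeps a very popular but poorly matching entity from outweighing an exact label match.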




Europeananp-ner

This tool takes container documents (MPEG21-DIDL, METS), parses all references to ALTO files and tries to find named entities in the pages (with most models: Location, Person, Organisation, Misc). The aim is to keep the physical location on the page available through the whole process to be able to highlight the results in a viewer.

Parsing

— KB Research —

http://europeana-newspapers.eu/
https://github.com/KBNLresearch/europeananp-ner

Metadata Retrieval Services, Semantic Extraction


Examples
http://researchkb.wordpress.com/2014/03/03/ner-newspapers/


License
European Union Public License

Last release
v.0.0.2 March 2014
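
The aim stated for this NER tool, keeping the physical location on the page available so results can be highlighted in a viewer, relies on the coordinates that ALTO stores on each word. A minimal sketch of reading those coordinates, assuming the entity string has already been found by some recognizer (the sample document and helper names are invented for illustration and are not taken from the tool):

```python
import xml.etree.ElementTree as ET

# Tiny invented ALTO fragment; real files contain far more structure.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout><Page><PrintSpace><TextBlock><TextLine>
    <String CONTENT="Amsterdam" HPOS="120" VPOS="340" WIDTH="210" HEIGHT="40"/>
    <String CONTENT="is" HPOS="340" VPOS="340" WIDTH="30" HEIGHT="40"/>
  </TextLine></TextBlock></PrintSpace></Page></Layout>
</alto>"""


def words_with_boxes(alto_xml):
    """Return (text, (hpos, vpos, width, height)) for every String element."""
    root = ET.fromstring(alto_xml)
    words = []
    for element in root.iter():
        # Compare the local tag name so any ALTO namespace version matches.
        if element.tag.rsplit("}", 1)[-1] == "String":
            box = tuple(
                float(element.get(key, 0))
                for key in ("HPOS", "VPOS", "WIDTH", "HEIGHT")
            )
            words.append((element.get("CONTENT", ""), box))
    return words


def locate(entity_text, words):
    """Return the boxes of all words whose text equals the entity string."""
    return [box for text, box in words if text == entity_text]


words = words_with_boxes(SAMPLE_ALTO)
boxes = locate("Amsterdam", words)
print(boxes)
```

Multi-word entities would need the boxes of consecutive words to be merged, which this sketch leaves out.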






Evergreen

Evergreen is highly scalable software for libraries that helps library patrons find library materials, and helps libraries manage, catalog, and circulate those materials, no matter how large or complex the libraries.

Seeking, Managing, Resource sharing

— Various —

http://www.open-ils.org/
http://www.open-ils.org/downloads.php

Collection Management, Search and Browsing


Quality of Documentation
Very clear and extensive documentation. Updated regularly. There is a documentation interest group that meets periodically and the minutes are recorded. The meetings are open for anyone to attend. While the documentation is very clear and organized some may find its vastness overwhelming.

Ease of Adaptation
Evergreen is a huge collection of software packages. To adapt it means a lot of reading, installing and testing. However since the documentation is very thorough and extensive, it should be possible.

Code Quality
The reviewer did not inspect any code, because of the size of the codebase and the several different tools that are available.


License
GNU GPL

Last release
16/6/2015

Last activity
4/9/2011




ExifTool

ExifTool is a platform-independent Perl library plus a command-line application for reading, writing and editing meta information in a wide variety of files.

Adding Metainformation

— Phil Harvey —

http://www.sno.phy.queensu.ca/~phil/exiftool/
http://www.sno.phy.queensu.ca/~phil/exiftool/

Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation



License
GNU General Public License, v.1 or later

Last release
10.20 (June 13, 2016)




eXtensible Catalog

eXtensible Catalog is open source, user-centered, next generation discovery interface software for libraries and consortia. It comprises four software components that can be used independently to address a particular need or combined to provide an end-to-end discovery system to connect library users with resources.

Seeking

— eXtensible Catalog Organization · University of Rochester —

http://www.extensiblecatalog.org/
https://www.drupal.org/project/xc

Collection Management, Search and Browsing, Linked Open Data


Examples
http://catalog.lib.kyushu-u.ac.jp/en/xc/search


License
MIT License (the Java based Toolkits), GNU General Public License (Drupal Toolkit)

Last release
1/28/2014

Last activity
2/26/2014




Fedora Commons

Fedora (Flexible Extensible Digital Object Repository Architecture) was originally developed by researchers at Cornell University as an architecture for storing, managing, and accessing digital content in the form of digital objects inspired by the Kahn and Wilensky Framework. Fedora defines a set of abstractions for expressing digital objects, asserting relationships among digital objects, and linking “behaviors” (i.e., services) to digital objects.

Managing, Storing, Direct Accessing

— various researchers at Cornell University —

http://fedora-commons.org/
http://www.fedora-commons.org/software

Digital Asset Management, Content Retrieval Services, Collection Management


Quality of Documentation
Documentation is not visibly and clearly labeled, but when found it is very thorough and fairly organized. The documentation is hard to navigate and not as clear as some others. The project is ongoing and a new version will be coming out. Explanations about upgrading are provided. No visible community or forum space.

Examples
http://www.fedora-commons.org/about/examples


License
Apache License, Version 2.0.

Last release
15/11/2012

Last activity
actively ongoing




Fedora Migrate

Migrates content from a Fedora3 repository to a Fedora4 one.

Migrating

— Penn State —

Collection Management



License
MIT

Last release
19/12/2015




File Analyzer and Metadata Harvester

This application functions like a digitization Swiss army knife. The application allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories. The application can be used to generate checksum values to ensure the bit-level integrity of files after they have been copied to a new device. After a collection of files has been converted from one digital format to another, this application can verify that there is a one-to-one match of before and after files.

Digitizing

— NARA —

http://blogs.archives.gov/online-public-access/?p=6270
https://github.com/Georgetown-University-Libraries/File-Analyzer

Digital Preservation, Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation


Quality of Documentation
Documentation of the File Analyzer and Metadata Harvester has been thoroughly updated in the past few months. The tool's wiki is at https://github.com/Georgetown-University-Libraries/File-Analyzer/wiki and provides graphics as well as extensive documentation.


License
NARA OPEN SOURCE AGREEMENT VERSION 1.3

Last release
2011

Last activity
5/2014
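
The checksum use case described for the File Analyzer, confirming bit-level integrity after files are copied to a new device, can be illustrated with a short sketch. It uses only the Python standard library and does not reflect how the File Analyzer itself is implemented; the function names and the choice of SHA-256 are assumptions.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=1 << 20):
    """Hash a file in chunks so large files never sit fully in memory."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_manifest(root):
    """Map each file's path relative to root to its checksum."""
    root = Path(root)
    return {
        path.relative_to(root).as_posix(): file_checksum(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def changed_files(before, after):
    """List files that are missing, added, or differ between two manifests."""
    names = before.keys() | after.keys()
    return sorted(name for name in names if before.get(name) != after.get(name))


# Demonstration on throwaway data: copy a folder, then damage the copy.
with tempfile.TemporaryDirectory() as tmp:
    original = Path(tmp, "original")
    original.mkdir()
    (original / "scan_001.tif").write_bytes(b"image data")
    copy = Path(tmp, "copy")
    shutil.copytree(original, copy)
    intact = changed_files(checksum_manifest(original), checksum_manifest(copy))
    (copy / "scan_001.tif").write_bytes(b"corrupted")
    damaged = changed_files(checksum_manifest(original), checksum_manifest(copy))

print(intact, damaged)
```

Comparing whole manifests rather than single files also surfaces files that went missing during the copy, since a missing name compares unequal to any checksum.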




File_MARC

Parse, modify, and create MARC records

Parsing, Modifying

— Dan Scott —

http://pear.php.net/package/File_MARC/
http://pear.php.net/package/File_MARC/download

Collection Management


Quality of Documentation
The documentation for File_MARC is very clear and easy to find on the PEAR website. The documentation is complete and thorough but lacks the detail found in the documentation of some other tools.


License
GNU Lesser General Public License

Last release
10/30/2013




File Rename Tool (FRT)

The File Rename Tool (FRT): Newspapers obviously carry delivery dates, so they can be searched for by date later on. If a newspaper is not available in day folders, the ‘File Renaming Tool’ can help to bring it into the right structure and supports libraries in renaming and reordering their images according to the Europeana Newspapers project specifications. The main idea of FRT is that images, which may be stored at year level, can be quickly ordered on the basis of issue and publishing date.

Modifying, Managing

— University of Innsbruck —

http://dbis-halvar.uibk.ac.at/dokuwiki/doku.php?id=main:structify
https://github.com/dea-uibk/FRT

Content Retrieval Services, Digital Asset Management



License
MIT

Last release
13/12/2014
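
The FRT entry describes moving images stored at year level into a day-based structure. The actual naming rules of the Europeana Newspapers specification are not given in this entry, so the sketch below is purely hypothetical: it assumes each file name contains the publishing date as eight digits (YYYYMMDD) and only plans target paths without touching any files.

```python
import re
from pathlib import PurePosixPath

# Hypothetical convention: the publishing date appears as YYYYMMDD
# somewhere in the file name, e.g. "paper_19140801_0001.tif".
DATE_PATTERN = re.compile(r"(\d{4})(\d{2})(\d{2})")


def plan_day_folders(filenames):
    """Map each dated file name to a year/month/day target path.

    Files without a recognisable date are left out of the plan so they
    can be reviewed by hand.
    """
    plan = {}
    for name in filenames:
        match = DATE_PATTERN.search(name)
        if match is None:
            continue
        year, month, day = match.groups()
        plan[name] = str(PurePosixPath(year, month, day, name))
    return plan


plan = plan_day_folders(["paper_19140801_0001.tif", "cover.tif"])
print(plan)
```

Producing a plan first, rather than renaming immediately, lets a librarian check the proposed structure before any file is moved.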






Franken +

The Initiative for Digital Humanities, Media, and Culture (IDHMC) at Texas A&M University, as part of its Early Modern OCR Project (eMOP), has created a new tool called Franken+ that provides a way to create font training for the Tesseract OCR engine using page images. This is in contrast to Tesseract’s documented method of font training, which involves using a word processing program with a modern font. Franken+ works in conjunction with PRImA’s Aletheia tool and allows users to easily and quickly identify one or more idealized forms of each glyph found on a set of page images. These identified forms are then used to generate a set of Franken-page images matching the page characteristics documented in Tesseract’s training instructions but using a font used in an actual early modern printed document.

Imaging, Analysing

— Texas A&M/Bryan Tarpley —

http://dh-emopweb.tamu.edu/Franken+/
https://github.com/idhmc-tamu/FrankenPlus

Collection Management


Quality of Documentation
Good, webpage has full explanation


License
Apache 2.0

Last release
December 2013

Last activity
11/1/2014




Frog

Frog, formerly known as Tadpole, is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package. Most modules were created in the 1990s at the ILK Research Group (Tilburg University, the Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium). Over the years they have been integrated into a single text processing tool. More recently, a dependency parser, a base phrase chunker, and a named-entity recognizer module were added.

Processing, Parsing, Named entity recognition

— University of Tilburg —

http://ilk.uvt.nl/frog/
http://software.ticc.uvt.nl/

Semantic Extraction


Quality of Documentation
Poor. Covers installing and running, but 'we are in the process of writing a reference guide for Frog that explains all options in detail.'

Ease of Adaptation
Limited. No guidelines on how to add other languages.


License
GNU GPL

Last release
1/29/2015

Last activity
1/29/2015




FromThePage

FromThePage is an open-source tool that allows volunteers to collaborate to transcribe handwritten documents.

Collaborating, Transcribing

— Ben W. Brumfield —

http://beta.fromthepage.com/
https://github.com/benwbrum/fromthepage

Social Applications, Contextualisation


Quality of Documentation
The FromThePage home page provides very little documentation and the Github page also has very little documentation. The documentation that is there is clear and visible but severely lacking in detail.


License
GNU AGPL v3.0

Last release
6/1/2015

Last activity
6/1/2015




GATE (General architecture for text engineering)

GATE is over 15 years old and is in active use for all types of computational task involving human language. GATE solves problems concerning text analysis or human language processing.

Processing, Parsing, Named entity recognition

— Various —

http://gate.ac.uk/
http://gate.ac.uk/download/

Alignment Tools, Metadata Mapping/Conversion/normalisation, Media Checker/Validation


Quality of Documentation
Excellent

Examples
http://gate.ac.uk/demos/


License
GNU LESSER GENERAL PUBLIC LICENSE v.3.0

Last release
6/2/2015

Last activity
actively ongoing






Getty Vocabularies

The AAT, TGN, ULAN, and CONA contain structured terminology for art and other material culture, archival materials, visual surrogates, and bibliographic materials. Compliant with international standards, they provide authoritative information for catalogers and researchers, and can be used to enhance access to databases and Web sites.

Organizing

— Getty —

http://vocab.getty.edu/
http://vocab.getty.edu/ontology.html

Ontology/Vocabulary Building and Management, Linked Open Data


Quality of Documentation
Good


License
Open Data Commons Attribution License

Last release
v3.1

Last activity
6/5/2015




ggplot

ggplot2 is a plotting system for R, based on the grammar of graphics, which tries to take the good parts of base and lattice graphics and none of the bad parts. It takes care of many of the fiddly details that make plotting a hassle (like drawing legends) as well as providing a powerful model of graphics that makes it easy to produce complex multi-layered graphics.

Visualizing

— Hadley Wickham —

http://ggplot2.org/
https://github.com/hadley/ggplot2

Data Visualization


Quality of Documentation
OK


License
?

Last activity
10 minutes ago!




GIMP

GIMP is the GNU Image Manipulation Program. It is a freely distributed piece of software for such tasks as photo retouching, image composition, and image authoring.

Visualizing, Imaging

— GIMP Team —

http://www.gimp.org/
http://www.gimp.org/downloads/

Media Annotation/Tagging


Quality of Documentation
Good

Examples
http://www.gimp.org/screenshots/


License
GPL

Last release
v2.8.14

Last activity
6/4/2015




Global References Index to Biodiversity (GRIB)

It will be a tool to manage the taxonomic literature that is (a) already available in digital form, (b) in the process of being digitised, and (c) for which plans have been created for digitisation and to nominate literature to be digitised.

Bibliographic Management

— Boris Jacob —

http://bhleurope.gbv.de/
http://bhleurope.gbv.de/

Metadata Retrieval Services


Examples
http://gso.gbv.de/DB=1.83/


License
N/A

Last release
Unreleased




Goobi

Goobi is an open source software application for digitisation projects and workflow management in libraries, museums and archives.

Digitization, Managing

— Goobi —

http://www.goobi.org/en/
https://github.com/goobi

Collection Management, Media Checker/Validation, Digital Preservation


Examples
http://www.goobi.org/en/references/


License
GNU/GPL (General Public License)

Last release
12/20/2013

Last activity
7 May 2014 (community) 21 Jun 2015 (github)




Graphite

Graphite is a PHP Library, built on top of ARC2, to make it easy to do stuff with RDF data really quickly, without having to naff around with databases.

Adding Metainformation, Programming

— Christopher Gutteridge —

graphite.ecs.soton.ac.uk
https://github.com/cgutteridge/Graphite

Collection Management


Quality of Documentation
Good on the website


License
LGPL

Last release
3/11/2012

Last activity
7/20/2015




Grid

Grid is the Guardian’s new image management system, which provides a universal and fast experience accessing media that is organised and using it in an affordable way to produce high-quality content. Grid runs as a set of independent micro-services (Scala and Play Framework) exposed as hypermedia APIs (argo) and accessed using a rich Web user interface (AngularJS). Grid relies on Elasticsearch for blazing-fast searching, and AWS services as additional storage and communication mechanisms.

Publishing, Managing

— The Guardian —

https://github.com/guardian/grid
https://github.com/guardian/grid

Digital Asset Management, Media Applications, Publication


Quality of Documentation
Extensive

Examples
https://www.youtube.com/watch?v=ZoddCAH9EPE


License
Apache 2.0

Last activity
ongoing




griot

Griot is an open-source iPad application that facilitates engagement with a collection of objects (artifacts, artwork, graphs, or anything else that can be represented visually) through annotations (points of interest on the object itself) and stories (related text-based and multimedia content, presented as a series of pages).

Annotating, Accessing

— Minneapolis Institute of Art —

https://github.com/artsmia/griot
https://github.com/artsmia/griot

Digital Asset Management, Collection Management, Media Applications


Quality of Documentation
Limited

Examples
http://artsmia.github.io/griot/#/


License
MIT

Last activity
2/21/2015




gttext

Free OCR software and ground-truthing tool: the gttext project extracts text from colour images.

Imaging, OCR

http://www.softocr.com/
http://www.softocr.com/downloads

Metadata Retrieval Services


Quality of Documentation
None, except what ships with the download


License
GPL v2

Last release
2/27/2013






Heidrun

Heiðrún (a.k.a. Heidrun, pronounced [roughly] hey-droon) is DPLA’s new metadata aggregation system, which we use to harvest metadata from Hubs, map it to the DPLA Metadata Application Profile, enrich it to clean up and add value, and to index it for use in the DPLA Platform API. Heiðrún is implemented as a Ruby on Rails application that builds on Krikri, a Ruby gem for metadata harvesting, mapping, and enrichment. Heiðrún and Krikri are both released as open source software under the MIT License.

Gathering, Adding Metainformation

— DPLA —

https://digitalpubliclibraryofamerica.atlassian.net/wiki/display/TECH/Heidrun
https://github.com/dpla/heidrun

Ingestion Tool


Quality of Documentation
Good on the website


License
MIT

Last release
22/2/2015

Last activity
7/1/2015




Hydra

Hydra is a repository solution that is being used by institutions worldwide to provide access to their digital content. Hydra provides a versatile and feature rich environment for end-users and repository administrators alike.

Curating, Managing, Preservation

http://projecthydra.org/
https://github.com/projecthydra

Digital Preservation, Digital Asset Management


Quality of Documentation
Very good

Examples
http://projecthydra.org/apps-demos-2-2/


License
Apache 2 license

Last release
7/14/2015

Last activity
ongoing




HyperImage

The HyperImage platform supports the linking of (audio)-visual objects, texts and mixed-media documents. HyperImage allows any number of details, or subregions, within an image to be highlighted and described, and for annotations within a corpus to be linked to each other, making them accessible in indices. Interim results as well as final versions can be compiled at any time as an online/offline hypermedia publication. This makes HyperImage a suitable research environment for digital humanities and eScience projects, providing a common research and publication environment for groups as well as individuals.

Annotating, Analyzing

— Dr. Heinz-Günter Kuper, Dr. Jens-Martin Loebel —

http://hyperimage.ws/en/team-en/
http://sourceforge.net/p/hyperimage/hi3repos/ci/master/tree/

Media Annotation/Tagging


Quality of Documentation
Extensive

Examples
http://telota.bbaw.de/cvma/HyperCVMA/StendalDom/


License
Apache 2 license

Last release
6/15/2015

Last activity
ongoing




Hypothesis

Hypothes.is brings community peer review to the Internet. It is a web application that enables rich annotation of web content. The project acts as a community aggregator for annotations and as an identity provider for annotators. It also serves embed code for an annotation agent designed with modern web browsers in mind.

Crowdsourcing, Annotating

— Hypothesis —

https://hypothes.is/
https://github.com/hypothesis/h

Media Annotation/Tagging, Contextualisation


Quality of Documentation
It is a very simple tool to install and use.


License
mixed open source licenses. 'We typically use the 2-Clause BSD License (aka the “Simplified BSD License” or the “FreeBSD License”).'

Last activity
actively ongoing




ICA-AtoM

ICA-AtoM is a web-based archival description software that is based on the International Council on Archives (‘ICA’) standards. ‘AtoM’ is an acronym for ‘Access to Memory’. ICA-AtoM is multi-lingual and supports multi-repository collections.

Adding metainformation, Curating

— Artefactual Systems in collaboration with the ICA Program Commission (PCOM) —

https://www.ica-atom.org/
https://github.com/artefactual/atom

Collection Management, Repository Software


Quality of Documentation
The source code is maintained on GitHub (https://github.com/artefactual/atom), where the majority of the additions already happened in October 2012. A live demo of the software is available at http://demo.ica-atom.org/ and different types of manuals can be found at https://www.ica-atom.org/doc/Main_Page.

Ease of Adaptation
There are approximately 250 institutions worldwide running ICA-AtoM. A list of them can be found at https://www.ica-atom.org/doc/ICA-AtoM_users.

Code Quality
The code is maintained on GitHub. PHP is the main language. There are 8 contributors, but only 4 of them are active. The last AtoM maintenance release (2.0.1) was on 16.12.13.

Examples
http://demo.ica-atom.org/


License
GPL

Last release
v1.3.1

Last activity
actively ongoing




IIPImage

IIPImage is an advanced high-performance feature-rich image server system for web-based streamed viewing and zooming of ultra high-resolution images.

Visualizing, Publishing

— Ruven Pillay —

http://iipimage.sf.net
https://github.com/ruven/iipsrv

Exhibition Management, Media Applications


Quality of Documentation
Documentation for IIPImage is extensive and complete. It is neatly organized and includes helpful troubleshooting procedures. All documentation can be found at http://iipimage.sourceforge.net/documentation/. A live demo is available on the IIPImage website: http://iipimage.sourceforge.net/demo/

Ease of Adaptation
You can use the IIPImage server as it is. If you want to plug in another tool, you will have to code it yourself, because there is no plug-in mechanism. Customization is done on the client side (see IIPMooViewer). Quite a lot of institutions use the IIPImage server; some of them are listed at http://iipimage.sourceforge.net/links/.

Code Quality
There are installation packages available for Debian/Ubuntu, Fedora/Redhat/CentOS, Windows and Mac OS X. Nevertheless, it is recommended that you compile the server yourself, if possible, in order to fully optimize it for your system. The server is written in C++.

Examples
http://iipimage.sourceforge.net/demo/


License
GNU General Public License, v. 3

Last release
4/9/2011




IIPMooViewer

IIPMooViewer is a high-performance, lightweight, HTML5 and Ajax-based JavaScript image streaming and zooming client designed for the IIPImage high-resolution imaging system.

Visualizing

— Ruven Pillay —

http://iipimage.sourceforge.net
https://github.com/ruven/iipmooviewer

Media Applications


Ease of Adaptation
If you have an IIPImage Server up and running it is easy to implement a client application with the IIPMooViewer.

Code Quality
The JavaScript files are hosted on GitHub https://github.com/ruven/iipmooviewer.


License
GNU General Public License








ImageJ

ImageJ can display, edit, analyze, process, save and print 8-bit, 16-bit and 32-bit images. It can read many image formats including TIFF, GIF, JPEG, BMP, DICOM, FITS and “raw”. It supports “stacks”, a series of images that share a single window. It is multithreaded, so time-consuming operations such as image file reading can be performed in parallel with other operations.

Visualizing

— Wayne Rasband —

http://rsbweb.nih.gov/ij/docs/intro.html
http://rsbweb.nih.gov/ij/download.html

Media Checker/Validation, Digital Asset Management, Media Applications



License
PD




ImageMontage

ImageMontage creates montages of images without the need to load them into a stack first. The source images can be located in one or multiple directories. The images can also have different sizes. To fit images into a montage, the macro scales them vertically to the same user-specified size. Images are added horizontally until they no longer fit on the row and then a new row is started. This process is repeated until all source images are rendered. If you see that the generated montage does not include all source images, change the options to reduce the size of images and/or increase canvas dimensions, and run the macro again.
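The row-filling layout described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the macro's own code (ImageMontage itself is an ImageJ macro); the function name `layout_montage` and its parameters are hypothetical, and the sketch assumes that an image which would overflow the bottom of the canvas ends the montage, which is the situation the description warns about.

```python
def layout_montage(sizes, row_height, canvas_width, canvas_height):
    """Compute where each image would be drawn in a row-by-row montage.

    sizes is a list of (width, height) pairs in pixels, in source order.
    Returns (placements, omitted), where placements is a list of
    (x, y, scaled_width, row_height) boxes and omitted counts the images
    that did not fit on the canvas.
    """
    placements = []
    x = y = 0
    for index, (width, height) in enumerate(sizes):
        # Scale every image to the same height, keeping its aspect ratio.
        scaled_width = max(1, round(width * row_height / height))
        # Start a new row when the image no longer fits on the current one.
        if x > 0 and x + scaled_width > canvas_width:
            x = 0
            y += row_height
        # Stop when the next row would fall outside the canvas.
        if y + row_height > canvas_height:
            return placements, len(sizes) - index
        placements.append((x, y, scaled_width, row_height))
        x += scaled_width
    return placements, 0
```

A non-zero `omitted` value corresponds to the case mentioned in the description: reduce the row height or enlarge the canvas and run the layout again.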

Visualizing

— Lev Manovich, Matias Giachino, Jay Chow —

http://lab.softwarestudies.com/2014/03/how-to-visualize-4512-instagram-selfies.html
http://rsbweb.nih.gov/ij/plugins/image-montage/index.html moved to https://github.com/culturevis

Media Applications, Exhibition Management



License
PD






Internet Archive BookReader

The Internet Archive BookReader is used to view books from the Internet Archive online and can also be used to view other books.

Resource Sharing, Visualizing

https://openlibrary.org/dev/docs/bookreader
https://github.com/openlibrary/bookreader

Content Retrieval Services


Quality of Documentation
Documentation can be found here: https://openlibrary.org/dev/docs. Everything necessary seems to be there.

Ease of Adaptation
The easiest way to find out how the Internet Archive BookReader works is to do a “git clone https://github.com/openlibrary/bookreader.git” and then open BookReaderDemo/index.html in your browser. Next, have a look at BookReaderDemo/BookReaderJSSimple.js.

Code Quality
Almost all of the functionality is in BookReader/BookReader.js. There are some comments in the code, but not many. There are 8 contributors, but only one of them is active.

Examples
http://www.archive.org/stream/birdbookillustra00reedrich#page/n3/mode/2up


License
AGPL v3






Islandora

Islandora is an open source framework that combines the Drupal and Fedora open software applications to create a robust digital asset management system that can be fitted to meet the short and long term collaborative requirements of digital data stewardship. Additional open source applications are added to this core stack to create what we call Solution Packs.

Managing, Collaborating, Storing

— The Islandora Foundation —

http://islandora.ca/
https://github.com/Islandora

Digital Asset Management


Quality of Documentation
A Wiki is set up to provide documentation https://wiki.duraspace.org/display/ISLANDORA/Islandora.

Ease of Adaptation
There seems to be a very active user group: https://groups.google.com/forum/#!forum/islandora-dev and there are 63 installations listed: http://islandora.ca/islandora-installations.

Code Quality
Github: https://github.com/islandora hosts all the repositories for Islandora. All of the 58 repositories are public. The code is written mainly in PHP and JavaScript.

Examples
http://sandbox.islandora.ca/


License
GNU-GPL

Last release
16/04/2012

Last activity
actively ongoing




JHOVE

JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects.

Processing, Preserving

— JSTOR & the Harvard University Library —

http://openpreservation.org/technology/products/jhove/
https://github.com/openpreserve/jhove

Digital Preservation, Media Annotation/Tagging


Quality of Documentation
Documentation is at http://jhove.openpreservation.org/


License
LGPLv2

Last release
1.14 (May 12, 2016)

Last activity
ongoing




JHOVE2

Processing, Preserving






jMet2Ont

Modifying






Joomla Europeana Search Component

Seeking






Julius

Data recognition, Transcribing






Kaltura

A standard Drupal Module that integrates the Kaltura video platform capabilities into Drupal, allowing you to instantly enrich your Drupal site with video, audio, and images. Kaltura’s video platform and video module for Drupal is an enterprise grade solution that offers comprehensive powerful functionality and full control over content ownership, without the need to run your own video operation. The module handles every aspect of rich-media, including uploading and importing content, transcoding, content management, syndication, monetization, distribution, and more.

Web-developing, Publishing

— Kaltura —

http://corp.kaltura.com/
https://github.com/kaltura/platform-install-packages#documentation-and-guides

Media Annotation/Tagging, Media Applications



License
GNU GPL v3.0






Kartograph

Kartograph is a simple and lightweight framework for building interactive map applications without Google Maps or any other mapping service.

Visualizing, Mapping, Capturing

— Gregor Aisch —

http://kartograph.org/
https://github.com/kartograph/kartograph.py and https://github.com/kartograph/kartograph.js

Geo-spatial Applications, Data Visualization


Quality of Documentation
There are two different versions of this application, each with different specialty features that are clearly stated on the main page. Each version has its own clear, descriptive documentation both on the homepage and on GitHub. No technical support is available, but there are active demos.


License
LGPL (Kartograph.js) & AGPL (Kartograph.py)




KAT

Annotating






Koha

Koha is the first free and open source software library automation package (ILS). Development is sponsored by libraries of varying types and sizes, volunteers, and support companies from around the world.

Adding Metainformation

— Various —

http://koha-community.org/
http://git.koha-community.org/gitweb/

Metadata Retrieval Services, Search and Browsing


Quality of Documentation
The documentation on the Koha website is very thorough and detailed, is updated periodically and is extremely easy to navigate. It is also available in multiple languages, though mostly only for older versions. Technical support is also available.


License
GPL-2, or later

Last activity
4/13/2012




Korbo

Korbo is a Semantic Web basket manager. It allows users to search, import and augment Linked Data resources. Personal augmented collections created with Korbo are then republished in the Linked Data cloud. Korbo is part of the Muruca suite.

Enriching

— Net7 —

http://korbo.muruca.org/

Semantic Enrichment



Last release
Not yet production ready but prototype released 1/7/2012




KriKri

A Rails engine for metadata aggregation, enhancement, and quality control.

Adding metainformation

— DPLA —

Metadata Retrieval Services



License
MIT

Last release
24/2/2015




Kuali OLE

Kuali OLE is the first system designed by and for academic and research libraries for managing and delivering intellectual information. A community of partners will deliver an enterprise-ready, community-source software package to manage and provide access not only to items in their collections but also to licensed and local digital content. Kuali OLE (pronounced oh-LAY, for Open Library Environment) features a governance model in which the entire library community can collaborate to own the resulting intellectual property.

Managing, Accessing

— Kuali OLE https://www.kuali.org/ole/organization —

www.kuali.org/ole
https://wiki.kuali.org/display/OLE/OLE+Published+Documentation

Collection Management



License
ECL 2.0

Last release
5/1/2015

Last activity
on-going






Leaflet

Leaflet is an open source JavaScript library for mobile-friendly interactive maps.

Visualizing, Mapping

— Vladimir Agafonkin —

http://leafletjs.com/
https://github.com/Leaflet/Leaflet

Data Visualization, Geo-spatial Applications



License
BSD license

Last release
11/18/2013




LinkedTV editorial-tool

In the LinkedTV project (http://linkedtv.eu), the Editor Tool (ET) enables moderating (named) entities and hyperlinks, which are automatically extracted from audiovisual content. This data is created to provide a useful starting point for editorial teams that are responsible for publishing interactive television programmes, which aim to offer contextual information to viewers.

Contextualizing, Extracting metadata

— Jaap Blom, NISV —

https://github.com/beeldengeluid/linkedtv-editortool
https://github.com/beeldengeluid/linkedtv-editortool

Metadata Retrieval Services, Digital Asset Management, Media Annotation/Tagging


Examples
http://linkedtv.eu/


License
Apache v 2.0




LoCloud_bglink

Background link service for LoCloud

This repository contains the background link service module developed within the LoCloud project. The module consists of a PHP script that implements a REST service, calls DBpedia Spotlight for the actual processing, and wraps the answer into a suitable format.

The background link service uses DBpedia Spotlight as a backbone for performing the linking. In principle, the service can be used for any language; the only requirement is that a DBpedia Spotlight instance for that language is running.

Enriching

— LoCloud —

http://www.locloud.eu/
https://github.com/ixa-ehu/locloud_bglink

Contextualisation, Linked Open Data



License
Apache v 2.0

Last activity
11/26/2014




LodLive

The LodLive project provides a demonstration of the use of Linked Data standards (RDF, SPARQL) to browse RDF resources.

Browsing

— Diego Valerio Camarda and Silvia Mazzini (http://www.regesta.com/), Alessandro Antonuccio (http://www.hstudio.it/) —

http://en.lodlive.it/
https://github.com/dvcama/LodLive

Metadata Retrieval Services



License
MIT License

Last release
7/4/2012

Last activity
7/4/2012




LODmilla-frontend

LODmilla - a graph-based Linked Open Data browser. Intended as a prototype tool for generic LOD browsing. This frontend of LODmilla is written in “plain” jQuery, no js frameworks used (yet). Graph visualization is done by jsPlumb.

Browsing

— dsd-sztaki-hu —

https://github.com/dsd-sztaki-hu/LODmilla-frontend
https://github.com/dsd-sztaki-hu/LODmilla-frontend

Search and Browsing, Linked Open Data


Examples
http://munkapad.sztaki.hu/lodmilla/


License
Apache v 2.0

Last activity
2/9/2014






MAchine Readable Cataloging (MARC)

MARC is a Ruby library for reading and writing MAchine Readable Cataloging (MARC) records.

Adding metainformation

— Kevin Clarke, Bill Dueber, William Groppe, Ross Singer, Ed Summers —

http://marc.rubyforge.org/
https://github.com/ruby-marc/ruby-marc/

Collection Management, Metadata Retrieval Services



License
MIT License

Last release
2012

Last activity
2012






MARCXML Toolkit

The MARCXML toolkit is a set of Java programs which allow users to convert to and from the MARC file format (including full character set conversion) and other formats available in the MARCXML architecture.

Converting

http://www.loc.gov/standards/marcxml/
http://www.loc.gov/standards/marcxml/

Metadata Mapping/Conversion/normalisation


Quality of Documentation
No documentation of the source code (marcxml.jar) could be located.

Ease of Adaptation
It is not clear where to find the source code!

Code Quality
A Windows Batch file (marcxml.bat) has to be used to convert to and from the MARC file format. The recommended Java Version is 1.4 which was released in 2002!


License
GNU LGPL, Version 2.1






MediaInfo

A convenient unified display of the most relevant technical and tag data for video and audio files.

Presenting

— MediaArea —

http://mediainfo.sourceforge.net/en

Ontology/Vocabulary Building and Management, Media Applications


Quality of Documentation
The software is very easy to use. Documentation could not be located, but it is also not necessary.

Ease of Adaptation
Bug reports (http://sourceforge.net/p/mediainfo/bugs/) and feature requests (http://sourceforge.net/p/mediainfo/feature-requests/) can be filed and there is also a forum for all other questions (http://sourceforge.net/p/mediainfo/discussion/). They are all very active.

Code Quality
The code is hosted on sourceforge http://sourceforge.net/p/mediainfo/code/commit_browser and there seems to be development ongoing. Binaries for nearly all operating systems are available: http://mediaarea.net/en/MediaInfo/Download.


License
LGPL license

Last release
0.7.86 (May 31, 2016)

Last activity
actively ongoing




MediaMosa

MediaMosa is open source software to build a Full Featured, Webservice Oriented Media Management and Distribution platform.

Managing, Presenting

— MediaMosa —

www.mediamosa.org
https://github.com/mediamosa

Collection Management, Content Retrieval Services, Media Applications


Quality of Documentation
It is quite easy to find documentation on http://www.mediamosa.org.

Ease of Adaptation
MediaMosa is based on the Representational State Transfer (REST) architecture. This makes it easy to attach your own tools. There is a forum (http://www.mediamosa.org/forum/index.php) for bugs and feature requests. The community seems to be very helpful.

Code Quality
Most of the code is written in PHP. There is sufficient documentation in the code. There are 9 contributors registered on GitHub. Two of them are committing the majority of the code. You can find the code here: https://github.com/mediamosa/mediamosa.


License
GNU General Public License

Last release
7/4/2012




Mediathread

Mediathread is a Django site for multimedia annotations that facilitates collaboration on video and image analysis. It was developed at the Columbia Center for New Media Teaching and Learning (CCNMTL).

Annotating, Collaborating, Analyzing

— Columbia Center for New Media Teaching and Learning —

http://mediathread.info/content/about-mediathread
https://github.com/ccnmtl/mediathread

Media Annotation/Tagging, Interactive User Interface



License
GNU GPL

Last release
31/1/2015




MediaWiki

MediaWiki is free server-based software which is licensed under the GNU General Public License (GPL). It’s designed to be run on a large server farm for a website that gets millions of hits per day. MediaWiki is an extremely powerful, scalable software and a feature-rich wiki implementation that uses PHP to process and display data stored in a database, such as MySQL. Pages use MediaWiki’s wikitext format, so that users without knowledge of XHTML or CSS can edit them easily. When a user submits an edit to a page, MediaWiki writes it to the database, but without deleting the previous versions of the page, thus allowing easy reverts in case of vandalism or spamming. MediaWiki can manage image and multimedia files, too, which are stored in the filesystem. For large wikis with lots of users, MediaWiki supports caching and can be easily coupled with Squid proxy server software.

Processing, Presenting

— Wikimedia —

https://www.mediawiki.org/wiki/MediaWiki
https://www.mediawiki.org/wiki/Download

Collection Management, Linked Open Data


Examples
http://www.transcribe-bentham.da.ulcc.ac.uk/td/Transcribe_Bentham


License
GNU GPL




MFCS Metadata Form Creation System

The Metadata Form Creation System (MFCS) is WVU Libraries' answer for providing an easy-to-use interface for librarians, staff, and students to enter metadata and upload digital items for our digital collections. MFCS is also our archival and preservation system. MFCS is a delivery- and repository-agnostic system.

Processing, Archiving, Preserving

— Michael Bond, WVU Libraries —

https://github.com/wvulibraries/mfcs
https://github.com/wvulibraries/mfcs

Collection Management, Digital Asset Management, Digital Preservation


Examples
http://www.youtube.com/watch?v=8RVqZNPsf8A


License
WVU Open Source License

Last release
20/5/2014




MINT ingestion tool

MINT services compose a web based platform that was designed and developed to facilitate aggregation initiatives for cultural heritage content and metadata in Europe.

Collecting, Adding metainformation, Aggregating

— Katerina Komninou, Nasos Drosopoulos, Vasilis Tzouvaras, Anna Christaki, Arne Stabenau, Costas Pardalis, Fotis Xenikoudakis, Nikos Simou —

http://mint.image.ece.ntua.gr/
https://github.com/mint-ntua/Mint-Athena

Metadata Mapping/Conversion/normalisation, Collection Management, Metadata Retrieval Services


Quality of Documentation
The first release of MINT was authored 2 years ago: https://github.com/mint-ntua/Mint-Athena. A more recent version is being developed by the DM2E (Digitised Manuscripts to Europeana) project: https://github.com/DM2E/dm2e-mint. A user manual is available at http://mint.image.ece.ntua.gr/mint2/documentation/, although some screenshots would make it easier to understand.

Ease of Adaptation
MINT was designed to operate on different aggregation schemes. Hence it is easy to plug in your own schema.

Code Quality
The code is written mainly in Java. There are enough comments in the code, and Javadoc can be generated. The main contributions come from EU-funded projects. It is hard to say whether there is also a user community outside of these projects.

Examples
http://oreo.image.ece.ntua.gr:8080/mint/Login_input.action


License
GNU Affero General Public License

Last release
11/30/2011




Mirador

An open-source, web-based ‘multi-up’ viewer that supports zoom-pan-rotate functionality and the ability to display and compare simple images and images with annotations.

Presenting

— Stanford University —

http://projectmirador.org/
https://github.com/IIIF/mirador

Media Applications, Media Annotation/Tagging


Quality of Documentation
The quality of the documentation for developers is excellent. Everything you need can be found here: https://github.com/IIIF/mirador/blob/master/README.md. There is no tutorial for users available but this is also not necessary.

Ease of Adaptation
Mirador can connect to repositories that provide a IIIF-compliant Metadata API.

Code Quality
The project started in August 2013. At the moment there are 4 contributors. JavaScript is the language of choice. A “first-time-setup” https://github.com/openseadragon/openseadragon#first-time-setup is quite easy to do. Just install GRUNT (http://gruntjs.com/getting-started) beforehand. On the command line, in the mirador folder type “grunt server” and open in your browser http://localhost:8000.

Examples
http://projectmirador.org/demo/#0e503169-f07e-4b0b-b1c9-8f08066b1547


License
Apache License, Version 2.0

Last release
15/14/2015




mozjpeg

A modern JPEG encoder designed to reduce the size and load time of web pages that carry a lot of pictures. It was announced in March 2014, is supported by large websites such as Facebook, and is used in tools such as ImageOptim. mozjpeg supports optimized Huffman tables, custom quantization matrices and modern techniques such as trellis quantization while maintaining the same values for the structural similarity index (SSIM).

Encoding

— Josh Aas, Mozilla research —

https://github.com/mozilla/mozjpeg
https://github.com/mozilla/mozjpeg



License
custom (with Copyleft)

Last release
5/18/2015 (v3.1)

Last activity
5/18/2015




Muse Open Source

Muse Open Source is a free software platform for publishing native iPad apps for digital libraries with cultural heritage content.

Publishing

— Glimworm IT —

http://muse-opensource.org/
https://github.com/jonathanrcarter/muse-opensource/wiki

Media Applications, Exhibition Management, Social Applications


Quality of Documentation
The link to the documentation (wiki) does not work!

Ease of Adaptation
The software is based on the Appcelerator Titanium platform, which is used to create iOS, Android, and mobile web apps from a single code base. It depends heavily on the Appcelerator platform.

Code Quality
The Muse Open Source platform seems to be developed by two people. There has been no activity in the last half year.


Last release
5/30/2013

Last activity
5/30/2013




Music 21

Music21 is a set of tools for helping scholars and other active listeners answer questions about music quickly and simply. If you’ve ever asked yourself a question like, “I wonder how often Bach does that” or “I wish I knew which band was the first to use these chords in this order,” or “I’ll bet we’d know more about Renaissance counterpoint (or Indian ragas or post-tonal pitch structures or the form of minuets) if I could write a program to automatically write more of them,” then music21 can help you with your work.

Seeking

— MIT —

http://web.mit.edu/music21/
https://code.google.com/p/music21/downloads/list

Data Visualization



License
LGPL

Last activity
2014




Neatline

The Scholars’ Lab designed Neatline as a suite of plugins for the open-source Omeka framework, which provides a powerful platform for content management and web publication. Through Neatline, you can create rich representations of places, objects, events, narratives, and documents.

Managing, Publishing, Presenting

— Adam Soroka —

http://neatline.org/
https://github.com/scholarslab/Neatline

Exhibition Management, Social Applications, Media Annotation/Tagging


Quality of Documentation
Neatline is extremely organized and thorough with their documentation. They provide visible demos of the applications and its capabilities. There is no technical support but the website is updated periodically.

Examples
http://neatline.org/neatline-in-action/


License
Apache License v 2.0

Last release
2012

Last activity
actively ongoing






Nodel

Nodel is an open source digital media control system built on Java and Python.

It uses a series of nodes that perform “actions” or respond to “signals” to control various devices on a network, allowing quick and easy management of digital media devices.

Nodel is typically used to control digital media in galleries, museums, corporate meeting rooms and digital signage.

In short, Nodel can manage any programmable device across a wide range of platforms.

Managing

— Museum Victoria —

http://nodel.io/
https://github.com/museumvictoria/nodel

Collection Management, Exhibition Management


Examples
https://raw.githubusercontent.com/museumvictoria/nodel/gh-pages/docs/White_Paper-Nodel.pdf


License
Mozilla Public License 2.0/MIT License

Last release
5/20/2014




NotreDAM

NotreDAM is a multi-user, web-based Digital Asset Management platform

Managing

www.notredam.org
http://code.google.com/p/notredam/

Collection Management, Digital Asset Management, Metadata Retrieval Services


Quality of Documentation
The documentation for NotreDAM is very well organized and thorough. There are demos available as well as an active forum.

Examples
http://notredam.org/demo/


License
GNU GPL v3




Numishare

Numishare is an open source suite of applications for managing digital cultural heritage artifacts, with a particular focus on coins and medals.

Managing

— Ethan Gruber, American Numismatic Society —

http://numishare.blogspot.com/
https://github.com/ewg118/numishare

Collection Management, Metadata Mapping/Conversion/normalisation, Exhibition Management


Quality of Documentation
The documentation for Numishare is very minimal and not easy to find. There also appear to be no regular updates or an active forum for discussion.

Examples
http://numismatics.org/search/ http://numismatics.org/ocre/


License
Apache License 2.0

Last release
December 2010

Last activity
26/10/2012 - constant development




Nuxeo Platform

Nuxeo provides a modular, extensible Java-based open source software platform for enterprise content management, and packaged applications for document management, digital asset management and case management.

Managing

— Nuxeo —

http://www.nuxeo.com
http://nuxeo.github.io/

Collection Management, Content Retrieval Services, Digital Asset Management


Quality of Documentation
The documentation for Nuxeo is very thorough and easy to find. There is also a very active forum that people actually use and comment on, and there are regular blog posts.


License
LGPL

Last release
10/30/2013

Last activity
actively ongoing








ol3

A high-performance, feature-packed library for all your mapping needs

Visualizing

— OpenLayers —

http://openlayers.org/
https://github.com/openlayers/ol3

Geo-spatial Applications


Examples
http://openlayers.org/en/v3.2.1/examples/


License
Apache 2.0, BSD, MIT

Last release
13/2/2015




Omeka Contribution Plugin

Makes an Omeka site into one that accepts public contributions. The plugin provides a form to collect stories, images, or other files from the public and manages those contributions in your Omeka archive as items.

Publishing

— Center for History and New Media, George Mason University —

http://omeka.org/codex/Plugins/Contribution
https://github.com/omeka/plugin-Contribution

Exhibition Management, Interactive User Interface, Social Applications



License
GNU GPL

Last release
v 3.0.1 August 2014




Omeka S

A multisite reworking of Omeka on newer tech standards and aiming at interoperability with more systems

Publishing

— Roy Rosenzweig Center for History and New Media, George Mason University —

Exhibition Management


Quality of Documentation
In progress


Last release
v 0.4.0-alpha https://github.com/omeka/omeka-s/releases/tag/v0.4.0-alpha




Omeka

A Collection/Exhibition Management System

Publishing

— Roy Rosenzweig Center for History and New Media, George Mason University —

http://omeka.org/
https://github.com/omeka/Omeka

Exhibition Management, Collection Management



License
GNU GPL

Last release
2.4.1

Last activity
2016-05-25




Open Exhibits

Multitouch and multiuser software

Collaborating

— Ideum —

http://openexhibits.org/
https://github.com/ideum

Interactive User Interface, Exhibition Management



License
Simplified BSD License

Last release
v.3.0, 13 November, 2013




Open Licenses Service

Data on more than 100 open licenses, including all OSI-approved open source licenses and all Open Definition conformant open data and content licenses. Provided in easy-to-use, machine-readable JSON – perfect if you need to drop a license chooser into your app.

Seeking

— Open Knowledge Foundation —

http://licenses.opendefinition.org/
http://licenses.opendefinition.org/

Linked Open Data, Content Retrieval Services



License
MIT License

Last activity
1/2012
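
As a rough illustration of the license-chooser use case mentioned in this entry, the sketch below filters license records held as JSON. The sample records and the field names (`id`, `title`, `osd_conformance`, `od_conformance`) are assumptions made for this example; the service's actual JSON should be checked before relying on them.

```python
import json

# Hypothetical sample in the spirit of the service's JSON output.
# Field names and values are assumptions for illustration only.
SAMPLE_JSON = """
[
  {"id": "MIT", "title": "MIT License",
   "osd_conformance": "approved", "od_conformance": "not reviewed"},
  {"id": "CC-BY-4.0", "title": "Creative Commons Attribution 4.0",
   "osd_conformance": "not reviewed", "od_conformance": "approved"},
  {"id": "example-closed", "title": "Example Closed Licence",
   "osd_conformance": "rejected", "od_conformance": "rejected"}
]
"""


def load_licenses(text):
    """Parse the JSON text into a list of license dictionaries."""
    return json.loads(text)


def chooser_options(licenses, field):
    """Return (id, title) pairs for licenses whose `field` is 'approved',
    sorted by title, ready to feed a drop-down list."""
    approved = [lic for lic in licenses if lic.get(field) == "approved"]
    return sorted(((lic["id"], lic["title"]) for lic in approved),
                  key=lambda pair: pair[1])


licenses = load_licenses(SAMPLE_JSON)
open_source = chooser_options(licenses, "osd_conformance")
open_data = chooser_options(licenses, "od_conformance")

for license_id, title in open_source:
    print(license_id, "-", title)
```

In a real application the sample string would be replaced by the JSON fetched from the service, and the chooser would likely offer further filters.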




Open Monograph Press

Open Monograph Press is an open source software platform for managing the editorial workflow required to see monographs, edited volumes, and scholarly editions through internal and external review, editing, cataloguing, production, and publication. OMP can also operate as a press website with catalog, distribution, and sales capacities.

Managing, Editing, Cataloguing, Producing, Publishing

— Public Knowledge Project —

https://pkp.sfu.ca/omp/
https://pkp.sfu.ca/omp/omp_documentation/

Collection Management


Examples
https://pkp.sfu.ca/omp/omp_demo/


License
Open Source






OpenSeadragon

An open-source, web-based viewer for zoomable images, implemented in pure JavaScript.

Editing, Browsing

http://openseadragon.github.io/
https://github.com/openseadragon/openseadragon

Media Applications


Quality of Documentation
The quality of the documentation for developers is excellent. Everything you need can be found here: https://github.com/IIIF/openseadragon/blob/master/README.md. There is no tutorial for users available but this is also not necessary.

Ease of Adaptation
Plugins can be used to display your image's scale in real-world measurements, enhance OpenSeadragon, provide coordinate conversion, pan, and zoom methods in a simplified coordinate system, and provide hooks into an OpenSeadragon.Viewer and/or OpenSeadragon.MouseTracker for overriding/extending the default user-input event handling behavior. An issue tracker on GitHub is found here: https://github.com/openseadragon/openseadragon/issues.

Code Quality
The project started in January 2013. At the moment there are 23 contributors. JavaScript is the language of choice. A “first-time-setup” https://github.com/openseadragon/openseadragon#first-time-setup is quite easy to do. Just install GRUNT (http://gruntjs.com/getting-started) beforehand. On the command line, in the openseadragon folder type “grunt connect watch” and open in your browser http://localhost:8000/test/demo/basic.html.


License
BSD license http://openseadragon.github.io/license/

Last release
12/2013








PDFMiner

PDFMiner is a tool for extracting information from PDF documents.

Machine Learning, Extracting Data

— Yusuke Shinyama —

https://euske.github.io/pdfminer/
https://github.com/euske/pdfminer/

Content Retrieval Services, Semantic Extraction


Examples
http://pdf2html.tabesugi.net:8080/


License
MIT/X License

Last release
31/12/2007 - initial release

Last activity
5/4/2015




Pencilcase

Pencil Case is a web app which offers tools for designers & developers. It showcases over 750 resources in design, development, learning, productivity, collaboration, publishing, testing, and more, all tracked by popularity in real time.

Seeking

— Pencil Case —

http://pencilcaseapp.com/

Collection Management, Digital Asset Management



Last release
23/02/2013 (terms & guidelines)

Last activity
Copyright 2015




Plumi

Plumi is a free open-source video-sharing app based on Plone

Resource Sharing

— EngageMedia in collaboration with Unweb.me —

http://blog.plumi.org/
https://github.com/plumi/plumi.app

Exhibition Management, Media Applications


Examples
http://blog.plumi.org/sites-using-plumi/ http://demo.plumi.org/


License
As Plumi pulls together a range of different products, different licenses apply to different elements of the software. However most are covered either by the GNU GPL or the Zope Public License

Last release
4.5.2 (June 2015)

Last activity
2015




Popcorn.js

Popcorn.js is Mozilla’s HTML5 video and media library for the open web. It allows web developers, filmmakers, artists, designers and others to easily create timeline-based web productions. Popcorn.js helps simplify media API and implementation differences between browsers and includes a powerful event system, a rich plugin architecture, and a set of plugins.

Web-developing

— Mozilla —

http://popcornjs.org/
https://github.com/mozilla/popcorn-js

Media Applications


Quality of Documentation
Excellent

Examples
http://popcornjs.org/demos


License
MIT License

Last release
1.5.6




Posest

Posest is a C/C++ library for robust 6DoF camera pose estimation from 3D-2D correspondences.

Analyzing

— Manolis Lourakis (FORTH) The development of posest has been partially supported by the EC FP7 programme under grant no. 270138 DARWIN —

http://users.ics.forth.gr/~lourakis/posest/

Media Applications


Quality of Documentation
Rather weak


License
GNU GPL

Last release
1.1




Public Domain Calculator

The Public Domain Calculator determines what is in the public domain and what is not.

Filtering

— Open Knowledge Foundation —

http://publicdomainworks.net/
https://github.com/okfn/pdcalc

Search and Browsing, Content Retrieval Services, Metadata Retrieval Services



License
Open Source

Last release
2014




Pundit

Pundit is a semantic web annotation tool. It allows users to create structured data in their annotations by creating semantic relations between different kinds of items, whether portions of text in a web page, images, Linked Data entities or entries from a custom vocabulary. Annotations can be private or public and can be consumed by external applications via a REST API.

Annotating

— Net7 —

http://thepund.it
https://github.com/net7/pundit2

Semantic Enrichment, Data Annotation/Curation, Linked Open Data


Examples
http://thepund.it/try-pundit/


License
Pundit Code is divided into Pundit Server Code and Pundit Client code (http://thepund.it/license/)

Last release
2.0.51 beta

Last activity
2015




PyBossa

PyBossa is an open source platform for crowd-sourcing online (volunteer) assistance to perform tasks that require human cognition, knowledge or intelligence (e.g. image classification, transcription, information, location etc).

Crowdsourcing

— Shuttleworth OKFN Scifabric —

http://www.pybossa.com
https://github.com/PyBossa/pybossa

Digital Asset Management, Media Annotation/Tagging, Social Applications


Quality of Documentation
Excellent


License
GNU Affero General Public License

Last release
v0.2.2 (11/05/2015)

Last activity
2015




pymarc

Pymarc is a Python library for working with bibliographic data encoded in MARC21. It should work under Python 2.x and 3.x. It provides an API for reading, writing and modifying MARC records. It was mostly designed to be an emergency eject seat, for getting your data assets out of MARC and into some kind of saner representation.

Organizing, Bibliographic Management

— Gabriel Farrell, Mark Matienzo, Geoffrey Spear, Ed Summers —

http://pypi.python.org/pypi/pymarc
https://github.com/edsu/pymarc

Data Annotation/Curation



License
BSD

Last release
v3.0.4 (27/07/2015)

Last activity
2015
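
To make "getting your data out of MARC" more concrete, here is a minimal standard-library sketch of how a MARC21 transmission record is laid out (24-character leader, directory of 12-character entries, then the field data). It does not use pymarc and is not pymarc's API; the helper names `build_record` and `parse_record` and the sample record are invented for this illustration, and real records need far more care (character encodings, malformed directories, and so on), which is what a library such as pymarc is for.

```python
FIELD_END = b"\x1e"    # field terminator
RECORD_END = b"\x1d"   # record terminator
SUBFIELD = b"\x1f"     # subfield delimiter


def build_record(fields):
    """Build a small MARC21 record from (tag, data_bytes) pairs (demo helper)."""
    directory = b""
    body = b""
    for tag, data in fields:
        data = data + FIELD_END
        # Directory entry: 3-char tag, 4-digit field length, 5-digit start offset.
        directory += b"%s%04d%05d" % (tag.encode("ascii"), len(data), len(body))
        body += data
    base = 24 + len(directory) + 1      # leader + directory + its terminator
    total = base + len(body) + 1        # plus the record terminator
    leader = b"%05dnam a22%05d a 4500" % (total, base)
    return leader + directory + FIELD_END + body + RECORD_END


def parse_record(raw):
    """Turn one MARC21 record into a plain dictionary."""
    leader = raw[:24].decode("ascii")
    base = int(leader[12:17])           # base address of the field data
    directory = raw[24:base - 1]        # drop the directory's terminator
    record = {"leader": leader, "fields": []}
    for i in range(0, len(directory), 12):
        entry = directory[i:i + 12].decode("ascii")
        tag, length, start = entry[:3], int(entry[3:7]), int(entry[7:12])
        data = raw[base + start:base + start + length - 1]  # strip terminator
        if tag < "010":
            # Control fields (001-009) carry a bare value.
            record["fields"].append({"tag": tag, "value": data.decode("utf-8")})
        else:
            # Data fields: two indicators, then delimiter + code + value.
            subfields = [(chunk[:1].decode("ascii"), chunk[1:].decode("utf-8"))
                         for chunk in data[2:].split(SUBFIELD)[1:]]
            record["fields"].append({"tag": tag,
                                     "indicators": data[:2].decode("ascii"),
                                     "subfields": subfields})
    return record


raw = build_record([
    ("001", b"demo0001"),
    ("245", b"10" + SUBFIELD + b"aAn example title /" + SUBFIELD + b"cA. N. Author."),
])
record = parse_record(raw)
for field in record["fields"]:
    print(field)
```

The resulting dictionary can be serialised to JSON or mapped to another schema, which is the kind of "saner representation" the description alludes to.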




Question2Answer

A Q2A site helps your online community to share knowledge. People with questions get the answers they need. The community is enriched by commenting, voting, notifications, points and rankings.

Consulting

— Gideon Greenspan —

http://www.question2answer.org/
https://github.com/q2a/question2answer

Collection Management


Examples
http://demo.question2answer.org/


License
GNU GPL v2

Last release
1.7.1 (27/07/2015)

Last activity
2015




Quick and Dirty RDF browser

This tool returns all the triples in the file and verbosely lists each of their predicate and object values. Quick and easy. This is good for reading everything about a particular resource. The tool does not seem to support content negotiation.

Browsing

— Christopher Gutteridge & University of Southampton —

http://graphite.ecs.soton.ac.uk/browser/
http://graphite.ecs.soton.ac.uk/download.php/browser.php

Content Retrieval Services, Semantic Extraction


Examples
http://education.data.gov.uk/id/school/118217 http://eprints.ecs.soton.ac.uk/id/eprint/10053 http://id.southampton.ac.uk/building/59 http://data.totl.net/playingcards/ http://dbpedia.org/resource/Southampton


License
GPL 2.0
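
The behaviour described in this entry (listing every predicate and object for a resource) can be sketched in a few lines of Python. This is not the browser's own code: the regular expression handles only simple N-Triples lines with IRI subjects and predicates, and the sample data and `example.org` IRIs are made up for the illustration.

```python
import re

# Matches simple N-Triples lines: <subject> <predicate> object .
# Blank nodes and escaped literals are deliberately not handled.
TRIPLE = re.compile(r'^<([^>]*)>\s+<([^>]*)>\s+(.+?)\s*\.\s*$')

# Hypothetical sample data.
SAMPLE = """\
<http://example.org/building/59> <http://www.w3.org/2000/01/rdf-schema#label> "Building 59" .
<http://example.org/building/59> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://example.org/ns/Building> .
<http://example.org/room/1> <http://example.org/ns/within> <http://example.org/building/59> .
"""


def parse(text):
    """Return a list of (subject, predicate, object) tuples."""
    triples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        match = TRIPLE.match(line)
        if match:
            triples.append(match.groups())
    return triples


def describe(triples, resource):
    """Return (predicate, object) pairs for triples whose subject is `resource`."""
    return [(p, o) for s, p, o in triples if s == resource]


triples = parse(SAMPLE)
about = describe(triples, "http://example.org/building/59")
for predicate, obj in about:
    print(predicate, "->", obj)
```

For anything beyond a quick look, a full RDF parser is the safer choice, since real data includes blank nodes, language tags and escaped characters.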












Recline.js

A simple but powerful library for building data applications in pure JavaScript and HTML.

Programming

— Max Ogden, Rufus Pollock —

http://reclinejs.com/
https://github.com/okfn/recline

Infrastructure


Examples
http://reclinejs.com/demos/


License
MIT license

Last release
sept. 2012

Last activity
actively ongoing






ReSearcher

ReSearcher is an award-winning integrated suite of open source products for locating and managing electronic information resources, designed for use by students and researchers in academic libraries.

Managing, Locating

— Simon Fraser University Library —

http://researcher.sfu.ca/

Search and Browsing, Digital Asset Management



License
GNU GPL

Last activity
actively ongoing






Retailer

Retailer is a platform for web applications written in XSLT.

Essentially it is a kind of XML transforming web proxy, able to present a RESTful API as another API.

Web-developing, Presenting

— Conal Tuohy —

https://github.com/Conal-Tuohy/Retailer
https://github.com/Conal-Tuohy/Retailer

Metadata Retrieval Services


Examples
https://github.com/Conal-Tuohy/Retailer/blob/master/etc/trove.xsl


License
Apache 2.0




Roundware

Roundware is a flexible, distributed framework which collects, stores, organizes and re-presents audio content.

Seeking, Archiving, Presenting

— http://www.roundware.org/category/team/ —

http://www.roundware.org/
https://github.com/hburgund/roundware-server

Digital Asset Management


Examples
http://www.roundware.org/category/examples/


License
LGPLv2




ruby-marc

A Ruby library to parse, create and manage MARC records

Parsing, Producing, Managing

— University of Michigan —

https://github.com/ruby-marc/ruby-marc
https://github.com/ruby-marc/ruby-marc

Digital Asset Management


Quality of Documentation
Besides a README file (https://github.com/ruby-marc/ruby-marc/blob/master/README.md), there is no documentation available.

Code Quality
GitHub (https://github.com/ruby-marc/ruby-marc) hosts all the Ruby scripts. There are 5 contributors, of whom just one is really active. The project started in 2005 and still seems to be maintained.


License
MIT License




Sanddragon Image Service

The Sanddragon Image Service is a WCF (Windows Communication Foundation) web service which implements the IIIF image API, deployed on Microsoft’s IIS (Internet Information Server)

Visualizing

— British Library —

http://sanddragon.bl.uk/#home
https://github.com/britishlibrary/SanddragonImageService

Exhibition Management, Media Applications


Examples
http://sanddragon.bl.uk/#imageservice


License
BSD-3 Licence

Last release
2013
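
Since this service implements the IIIF Image API, a client addresses an image through a URL of the form `{base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}`. The sketch below assembles such a URL. The function name `iiif_image_url` and the `example.org` base address are placeholders, and because this entry does not state which API version the service implements, the quality keyword is left as a parameter (version 1.x of the specification used `native`, 2.x uses `default`).

```python
from urllib.parse import quote


def iiif_image_url(base, identifier, region="full", size="full",
                   rotation="0", quality="default", fmt="jpg"):
    """Assemble an IIIF Image API request URL.

    Path pattern: {base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}
    The identifier is percent-encoded so that slashes or spaces inside it
    cannot be mistaken for path separators.
    """
    return "{}/{}/{}/{}/{}/{}.{}".format(
        base.rstrip("/"),
        quote(identifier, safe=""),
        region, size, rotation, quality, fmt,
    )


# Hypothetical server and identifier, for illustration only.
full_image = iiif_image_url("http://example.org/iiif/", "page 1")
detail = iiif_image_url("http://example.org/iiif/", "page 1",
                        region="0,0,512,512", size="256,")
print(full_image)
print(detail)
```

A viewer such as the Sanddragon Widget issues many requests of this shape, one per tile, as the user pans and zooms.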






Sanddragon Widget

The Sanddragon Widget is an image viewer based on OpenSeadragon that has been modified to work with an IIIF server, such as the Sanddragon Image Service.

Managing, Browsing

— British Library —

http://sanddragon.bl.uk/#home
https://github.com/britishlibrary/SanddragonWidget

Exhibition Management


Examples
http://sanddragon.bl.uk/#widget


License
BSD-3 Licence

Last release
2013










Shelf.io

Build virtual shelves for book/cd/record/film collections using linked data

Managing

— Harvard Library Innovation Lab —

http://shelf.io/
https://github.com/harvard-lil/shelfio

Data Visualization, Linked Open Data, Social Applications



License
MIT License/GPL License

Last release
2012




Shred.js

JavaScript framework to enable annotating of diverse media from diverse sources

Annotating

— Columbia Center for New Media Teaching and Learning —

Media Annotation/Tagging



License
unknown

Last release
26/8/2014






SobekCM

SobekCM allows users to discover online resources via semantic and full-text searches, as well as a variety of different browse mechanisms. For each digital resource in the repository there is a plethora of display options, which may be selected by an appropriately authenticated user. This repository includes online metadata editing and online submissions in support of institutional repositories.

Browsing, Visualizing, Editing

— Mark Sullivan —

http://ufdc.ufl.edu/sobekcm
https://github.com/MarkVSullivan/SobekCM-Web-Application

Content Retrieval Services, Collection Management, Search and Browsing



License
GNU General Public License




SOCH

SOCH (Swedish Open Cultural Heritage) is a web service used to search and retrieve data from any organisation holding information or media relating to Swedish cultural heritage. The Swedish name for SOCH is “K-samsök”, which can be translated as “Cultural cross-search”.

Browsing, Retrieving

— K-Samsök —

http://www.ksamsok.se/in-english/
http://www.ksamsok.se/api/kodexempel/

Metadata Retrieval Services, Linked Open Data








Spira

Spira is a framework for using the information in RDF.rb repositories as model objects. It gives you the ability to work in a resource-oriented way without losing access to statement-oriented nature of linked data, if you so choose. It can be used either to access existing RDF data in a resource-oriented way, or to create a new store of RDF data based on simple defaults.

Managing

— RubyGems —

https://rubygems.org/gems/spira
https://github.com/ruby-rdf/spira

Linked Open Data



License
Unlicense

Last release
28/1/2014




Spotlight

Spotlight is a plugin for Blacklight that enables libraries and other cultural heritage institutions to build online exhibits from content in their repositories to better highlight their digital collections. It extends Blacklight by providing a self-service, forms-based user interface that allows exhibit-builders, such as librarians or faculty, to customize the search interface and homepage, and to build media-rich feature pages to better contextualize their collections.

Editing, Web-Developing

— Chris Beer —

https://github.com/sul-dlss/spotlight
https://github.com/sul-dlss/spotlight

Contextualisation, Exhibition Management, Collection Management


Examples
https://www.youtube.com/watch?v=_A7vTbbiF4g&feature=youtu.be


License
Apache 2.0

Last release
6/30/2014






Stacklife

StackLife is a community-based wayfinding tool for navigating the vast resources of the combined Harvard Library System. It enables researchers, teachers, scholars, and students to find what they need and help others learn from them and their paths.

Browsing

— Harvard Library Innovation Lab —

http://stacklife.harvard.edu/explainer.php
https://github.com/harvard-lil/stacklife

Collection Management


Examples
http://stacklife.harvard.edu/item/the-elements-of-graphic-design/612B1DC6-D389-24A7-C6EC-A73298255537


License
MIT

Last release
17/4/2013






Stanford Natural Language Processing Group (NLP)

Stanford CoreNLP provides a set of natural language analysis tools that take raw English text and return the base forms of words and their parts of speech, recognize whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, mark up the structure of sentences in terms of phrases and word dependencies, and indicate which noun phrases refer to the same entities. Stanford CoreNLP is an integrated framework, which makes it very easy to apply a set of language analysis tools to a piece of text. Starting from plain text, you can run all the tools on it with just two lines of code. Its analyses provide the foundational building blocks for higher-level and domain-specific text understanding applications.

Analyzing

— Stanford —

http://nlp.stanford.edu/software/corenlp.shtml
http://nlp.stanford.edu/software/corenlp.shtml#Download

Ontology/Vocabulary Building and Management


Examples
http://nlp.stanford.edu/software/corenlp.shtml#Demo


License
GNU GPL v2 or later




StoryMap.JS

StoryMapJS is a free tool to help you tell stories on the web that highlight the locations of a series of events.

Locating, Visualizing

— Northwestern University Knight Lab —

http://storymap.knightlab.com/
http://storymap.knightlab.com/advanced.html

Geo-spatial Applications, Media Applications, Data Visualization


Examples
http://apps.washingtonpost.com/g/page/world/map-how-isis-is-carving-out-a-new-country/1095/


License
Mozilla Public License v 2.0






SubjectsPlus

SubjectsPlus is a free and open source tool to help you manage several interrelated parts of your library website.

Managing

— Joyner Library East Carolina University/ University of Miami Libraries —

http://www.subjectsplus.com/
https://github.com/subjectsplus/SubjectsPlus

Collection Management, Digital Asset Management



License
GNU GPL

Last release
19/12/2014




Supplejack

Supplejack is DigitalNZ’s tool for aggregating, searching and sharing metadata records. Supplejack is used to collect metadata about millions of items, from hundreds of data sources, across many different data formats including HTML, RSS, XML, OAI-PMH and RDF/XML. It transforms messy data, creates a unified search index and makes consistent metadata widely available via an open API data service. It was built using open source technology by DigitalNZ and the National Library of New Zealand.

Seeking, Organizing, Resource Sharing

— DigitalNZ —

http://www.digitalnz.org/developers/supplejack
https://github.com/DigitalNZ

Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation



License
GNU GPL

Last activity
16/2/2015








TAP

TAP is a collection of free and open-source tools which support the creation and delivery of mobile tours. Currently TAP consists of authoring tools built on top of the content management system Drupal, a native iOS mobile application, and a web-based mobile application built upon the jQuery Mobile library.

Programming

— IMA Museums —

http://tapintomuseums.org/TAP
https://github.com/IMAmuseum

Exhibition Management


Examples
http://tapintomuseums.org/case-studies


License
GPLv3 license




Telemeta

Telemeta is a free and open source web audio archiving software which introduces useful and secure methods to organize, back up, index, transcode, analyse, share and publish any digitized audio or video file with extensive metadata in accordance with open web standards. It is dedicated to collaborative media archiving projects, research laboratories, libraries and digital humanities.

Archiving, Indexing, Resource Sharing, Analyzing

— Guillaume Pellerin, Thomas Fillon —

http://telemeta.org
https://github.com/Parisson/Telemeta

Collection Management, Digital Preservation, Media Annotation/Tagging, Metadata Mapping/Conversion/normalisation, Interactive User Interface


Examples
http://archives.crem-cnrs.fr/


License
CeCILL-B (GPLv2 compatible)

Last release
20/2/2015

Last activity
continuous




TemaTres

TemaTres is an open source vocabulary server, web application to manage and exploit vocabularies, thesauri, taxonomies and formal representations of knowledge.

Adding Metainformation

— Diego Ferreyra —

http://www.vocabularyserver.com/
http://sourceforge.net/projects/tematres/

Ontology/Vocabulary Building and Management, Collection Management


Examples
http://r020.com.ar/tematres/demo/index.php?setLang=en


License
GPLv2 license

Last release
08/2013




Tesseract

Tesseract is probably the most accurate open source OCR engine available. Combined with the Leptonica Image Processing Library it can read a wide variety of image formats and convert them to text in over 60 languages.

Conversioning

— Ray Smith —

https://github.com/tesseract-ocr/tesseract
https://github.com/tesseract-ocr/tesseract

Collection Management, Digital Asset Management, Ontology/Vocabulary Building and Management, Digital Preservation



License
Apache License 2.0

Last release
23/10/2012




The Guide on the Side software

A freely available tool that allows librarians to quickly and easily create online, interactive tutorials that are based on the principles of authentic and active learning

Web-Developing

— Leslie Sult, Justin Spargur, Gregory Hagedon, and Ginger Bidwell at the University of Arizona Libraries —

http://code.library.arizona.edu/gots/
https://github.com/ualibraries/Guide-on-the-Side

Social Applications, Collection Management, Linked Open Data


Examples
http://www.library.arizona.edu/applications/quickHelp/tutorial/searching-the-ua-library-catalog


License
GNU

Last release
2012

Last activity
actively ongoing




Timeline JS

TimelineJS is an open-source tool that enables you to build visually-rich interactive timelines and is available in 40 languages.

Visualizing

— Northwestern University Knight Lab —

http://timeline.knightlab.com/
https://github.com/NUKnightLab/TimelineJS

Exhibition Management, Data Annotation/Curation, Media Applications


Quality of Documentation
Very well written documentation for users. Also very well written documentation for programmers (found on the Github page). Frequent documentation updates and live demo.

Ease of Adaptation
This is a web application that can be plugged into websites. Information on plugging in the web app is available even on the first page of the project’s custom website. More extensive information exists on the GitHub page of the project. There is a list of known installations.

Code Quality
Excellent code quality and package structure. The project is a work of academic level and follows almost every practice for openness to collaboration and explicitness of descriptions.

Examples
http://timeline.knightlab.com/


License
Mozilla Public License, v. 2.0

Last release
October 2013

Last activity
actively ongoing




TimeMapper

Timeliner is a simple JavaScript app to create and share timelines and timemaps, built using ReclineJS.

Visualizing, Resource Sharing

— Rufus Pollock —

http://timemapper.okfnlabs.org/
https://github.com/okfn/timemapper

Exhibition Management


Quality of Documentation
The documentation describes the three steps needed in order to get the application to work and produce the desired results for the user. This information can be found on the custom website of the project. All information related to programmers’ views can be found on the GitHub page of the project. Information exists there for the programmers to install and configure the tool based on their needs, but the related text could be of higher quality.

Ease of Adaptation
Because the software serves one specific purpose, its ease of adaptation depends on how well that purpose matches the need at hand. In practice, TimeMapper requires JavaScript support and a Google account on the user’s side, and a willingness to work in JavaScript on the programmer’s side. There is no documentation for adapting or extending the tool.

Code Quality
Very well written documentation for users.  Also very well written documentation for programmers (found on the Github page). Frequent documentation updates and live demo.

Examples
http://timeliner.okfnlabs.org/view/?url=https://docs.google.com/spreadsheet/ccc?key=0Al6mO9_3Hr2PdGZnRjEwUWxOekhreTNNZEFEMWRZbkE


License
MIT license




Traject

Tools for reading MARC records, transforming them with indexing rules, and indexing to Solr.

Browsing, Modifying, Indexing

— Jonathan Rochkind (Johns Hopkins Libraries), Bill Dueber (University of Michigan Libraries) —

https://github.com/traject-project/traject/
https://github.com/traject-project/traject/

Collection Management, Metadata Mapping/Conversion/normalisation


Quality of Documentation
No custom website for the tool. The docs on GitHub contain all the necessary information for the tool’s operation. Since this is not a full application but (arguably) a large-scale script performing a specific function, the lack of a custom website cannot be considered a substantial drawback in the project’s evaluation, as long as all the necessary information exists. The tutorials for users and programmers are very well written and descriptive.

Ease of Adaptation
Adapting this tool is part of its normal operation: it is done by setting the indexing rules, for which the documentation is quite extensive.

Code Quality
The package structure for the tool is clear and there are numerous tests for users and programmers to consult. The number of contributors is small, but the frequency of support and updates is adequate for a tool of that size. The tool is written in JRuby.

Examples
https://github.com/traject-project/traject/blob/master/test/test_support/demo_config.rb


License
MIT License

Last activity
ongoing




UniversalViewer

The Universal Viewer is an open source project to enable cultural heritage institutions to present their digital artifacts in an IIIF-compliant and highly customisable user interface

Visualizing, Publishing

— Edward Silverton —

https://github.com/UniversalViewer/universalviewer
https://github.com/UniversalViewer/universalviewer

Media Applications, Digital Asset Management, Publication


Quality of Documentation
Rather thorough documentation found on the wiki https://github.com/UniversalViewer/UniversalViewer/wiki

Examples
http://universalviewer.io/examples/


License
MIT License

Last release
15/1/2016

Last activity
ongoing








VIVO

VIVO enables the discovery of researchers across institutions. Participants in the network include institutions with local installations of VIVO or those with research discovery and profiling applications that can provide semantic web-compliant data. The information accessible through VIVO’s search and browse capability will reside and be controlled locally, within institutional VIVOs or other semantic web-compliant applications.

Networking

— Cornell University —

http://vivoweb.org/about
https://wiki.duraspace.org/display/VIVO/VIVO

Semantic Extraction



License
Atlassian Confluence Open Source Project License

Last activity
30/6/2014






VuFind

Library resource portal.

Archiving

— Villanova University's Falvey Memorial Library —

http://vufind.org/
http://vufind.org/downloads.php

Collection Management


Quality of Documentation
The official website is extensively descriptive about the software. Well written documentation for use and deployment. User and developer manuals are present and there is also a live demo to depict how the tool is used. The tool is even presented in conferences so continuous feedback is given on its use and possible improvements. The project is present on Github. Highlight: used by more than 120 organizations worldwide, mostly universities and public libraries.

Ease of Adaptation
There is a support mailing list dating back to the time of the tool’s release. The list’s archive is open for access. List of known installations publicly available. JIRA issue tracker is used. Several companies offer commercial support for organizations using the tool. Lots of configurable options and adaptations are possible.

Code Quality
Github hosted project having a very clear and well organized package structure. PHP coded with explanatory comments for the code methods. Test files to demonstrate functionalities are included in the Github repository. 21 contributors with 3077 commits are shown in official statistics on April 10th, 2014. Oldest change is 2 years ago, last change 16 hours ago.


License
GPL

Last release
8/26/2013




Wagtail

Content management system designed for the Royal College of Art, focused on flexibility and user experience. Light, fast, agile, portable.

Managing

— Torchbox —

wagtail.io
https://github.com/torchbox/wagtail/

Collection Management



License
BSD License




Wandora

Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.

Managing, Publishing, Extracting Data

— Wandora Team —

http://www.wandora.org
http://www.wandora.org/wiki/Download

Collection Management, Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation


Examples
http://kokoelmat.fng.fi/


License
GNU GPL v3

Last release
2012

Last activity
2012-06-21






WEKA

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.

Machine Learning, Data Mining

— The University of Waikato —

http://www.cs.waikato.ac.nz/ml/weka/
http://www.cs.waikato.ac.nz/ml/weka/downloading.html

Metadata Retrieval Services



License
GNU GPL








xDams Platform OS

xDams is a web-based platform for creating, managing and publishing finding aids that comply with the EAD and EAC-CPF schemas.

Producing, Managing, Publishing

— http://www.regesta.com/ —

http://www.xdams.org/
https://github.com/xdamsorg/xDams-core/

Metadata Retrieval Services, Collection Management, Exhibition Management


Quality of Documentation
Nearly everything is written in Italian, even https://github.com/xdamsorg/xDams-core/blob/master/README.md.

Code Quality
Comments in the code are also in Italian.

Examples
http://www.youtube.com/user/xdamsplatform


License
GNU GPL v3

Last release
7/10/2012

Last activity
7/10/2012




xEAC

xEAC is an XForms-based prosopographical information management system for creating and publishing Encoded Archival Context (EAC-CPF) records. A companion tool is EADitor, for editing EAD (Encoded Archival Description) records.

Managing, Producing, Publishing

— Ethan Gruber, American Numismatic Society —

Collection Management, Publication


Quality of Documentation
The installation instructions (http://wiki.numismatics.org/xeac:generic_installation) are quite straightforward: just configure and deploy the application on Tomcat. You also have to set up an Apache Solr instance. The user manual is a little sparse, but there are at least some screenshots and the application does not seem to be too complicated.

Ease of Adaptation
The software seems to be very tailored to the needs of the American Numismatic Society.

Code Quality
xEAC is based on Orbeon Forms (http://www.orbeon.com/), a solution for building and deploying web forms.


Last activity
July 2012




XTF

The eXtensible Text Framework (XTF) is a powerful open source platform for providing access to digital content.

Direct Accessing

— California Digital Library (CDL) —

http://xtf.cdlib.org/
http://xtf.cdlib.org/download/

Content Retrieval Services


Quality of Documentation
There is a lot of documentation on the website: http://xtf.cdlib.org/documentation/, for the user, the developer and the system administrator.

Ease of Adaptation
The change log shows steady development since at least 2006. Unfortunately, no new versions have been released since July 2012.

Code Quality
Building XTF from the Source Code is quite simple. Apache Ant is used to do it. The Java code looks good.

Examples
http://xtf.cdlib.org/xtf/


License
Mozilla Public License 1.0 (MPL), BSD License, Apache Software License

Last release
7/27/2012