Ace
ACE (Auditing Control Environment) is a system that incorporates a new methodology to address the integrity of long-term archives using rigorous cryptographic techniques. ACE continuously audits the contents of the various objects according to the policy set by the archive, and provides mechanisms for an independent third-party auditor to certify the integrity of any object. ACE consists of two components. The first, the Audit Manager (AM), checks files locally to ensure they have not been compromised. The second, the Integrity Management Service (IMS), issues tokens that the AM can use to verify that its local store of file digests has not been tampered with.
Preserving
— University of Maryland Institute for Advanced Computer Sciences —
https://wiki.umiacs.umd.edu/adapt/index.php/Ace:Main
https://wiki.umiacs.umd.edu/adapt/index.php/Ace:Audit_Manager_Installation_Guide
Digital Preservation
License
Open
Last activity
2014
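The local check performed by ACE's Audit Manager, as described in the entry above, amounts to recomputing file digests and comparing them with stored values. The following is a minimal Python sketch of that idea only, not ACE's actual implementation: the use of SHA-256, the manifest layout, and the function names `sha256_of` and `audit` are assumptions made for illustration, and the IMS token step is not modeled.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Dict


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit(manifest: Dict[str, str], root: Path) -> Dict[str, str]:
    """Compare each file under root with its recorded digest.

    The manifest maps relative file names to expected hex digests
    (a hypothetical layout). Returns one status per file:
    "ok", "corrupt", or "missing".
    """
    report = {}
    for name, expected in manifest.items():
        target = root / name
        if not target.is_file():
            report[name] = "missing"
        elif sha256_of(target) == expected:
            report[name] = "ok"
        else:
            report[name] = "corrupt"
    return report


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "report.txt").write_bytes(b"archived content")
        manifest = {"report.txt": sha256_of(root / "report.txt")}
        print(audit(manifest, root))  # file unchanged since the digest was recorded
        (root / "report.txt").write_bytes(b"tampered content")
        print(audit(manifest, root))  # digest no longer matches the manifest
```

A check like this only detects changes to the files. It cannot detect tampering with the manifest itself, which is the gap the IMS tokens are described as closing.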
Active_fedora
Rubydora and ActiveFedora provide a set of Ruby gems for creating and
managing objects in the Fedora Repository Architecture
(http://fedora-commons.org). ActiveFedora
is loosely based on “ActiveRecord” in Rails. The 3.x series of
ActiveFedora depends on Rails 3, specifically activemodel and
activesupport.
Preserving
— Project Hydra —
Collection Management
License
Apache 2.0
Last release
5/2/2015
ActiveTriples
An ActiveModel-like interface for RDF data. Models graphs as
Resources with property/attribute configuration, accessors, and other methods to support Linked Data in a Ruby/Rails environment.
This library was extracted from work on ActiveFedora. It is closely related to (and borrows some syntax from) Spira, but does some important things differently.
Modifying, Categorizing, Browsing
— ActiveTriples —
Linked Open Data
License
Apache 2.0
Last release
22/8/2014
Amara
Amara gives individuals, communities, and larger organizations the power to overcome accessibility and language barriers for online video. Amara is composed of three main parts: a subtitle creation and viewing tool (aka the widget), a collaborative subtitling website, and an open protocol for subtitle search/delivery.
Producing, Transcribing, Collaborating, Retrieving, Subtitling
— Participatory Culture Foundation —
http://www.amara.org/en/
https://github.com/pculture/unisubs
Media Applications
License
GNU GPL
Last release
2/6/4/2014
Amplifon: Sounds of Street View
Sounds of Street View is a digital explorative sound experiment
by Amplifon which gives users a 3-dimensional sound experience in a
Street View environment. Utilising the Web Audio API platform, sounds
are designated as though they were ordinary Google Maps markers, but
instead of an image and information being assigned to the marker, a
sound is
Producing, Recording, Presenting
— Amplifon —
http://www.amplifon.co.uk/sounds-of-street-view/index.html
https://github.com/Amplifon/Sounds-of-Street-View-Framework
Exhibition Management, Geo-spatial applications, Media applications
Examples
http://www.amplifon.co.uk/sounds-of-street-view/index.html
Annotation Studio
An online annotation platform for teaching and learning in the humanities
Annotating
— MIT —
http://www.annotationstudio.org/
https://github.com/hyperstudio/Annotation-Studio
Media Annotation/Tagging
Quality of Documentation
Extensive
Examples
http://www.annotationstudio.org/pedagogy/case-studies/
License
GNU GPL
Last release
5/10/2015
Annotator
The Annotator is an open-source JavaScript library and tool that can be added to any webpage to make it annotatable.
Annotating
— Open Knowledge Foundation —
http://okfnlabs.org/annotator/
https://github.com/okfn/annotator/
Data Annotation/Curation
Quality of Documentation
There is a well-structured GitHub repository with well-written documentation, including API docs. There is also a dedicated website that describes the tool and links to all the important information, such as a tutorial, a live demonstration page where you can test the tool, and a plugin development page.
Ease of Adaptation
The extensibility of the tool is promoted on the main website. There is a link to a plugin development page containing a well written comprehensive tutorial.
Code Quality
The code is maintained in GitHub. CoffeeScript is used as the main language and all files are packaged in a logical structure. In general, the code includes extensive comments. The code is published for download as a minified JavaScript file. A file with the authors of the code is available, listing 20 contributors to the code.
Examples
http://okfnlabs.org/annotator/demo/
License
either MIT or GNU GPLv3
Last release
12/3/2013
Annotorious
Annotorious is a JavaScript annotation library for images and zoomable images. Link the Annotorious source files (CSS and JavaScript) into an existing HTML page, and images inside this page will be enhanced with an interactive drawing and commenting tool.
Annotating, Commenting, Visualizing
— Rainer Simon, Peter Pilgerstorfer, Paul Weichhart —
http://annotorious.github.io
https://github.com/annotorious/annotorious
Media Annotation/Tagging
Quality of Documentation
The main website is very informative, and its main menu includes links to a demo page, a getting started page, a plugin page, an API documentation page, and an about page.
Ease of Adaptation
There is a plugin development page available. In combination with a comprehensive GitHub page, it shouldn’t be too hard to develop plugins.
Code Quality
The code is maintained in GitHub. The JavaScript (using Google Closure) code seems clearly structured and includes comments. There is a useful page with information on how the code was built and can be adapted.
Examples
http://annotorious.github.io http://crowdsourced.micropasts.org/app/phototaggingHorsfield/ http://ait-ngcms.github.io/annotorious-openskos-demo/
License
LGPL
Last release
v0.6 (14/08/2013)
Last activity
actively ongoing
Apache PDFBox
The Apache PDFBox™ library is an open source Java tool for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents.
Producing, Organizing, Data recognition, Annotating
— The Apache Software Foundation —
http://pdfbox.apache.org/
http://pdfbox.apache.org/downloads.html
Digital Asset Management, Collection Management
Quality of Documentation
The main website clearly lists a wealth of useful information about the tool, e.g. software dependencies, the architecture and its underlying ideas, API docs, and information on the individually available command line tools.
Ease of Adaptation
The software does not offer a simple, ready-to-use plugin structure.
Code Quality
The code is hosted in an SVN repository. The project site explains how to build the software and how to create new branches for further development. A page on coding conventions is also linked from the main site.
License
Apache License, Version 2.0
Apache Stanbol
Apache Stanbol (currently in incubation) is an open source modular software stack and reusable set of components for semantic content management.
Organizing, Content Analysis, Data recognition
— Apache Software Foundation —
http://incubator.apache.org/stanbol/
Semantic Enrichment, Collection Management, Digital Asset Management
Quality of Documentation
The main website clearly lists various links with useful information, such as the ideas and architecture of the software, usage scenarios, and a comprehensive ‘getting started’ section.
Ease of Adaptation
There seems to be no direct plugin architecture. However, there is extensive documentation on development practices, which includes an overview of how to contribute your (Java) code to the SVN repository by following the Apache guidelines.
Code Quality
Browsing the source code suggests that it is well structured and that it incorporates Maven (pom.xml), which ensures neat handling of dependencies and versioning. It also includes a benchmark directory, which indicates that the system is well tested.
Examples
http://dev.iks-project.eu:8081/enhancer/chain/austrian-national-library
License
Apache License, Version 2.0
Last release
5/31/2012
Apache Tika
The Apache Tika toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
Parsing, Data recognition
— Apache Software Foundation —
http://tika.apache.org/
http://tika.apache.org/download.html
Metadata Retrieval Services, Semantic Enrichment
Quality of Documentation
On the home page there are links to a dedicated wiki, an issue tracker, mailing lists and for each version of Tika there is documentation on how to get started (you have to download the sources and build them with Maven), API docs and more.
Ease of Adaptation
Tika has a plugin structure for parsers, which enables developers to extend Tika’s supported formats with their own. Developers are encouraged to use existing libraries to implement these custom parsers.
Code Quality
Given that this is an Apache project, and judging by the structure of the Java classes in the API docs, the quality of the code seems fine.
License
Apache License, Version 2.0
Last release
7/3/2013
Last activity
actively ongoing
Archivematica
Archivematica is a free and open-source digital preservation system that is designed to maintain standards-based, long-term access to collections of digital objects.
Preserving
— Artefactual Systems Inc —
https://www.archivematica.org/
https://github.com/artefactual/archivematica
Digital Preservation
Quality of Documentation
Excellent documentation, arranged by versions, divided between user and administration manual, well structured and informative, illustrated with screenshots that are aligned with the documentation text. Professionally created screencast gives a high level overview. Documentation hosted on Wiki.
Ease of Adaptation
Microservices allow adapting workflows and easily integrating new services. Scale out is possible by adding new processing nodes. Service oriented, separation between ingest, storage, and access services, all have well defined APIs.
Code Quality
Code is hosted on GitHub. High code quality (Python 89.7%, JavaScript 6.3%, Shell 3.2%, Other 0.9%), well structured, not many comments, no continuous integration, tests are available.
Examples
http://sankofa.archivematica.org (user name: demo@example.com, password: demodemo)
License
AGPL3 license
Last release
v1.5.0 (June 2016)
Last activity
actively ongoing
ArchivesSpace
A next-generation archives management application that will incorporate the best features of Archivist’s Toolkit (AT) and Archon. The project team is developing a technical platform, governance structure, and service model that will provide the archival community with a cutting-edge, extensible, and sustainable platform for describing analog and born-digital archival materials. The ArchivesSpace product is being developed using an Agile scrum process, guided by a Product Vision for ArchivesSpace.
Managing, Adding metainformation, Preserving
— Hudson Molonglo —
www.archivesspace.org
https://github.com/archivesspace/archivesspace/wiki
Collection Management
Quality of Documentation
Excellent documentation, well structured, providing many additional documentation resources. Extensive documentation index.
Ease of Adaptation
ArchivesSpace has a backend for the major workflows, as well as a REST API and many interfaces for adapting the system.
Code Quality
Code is hosted on GitHub. High code quality (Ruby 76.6%, XSLT 8.9%, JavaScript 7.7%, CSS 6.0%, Shell 0.8%), well structured, not many comments, no continuous integration, tests are available.
License
ECL 2.0
Last release
ArchivesSpace 1.0.7.1 (March 2014)
Last activity
on-going
Artoo.js
artoo.js is a piece of JavaScript code meant to be run in your browser’s console to provide you with some scraping utilities.
This nice droid is loaded into the JavaScript context of any webpage
through a handy bookmarklet that you can install by dragging it onto
your bookmark bar.
Archiving, Extracting data
— Guillaume Plique —
http://medialab.github.io/artoo/
https://github.com/medialab/artoo
Metadata Retrieval Services
ASV Toolbox
ASV Toolbox is a modular collection of tools for the exploration of written language data. They work either on word lists or text and solve several linguistic classification and clustering tasks. The topics covered include language detection, POS-tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern-based and statistical approaches. The collection can be used to work on large real-world data sets as well as for studying the underlying algorithms. The ASV Toolbox can work on plain text files and connect to a MySQL database. While it is especially designed to work with corpora of the Leipzig Corpora Collection, it can easily be adapted to other sources.
Named entity recognition, POS Tagging, Extracting data
— University of Leipzig —
http://wortschatz.uni-leipzig.de/~cbiemann/software/toolbox/
http://wortschatz.uni-leipzig.de/~cbiemann/software/toolbox/
Collection Management, Ontology/Vocabulary Building and Management
Avalon Media System
The Avalon Media System is an open source system for managing large collections of digital audio and video files
Managing, Archiving
— Indiana University and Northwestern University —
http://www.avalonmediasystem.org
https://github.com/avalonmediasystem
Collection Management
Examples
http://www.avalonmediasystem.org/try-out-avalon
License
Apache 2.0
Last release
11/2013
Last activity
ongoing
BIBFRAME
Two tools designed to help you evaluate MARC Bibliographic data in the BIBFRAME model: a comparison service and a transformation service
Comparing, Modifying
— Library of Congress Network Development and MARC Standards Office —
http://bibframe.org/
https://github.com/lcnetdev/marc2bibframe
Metadata Checker/Validation, Metadata Mapping/Conversion/normalisation
Quality of Documentation
Documentation on the concepts behind BIBFRAME is quite extensive. Developer documentation, however, seems to be limited to a not very descriptive README on GitHub.
Ease of Adaptation
There is no plugin structure mentioned, but the tool’s functionalities (“convert MARC/XML Bibliographic records to BIBFRAME resources”) are fairly straightforward (there is an XQuery file for each format), so extending the tool for other types of bibliographic records should not be too hard.
Code Quality
The most important bits of code are written in XQuery. Next to this there is an exhibit app in HTML. Both parts seem clear and are not hard to comprehend. However there is not much in the sense of comments.
Examples
http://bibframe.org/demos/
Binarization and Conversion Tool
The BCT tool can be used to produce JPEG2000 or JPEG files of newspaper master images for presentation on the web. This tool calls two other tools, a binarization method from Basilis Gatos which is optimised for OCR, and Kakadu, a software development kit for creating JPEG2000 images. Therefore, in order to fully use the features of BCT, both tools must be installed and licensed. However, feel free to call your own tools from BCT.
Imaging, Compressing
— University of Innsbruck —
https://github.com/dea-uibk/BCT
https://github.com/dea-uibk/BCT
Publication, Image Search, Media Checker/Validation
License
MIT
Last release
10/12/2014
BitCurator Access
BitCurator Access software tools will assist collecting institutions
(libraries, archives, and museums) in providing web-based and local
access to born-digital materials held on disk images. BitCurator Access
will focus on software that simplifies access to raw and
forensically-packaged disk images, allowing collecting institutions to
incorporate these objects into access environments in a manner that
reflects the original order and relevant environmental context. The use
of open source digital forensics software will allow for detailed
analysis of file and file system provenance, quality and accessibility
of files, metadata in files and the file system, and residual (non-file
system) data contained within disk images.
Direct accessing, Analyzing, Access Management, Adding metainformation
— University of North Carolina at Chapel Hill —
http://access.bitcurator.net/index.php?title=Main_Page
https://github.com/kamwoods/bca-webtools
Collection Management
License
GPL v 3
Last release
30/12/2014
Blacklight
A Ruby on Rails gem that provides a discovery interface for any Solr index
Browsing
http://projectblacklight.org/
https://github.com/projectblacklight/blacklight
Search and Browsing
Quality of Documentation
The home page quickly and clearly gives an introduction to Blacklight and shows all the necessary links, e.g. towards a quickstart guide and the GitHub, to get started as a developer. The overall documentation is available in a wiki and seems mostly well written and clear.
Ease of Adaptation
There is a dedicated section on how to contribute to Blacklight as well as a page that lists the (un)official add-ons that are available. With several examples and clear instructions, as well as a developer mailing list, it seems fairly straightforward to create custom extensions.
Code Quality
The Ruby code contains useful comments. The main app is nicely structured according to MVC and includes YAML config files. The main JavaScript function clearly explains the structure and rationale of the code; the aim is to use or extend JQuery plugins as much as possible. Overall the code seems to be of good quality.
Examples
https://github.com/projectblacklight/blacklight/wiki/Examples
License
Creative Commons Attribution-Share Alike 3.0 United States License.
Last release
6/25/2012
Bolt
Bolt is a tool for Content Management, which strives to be as simple and straightforward as possible. With this general user manual you will be able to use Bolt to produce and edit content for your website.
Managing
— Bob den Otter —
http://bolt.cm/
https://github.com/bolt/bolt
Collection Management
License
MIT
Last activity
15/5/2014
BRAT Rapid Annotation Tool
“Online environment for collaborative text annotation”; focused on structured annotation of text, e.g., tagging named entities such as persons, organizations, etc., and their relationships
Annotation
— University of Tokyo —
http://brat.nlplab.org/index.html
https://github.com/nlplab/brat
Data Annotation/Curation
Quality of Documentation
Extensive
License
MIT
Last release
22/11/2012
Last activity
1/7/2016
BWF MetaEdit Tool
This tool permits embedding, editing, and exporting of metadata in Broadcast WAVE Format (BWF) files. This tool can also enforce metadata guidelines developed by the Federal Agencies Audio-Visual Working Group, as well as recommendations and specifications from the European Broadcasting Union (EBU), Microsoft, and IBM.
Adding metainformation, Extracting metadata
— FADGI —
http://bwfmetaedit.sourceforge.net/
http://bwfmetaedit.sourceforge.net/Download/Source/
Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation, Digital Asset Management
Quality of Documentation
The documentation is quite technical and not very attractive, but just enough to make use of the tool. There does not seem to be a way to download the source code. The tool itself does work well (used by the reviewer).
Ease of Adaptation
There does not seem to be any way of getting or contributing to the source code.
Code Quality
Nothing can be said about the code quality because the code is unavailable.
Carto
CARTO is an open, powerful, and intuitive platform for discovering and
predicting the key insights underlying the location data in our world.
Imaging, Georeferencing
— CARTO —
https://carto.com/
https://github.com/CartoDB/cartodb
Geo-spatial Applications
Quality of Documentation
Extensive
Examples
https://carto.com/gallery/
License
https://github.com/CartoDB/cartodb/blob/master/LICENSE
Last release
6/6/2016
Catmandu
Catmandu provides a suite of Perl modules to ease the import, storage, retrieval, export and transformation of metadata records.
Preserving
— Nicolas Steenlant, Patrick Hochstenbach —
http://librecat.org/index.html
https://github.com/librecat/catmandu/
Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation, Collection Management
Quality of Documentation
The main site offers a brief introduction to the capabilities of Catmandu and an extensive tutorial. The GitHub code repository also contains developer documentation, which needs to be generated after downloading (or can be found by browsing the code repository). The developer documentation seems well written and quite extensive.
Ease of Adaptation
The code repository in GitHub also includes a page directed to contributors, but does not seem to describe any plugin architecture. Since the code seems professionally maintained and fairly well documented, adding customizations to the code seems worth considering.
Code Quality
The Perl code looks well structured, well written, and contains comments in the form of perldoc. There is a long list of tests available. Also the code is included in CPAN (a repository of Perl libraries).
License
GPL-2, or later
Last release
11/2013
Last activity
ongoing
Cesium
Cesium is a JavaScript library for creating 3D globes and 2D maps in a web browser without a plugin. It uses WebGL for hardware-accelerated graphics, and is cross-platform, cross-browser, and tuned for dynamic-data visualization.
Imaging
— AGI —
http://cesiumjs.org/index.html
https://github.com/AnalyticalGraphicsInc/cesium
Geo-spatial Applications
Examples
http://cesiumjs.org/demos.html
License
Apache 2.0
Last release
2/3/2015
ChicagoCodeX
An authoring and publishing environment for online catalogues with full scholarly apparatus
Authoring, Publishing
— The Art Institute of Chicago and the Indianapolis Museum of Art. —
http://www.oscitoolkit.org/
https://github.com/IMAmuseum/ChicagoCodeX
Exhibition Management, Collection Management
Examples
http://www.getty.edu/foundation/initiatives/current/osci/osci_browse_catalogues.html
CKAN
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data.
Managing, Publishing, Resource Sharing
— Open Knowledge Foundation —
http://ckan.org/
https://github.com/ckan/ckan
Collection Management
Quality of Documentation
The quality of the website and the available documentation is very good. The purpose and features, including the API, of CKAN are all documented very well.
Ease of Adaptation
There is dedicated documentation on the writing of extensions. Moreover there are 60 external extensions available, which can serve as an example. The code base is large, but the documentation is good, so writing these extensions shouldn’t be too hard.
Code Quality
The Python code looks good. There are not many comments, but for the important objects the comments give references to the API docs. The API docs are very detailed.
Examples
http://ckan.org/case-studies/
License
Affero GNU GPL v3.0
Last release
13/08/2013
Last activity
ongoing
ClioPatria semantic search web-server
An RDF and semantic search web server based on Prolog
Browsing
— Friedrich Lindenberg —
http://e-culture.multimedian.nl/software/ClioPatria.shtml
http://cliopatria.swi-prolog.org/cpack_home
Search and Browsing
Examples
http://eculture.cs.vu.nl/europeana/session/search
License
GPL-2
Last activity
11/22/2011
Clipper Prototype 3
Clipper is a free open-source web application enabling researchers to create and share virtual-clips without altering the original media files. Clipper enables you to mark the start and end of interesting events while playing audio or video data files through a standard web browser. You can add rich text annotations to each clip, and combine clips into playlists (cliplists)
Annotating
— The City of Glasgow College, The Open University and Reachwill Ltd —
http://blog.clippertube.com/index.php/clipper-prototype-3/
https://github.com/reachwill/clipper
Media Annotation/Tagging
Last release
18/11/2015
Last activity
ongoing
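The "virtual clip" idea described in the Clipper entry above (marking a start and end point in a media file, annotating it, and grouping clips into cliplists without touching the original media) can be pictured as a small data structure. The Python sketch below is purely illustrative and is not Clipper's actual data model: the class names `Clip` and `Cliplist`, their fields, and the example URL are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Clip:
    """A reference into a media file; the media itself is never modified."""
    media_url: str   # location of the original audio/video file
    start: float     # clip start, in seconds from the beginning
    end: float       # clip end, in seconds from the beginning
    note: str = ""   # annotation attached to the clip

    def __post_init__(self):
        # Reject clips that run backwards or start before the media does.
        if not 0 <= self.start < self.end:
            raise ValueError("a clip needs 0 <= start < end")

    @property
    def duration(self) -> float:
        return self.end - self.start


@dataclass
class Cliplist:
    """An ordered playlist of clips, possibly drawn from several media files."""
    title: str
    clips: List[Clip] = field(default_factory=list)

    def total_duration(self) -> float:
        return sum(clip.duration for clip in self.clips)


if __name__ == "__main__":
    opening = Clip("http://example.org/interview.mp4", 10.0, 25.5, "Opening remarks")
    closing = Clip("http://example.org/interview.mp4", 60.0, 65.0, "Closing question")
    highlights = Cliplist("Highlights", [opening, closing])
    print(highlights.total_duration())
```

Because a clip stores only a URL and two time offsets, sharing a cliplist means sharing a few lines of metadata rather than re-encoded media.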
CMUSphinx
CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under a BSD-style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.
Speech recognition
— Carnegie Mellon University —
http://cmusphinx.sourceforge.net/
http://sourceforge.net/projects/cmusphinx/
Enriching metadata, Speech Recognition
CollectionSpace
CollectionSpace is an open-source collections management application that meets the needs of museums, historical societies, and other collection-holding organizations. CollectionSpace is designed to be configurable to each organization’s needs, serving as a gateway to digital and physical assets across an institution. The software is freely distributed via open-source licensing, and an active developer community ensures that CollectionSpace is continually improving.
Managing
— January 2014: Lyrasis is now the organizational home of CollectionSpace. (The project was initiated and led by Museum of the Moving Image. Transition team includes developers based at University of California, Berkeley and Fluid Project at Ontario College of Art and Design; and Jesse Martinez, Freelance developer and Service Provider.) Project Partners 2008-2013 included: University of California Berkeley, Fluid Project at Ontario College of Art and Design, and Centre for Applied Research in Educational Technologies (CARET) University of Cambridge. —
www.collectionspace.org
https://github.com/collectionspace/
Collection Management
Quality of Documentation
The home page clearly provides a link to the documentation, which is very thorough and covers the most relevant topics such as: system requirements, how to configure & install it, how to use and maintain it and also how to develop customizations. The overall quality of this documentation is fairly good.
Ease of Adaptation
CollectionSpace describes having a so-called hook system in place (as used in e.g. Drupal or WordPress), a design pattern for conveniently extending a system. Hook systems in general take some time to learn, and in this case specific documentation on the hooks is yet to be added (some limited examples are available, though).
Code Quality
The different parts, namely the UI, services, application, tools, etc, are in different sections in GitHub, making the distinction clear. Each of these sections is sparsely documented, however the code looks good with JavaDoc and maven pom.xml files.
Examples
www.demo.collectionspace.org
License
ECL 2.0
Last release
10/2013
Last activity
on-going
Collective Access
This is one of the more powerful open source CMS (collection management systems). It allows you to manage metadata in various formats, using metadata profile descriptions. Available profiles include VRAcore, EBUcore, LIDO (contributed by LIBIS), etc.
Managing, Presenting, Adding meta-information
— Whirl-i-Gig —
http://www.collectiveaccess.org/
https://github.com/collectiveaccess
Collection Management, Content Retrieval Services, Exhibition Management
Quality of Documentation
Extensive documentation available on Wiki, well structured. Demo, Installation instructions, Upgrade instructions
Ease of Adaptation
PHP web application, easily extensible. No standards. Documentation for adaptation and API documentation are available.
Code Quality
Mainly PHP-based (PHP 86.7% JavaScript 10.8% CSS 1.9% Other 0.6%). Code hosted on Github. Highly active, last commit 2 days old. Code is well written and documented. Separation into modules, clean code layout. Bugtracker available.
Examples
http://collectiveaccess.org/about/users
License
GNU GPL v.2
Last release
1/2012
Conjecture
Conjecture is a modular, extensible, open-source C++ framework for Optical Character Recognition (OCR). Conjecture is not a single OCR, but rather is an extensible collection of OCRs that can be explored, analyzed, compared, extended, modified, and merged within a unified environment.
OCR
— unknown —
http://conjecture.sourceforge.net/conjecture/index.php
http://conjecture.sourceforge.net/conjecture/download.php
Digital Asset Management
License
GPL
Last release
2007
ConservationSpace
The goal of ConservationSpace is to develop an open-source software application that will address a core need of the conservation community for a shared solution to the problem of documentation management. The conservation community has long recognized that a digital approach to managing its documentation would improve continuity in procedures, increase access, expand research opportunities, and better ensure the preservation of its documents.
Managing, Preserving
— Sirma Group (Sirma ITT and Ontotext). The project is led by the National Gallery of Art (Washington, D.C., USA) with participation from Yale University Special Collections Library, Courtauld Institute, British Museum, Indianapolis Museum of Art, Metropolitan Museum of Art, Statens Museum for Kunst, and the Denver Art Museum. —
www.conservationspace.org
Collection Management, Ontology/Vocabulary Building and Management
Quality of Documentation
Extensive project docs at https://www.wiztechno.com (user: guest, pwd: conservationspace)
Ease of Adaptation
See https://www.wiztechno.com/wiki/display/CSR2/System+Customization+and+Configuration+Requirements for specific requirements of project partners
Code Quality
The code is not yet publicly available.
Examples
https://v9tsig.dm2303.livefilestore.com/y2m3Y4yon2CsUhV3hBrzdgUB2DroiygcRNR_6t0g3DJmZdGUNiW8klxe0JDoukJZkLmGHuvKj_pFYZOyuneWBCNkgFBwQcgLVIwSbTMJPl4LifVaOIBK2PHDTkQVVVtyBescrMibeY5OYIccnGXZ2VyEA/1Sirma%20ITT%20-%20Conservation%20Space%20PR%2003-2015%20-%20screenshot.png
Cross-Platform Authentication - Authorization Provider
Hybrid media devices, which can deliver audio, video and interactive content over both broadcast and broadband, create new opportunities and
challenges for broadcasters. Augmenting the broadcast experience with interactive content delivered over the Internet changes the classic one-to-many paradigm bringing it closer to a one-to-one relationship.
Cross-Platform Authentication (CPA) offers an open standard for associating any media device with an online identity, which facilitates delivery of personalized services to these devices.
Organizing
— EBU —
http://ebu.io/project/cpa
https://github.com/ebu/cpa-auth-provider
Media Applications
CubicWeb
CubicWeb is a semantic web application
framework, licensed under the LGPL, that empowers developers to efficiently
build web applications by reusing components (called cubes)
and following the well known object-oriented design principles.
Its main features are: an engine driven by the explicit data model
of the application; a query language named RQL, similar to W3C’s SPARQL;
a selection+view mechanism for semi-automatic XHTML/XML/JSON/text generation;
a library of reusable components (data model and views) that fulfill common needs;
the power and flexibility of the Python programming language; and
the reliability of SQL databases, LDAP directories, Subversion and
Mercurial for storage backends.
Web-developing
— LogiLab —
http://www.cubicweb.org/
https://docs.cubicweb.org/book/admin/setup
Collection Management, Semantic Extraction, Digital Asset Management
Cultural Enrichment Mashificator
Collaboration tools.
Collaborating
— Jeremy Ottevanger —
http://ccgi.ottevanger.plus.com/cgi-bin/ez/index.php
Content Retrieval Services, Metadata Retrieval Services
Quality of Documentation
Nice presentation of the idea together with a demo. There does not seem to be the intention to disseminate the code. No documentation available. Broken links.
Ease of Adaptation
PHP integration documented.
Code Quality
No code available
Dédalo: Intangible Heritage management and Oral History
Semantic RDF data sources
Annotating, Managing
— Juan Franciso Onielfa, Alejandro Peña —
http://www.fmomo.org
http://www.fmomo.org/dedalo/pg/download.php
Collection Management, Content Retrieval Services
Quality of Documentation
Documentation only available in Spanish, PDF documents.
Ease of Adaptation
No documentation available.
Code Quality
Web application (HTML, CSS, Javascript, PHP, MySQL). Download only possible after registration, code not available.
Examples
http://dedalo.antropolis.net/dedalo/home/
License
GNU GPL v3
Last release
3.5.56
Last activity
July 2012
D3-cartogram
Continuous area cartograms with d3 and TopoJSON
Imaging
— Shawn Allen —
http://prag.ma/code/d3-cartogram/
https://github.com/shawnbot/d3-cartogram/
Geo-spatial Applications, Data Visualisation
Quality of Documentation
The main site briefly describes what the tool is and what dependencies it has. A prominent, but somewhat easy to overlook, link to GitHub is available. On GitHub there is a short readme, which mainly points to an example index.html.
Ease of Adaptation
The example HTML file includes quite a number of JavaScript functions that need to be examined in order to be able to understand how it all works. Besides this, the code is small and it shouldn’t be too difficult to adapt the code to use different (geographical) input.
Code Quality
There is not much code and it does look readable, but there are not many comments to help you. Because the code uses d3.js and geographical algorithms, it takes a while to get into.
D3
D3.js is a JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG and CSS. D3’s emphasis on web standards gives you the full capabilities of modern browsers without tying yourself to a proprietary framework, combining powerful visualization components and a data-driven approach to DOM manipulation.
Managing, Visualizing
— Dustin Ewers —
http://d3js.org/
https://github.com/mbostock/d3/wiki
Data Visualization
Examples
http://bl.ocks.org/mbostock/4063318
License
BSD License
Last release
7/18/2014
dat
dat is an open source tool that enables the sharing of large datasets, allowing for a decentralized collaboration flow
Resource sharing
— Max Ogden —
http://dat-data.com/
https://github.com/maxogden/dat
Metadata Mapping/Conversion/normalisation, Collection Management
Quality of Documentation
The main page embeds a very informative YouTube video on a talk the creator gives about the tool. The documentation on GitHub is extensive, well written and even illustrated. It gets you started and enthuses you to contribute.
Ease of Adaptation
The code base is well documented and quite small. Adapting the system, e.g. to support new formats, should not be too hard.
Code Quality
The code is well documented, concise and straightforward to understand. The code itself, however, appears to contain no comments.
License
BSD License
Last release
still pre-alpha
Data Pipes
Data Pipes is a service to provide streaming, “pipe-like” data transformations on the web – things like deleting rows or columns, find and replace, head, grep etc.
Modifying, Categorizing, Browsing
— Open Knowledge Foundation —
http://datapipes.okfnlabs.org/
https://github.com/okfn/datapipes
Metadata Mapping/Conversion/normalisation
Quality of Documentation
The site starts off with a useful example of what the tool can do. Following the example, the API is detailed. At the bottom you can find a link to GitHub, which contains basic practical documentation on how to install and run the tool.
Ease of Adaptation
The documentation mentions that contributing is “super easy” and points you to a list of issues. Other than that, there does not seem to be a specific plugin architecture or an obvious way to extend the tool.
Code Quality
The code base is small. The code looks well written. Occasionally there are comments.
Examples
http://datapipes.okfnlabs.org/csv/head%20-n%2050/cut%200/delete%201:7/grep%20-i%20London/html?url=https://raw.github.com/okfn/datapipes/master/test/data/gla.csv
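The example URL above chains several transformations as path segments. A minimal sketch of how such a URL could be assembled is given here; the layout (input format, steps, output format, then a `url` query parameter) is inferred from that single example only, and the helper name `build_pipeline_url` is hypothetical rather than part of Data Pipes.

```python
from urllib.parse import quote

# Base address taken from the example URL in this entry.
BASE = "http://datapipes.okfnlabs.org"


def build_pipeline_url(steps, source_url, in_format="csv", out_format="html"):
    """Join pipeline steps into a single Data Pipes style URL.

    The path layout mirrors the example URL; this helper is an
    illustration, not an official client.
    """
    # Percent-encode spaces inside each step, but keep ':' (used in ranges like 1:7).
    encoded = [quote(step, safe=":") for step in steps]
    path = "/".join([in_format, *encoded, out_format])
    return f"{BASE}/{path}?url={quote(source_url, safe=':/')}"


url = build_pipeline_url(
    ["head -n 50", "cut 0", "delete 1:7", "grep -i London"],
    "https://raw.github.com/okfn/datapipes/master/test/data/gla.csv",
)
print(url)
```

With these four steps the sketch reproduces the example URL listed for this tool.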
DBpedia Spotlight
DBpedia Spotlight is a tool for automatically annotating mentions
of DBpedia resources in text, providing a solution for linking
unstructured information sources to the Linked Open Data cloud through
DBpedia.
Annotating, Linking
— Pablo Mendes (Freie Universität Berlin), Jun 2010-present. Jo Daiber (Charles University in Prague), Mar 2011-present. Prof. Dr. Chris Bizer (Freie Universität Berlin), supervisor, Jun 2010-present. —
https://github.com/dbpedia-spotlight/dbpedia-spotlight
https://github.com/dbpedia-spotlight/dbpedia-spotlight
Linked Open Data, Data Annotation/Curation
Quality of Documentation
Well documented, short documentation on Github, further documentation on Wiki, well structured and written.
Ease of Adaptation
REST web services allow integration and adaptation.
Code Quality
Java/Scala based project (Java 48.7% Scala 48.1% Shell 1.7% Python 1.5%), actively maintained, last commit days ago, continuous integration, well-structured, Maven-based build, tests available. Many branches and contributors.
License
Apache License, 2.0, LingPipe
Last release
10/2012
Dedupe
A Python library for accurate and scalable data deduplication and entity resolution.
Curating
— Gregg, Forest, and Derek Eder —
https://github.com/datamade/dedupe
https://github.com/datamade/dedupe
Collection Management
Quality of Documentation
Well documented and structured documentation.
Ease of Adaptation
Extensible.
Code Quality
Java/Scala based (Java 79.7% Scala 11.7% XSLT 5.1% JavaScript 3.1% Groovy 0.5%), hosted on GitHub, actively maintained, tests available, code documented, well written and structured.
License
MIT License
Last activity
2013
Delving Platform: CultureHub and SIP-Creator
Delving has developed and refined an open source platform with tools specifically tailored to the needs of the cultural heritage domain.
Preserving
— Manuel Bernhardt, Gerald de Jong, Eric van der Meulen, Sjoerd Siebinga, Thomas Wikman, Juliane Stiller —
http://delving.eu
https://github.com/delving
Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation, Metadata Retrieval Services
Quality of Documentation
Well documented and structured documentation.
Ease of Adaptation
Extensible.
Code Quality
Java/Scala based (Java 79.7% Scala 11.7% XSLT 5.1% JavaScript 3.1% Groovy 0.5%), hosted on GitHub, actively maintained, tests available, code documented, well written and structured.
Examples
http://www.digitalecollectienederland.nl/ http://samlinger.kulturnett.no/
License
EUPL, Apache 2.0
Last release
Monthly cycle
Last activity
Active now
digilib
Digilib is a web-based client/server technology for images. The image content is processed on-the-fly by a Java Servlet on the server side so that only the visible portion of the image is sent to the web browser on the client side. It supports a wide range of image formats and viewing options on the server side while only requiring an internet browser with JavaScript and a low bandwidth internet connection on the client side. digilib enables very detailed work on an image as required by scientists with elaborate zoom features like an option to show images on the screen in their original size. digilib facilitates cooperation of scientists over the internet and novel uses of source material by image annotations and stable references that can be embedded in URLs. digilib is used in the ALCATRAZ framework of image, text and annotation tools and many other projects.
Editing, Visualizing, Annotating
— Max Planck Institute for the History of Science, the University of Bern —
http://digilib.sourceforge.net/
https://sourceforge.net/p/digilib/code/ci/default/tree https://github.com/robcast/digilib
Media Annotation/Tagging
License
GNU GPL
Last release
4/9/2009
Last activity
4/5/2015
Digital Preservation Software Platform (DPSP)
The DPSP is a collection of software applications which support the goal of digital preservation.
Preserving
— National Archives of Australia —
http://dpsp.sourceforge.net/
http://dpsp.sourceforge.net/download.php
Digital Preservation
Quality of Documentation
Clean presentation, documentation short and concise, documentation hosted on sourceforge.
Ease of Adaptation
Unknown
Code Quality
Builds hosted on sourceforge, no source code available.
License
GPLv3
Last release
7/31/2013
DigitalLabels
Django app to generate an interactive gallery label interface, optimised for touch screens and tablets.
Annotating
— V&A Digital Media team —
http://vanda.github.io/DigitalLabels/
http://vanda.github.io/DigitalLabels/
Exhibition Management
Quality of Documentation
Nice presentation, Basic documentation, well written and structured, documentation hosted on github pages.
Ease of Adaptation
No APIs or documentation about interfaces available.
Code Quality
Python/JavaScript based ( Python 65.2% JavaScript 34.8%), hosted on Github, not being actively maintained, not many comments in code, but readable.
Digitate
Digitate is a free application designed for use on the iOS platform, specifically on iPad devices. The application allows scholars and enthusiasts with an interest in the visual and material elements of a cultural artefact to make notes and annotations directly on an image of such an artefact. For example, a literary scholar might use it to annotate the material or bibliographic elements of a rare text or first edition, while an art historian might do the same on an image of a painting.
Annotating
— OpenDHAT —
Media Annotation/Tagging
Examples
https://www.youtube.com/watch?v=69CKhuiak0Q
License
CC BY-SA
Last activity
1/2/2015
Diva.js
Diva.js (Document Image Viewer with AJAX) is a
Javascript frontend for viewing documents, designed to work with digital
libraries to present multi-page documents as a single, continuous item.
Only the pages that are being viewed at any given time are actually
present in the document, with the rest appended as necessary, ensuring
efficient memory usage and high loading speeds. Written as a jQuery plugin, diva.js requires the jQuery Javascript library. Diva’s back end is constituted by the IIPImage server.
Modifying, Browsing
— Distributed Digital Music Archives and Libraries —
http://ddmal.github.io/diva.js/about/
https://github.com/DDMAL/diva.js
Publication, Exhibition Management
License
Open w/ attribution
Last release
3/6/2015
Last activity
10/6/2015
Doc Viewer
HTML5 document viewer
Browsing
— The New York Times —
https://github.com/documentcloud/document-viewer
https://github.com/documentcloud/document-viewer
Media Applications
Quality of Documentation
Clean documentation, well written and structured, easily comprehensible.
Ease of Adaptation
The project fulfills its specific purpose well; adaptation is easily possible by adapting the Ruby scripts.
Code Quality
Mainly a JavaScript project with some Ruby (JavaScript 73.0% CSS 26.9% Ruby 0.1%). Clean code, many comments support readability of code. Actively maintained, many contributors, code hosted on GitHub.
License
Apache License, v2.0
Last release
4/6/2010
Last activity
27/5/2015
DocSplit
Docsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages…)
Modifying
— Jeremy Ashkenas, DocumentCloud —
http://documentcloud.github.io/docsplit/
https://github.com/documentcloud/docsplit/
Collection Management
Quality of Documentation
Short and concise documentation, well written and structured, easily comprehensible.
Ease of Adaptation
The project fulfills its specific purpose well; adaptation is easily possible by adapting the Ruby scripts.
Code Quality
Pure Ruby project. Clean code, tests available, many comments support readability of code. Actively maintained, many contributors, code hosted on GitHub.
License
LGPL
Last release
17/11/2014
Last activity
5/2/2015
DSpace
DSpace open source software is a turnkey repository application
Preserving
— DSpace Foundation —
http://www.dspace.org/
https://github.com/DSpace/DSpace
Collection Management, Metadata Retrieval Services, Repository Software
Quality of Documentation
Very high quality software documentation, commercial support.
Ease of Adaptation
Complex product ecosystem.
Code Quality
Maven-based, highly modularized, tests available, clean code, mainly java, many comments explaining concepts and supporting readability. Good object oriented design.
Examples
demo.dspace.org
License
Open Source: http://www.dspace.org/license/
Last release
20/5/2015
Last activity
23/6/2015
EADitor
EADitor is an EAD (Encoded Archival Description) editor based on Orbeon XForms. It uses various external services (e.g. Getty AAT, TGN, ULAN) for accessing LOD to be used in the descriptions. A companion tool is xEAC, for creating and managing EAC-CPF records (corporate bodies, persons, families).
Managing
— Ethan Gruber, American Numismatic Society —
http://eaditor.blogspot.com/
http://code.google.com/p/eaditor/
Collection Management, Metadata Retrieval Services
Quality of Documentation
The documentation available on GitHub reads a bit like copied-and-pasted text without structure. But there is a lot of information available, and blog posts explain the concepts of the software and give further examples.
Ease of Adaptation
REST or SOAP interfaces allow easy integration. No plugin mechanism.
Code Quality
CSS/XSLT/JavaScript-based project hosted on GitHub (CSS 62.3% XSLT 17.3% JavaScript 15.0% XProc 5.4%). One contributor (highly active GitHub committer), the code is being actively maintained. A lot of the XSLT is not commented but readable.
Examples
http://numismatics.org/archives/
License
Apache License 2.0
Last release
.1112 beta (December 2011)
Last activity
June 2012
easyLOD
Simple framework for exposing Linked Open Data from arbitrary sources.
Presenting
— Mark Jordan —
https://github.com/mjordan/easyLOD
https://github.com/mjordan/easyLOD
Linked Open Data, Metadata Retrieval Services
Quality of Documentation
Information available on Github page. Documentation and instructions for applying data source plugins, resource URIs and mapping namespaces in resource URIs to data source plugins. Also provides info on installing and testing Easy LOD. Instructions also available within the code repository. Easy to access, read, and follow.
Ease of Adaptation
The GitHub page describes a plugin structure for converting different data sources (to LOD), so creating a plugin for custom content sources should be fairly straightforward. Moreover, the codebase is small and clear, so further adapting the tool should not be too difficult.
Code Quality
The Slim PHP micro framework is used, making the code and all configurations not hard to grasp. The code itself contains several useful comments.
EasyRDF
A PHP library to quickly parse RDF. It supports URL redirect and simple “queries” via API.
Parsing, Data recognition
http://www.easyrdf.org/
http://www.easyrdf.org/downloads https://github.com/njh/easyrdf
Linked Open Data, Metadata Retrieval Services
License
BSD 3-Clause
Last release
3/12/2014
Last activity
2/4/2015
Echoprint
Internet design and authoring tools.
Web-developing, Authoring
— The Echonest in partnership with MusicBrainz —
http://echoprint.me/
https://github.com/echonest/
Content Retrieval Services, Metadata Retrieval Services
Quality of Documentation
The website is easy to navigate; however, it is possible to get linked back to the massive Echonest webpage. Plenty of resources and tutorials help users get started and along the way. The wrong link is listed for the code repository on GitHub: the current link takes you to all the Echonest apps. This is the more specific link: https://github.com/echonest/echoprint-codegen.
Ease of Adaptation
Echoprint offers several useful services which can be used to build your own applications. Extending these services for your own purposes is most likely not needed, except for maybe solving bugs.
Code Quality
The code of most of the tools/services seems to be fairly well organized and commented.
License
The code generator is MIT and the server is Apache 2
Last activity
9/1/2015
ElasticSearch
ElasticSearch is a distributed RESTful search engine built for the cloud.
Seeking
— http://elasticsearch.com/about/team/ —
http://www.elasticsearch.org/
https://github.com/elasticsearch/elasticsearch
Search and Browsing, Content Retrieval Services, Metadata Retrieval Services, Collection Management
Quality of Documentation
Very user-friendly website with extremely thorough documentation. Training, development support, and production support are all available. GitHub page with notes and directions. The blog is updated regularly and is up-to-date. There are also very interesting and helpful case studies available to see how groups have made use of ElasticSearch. One downside of the documentation is that it’s sometimes hard to find concrete examples of everyday use cases.
Ease of Adaptation
ElasticSearch has many configuration options for building the search engine needed for your use case. Extending the tool itself is possible by contributing on GitHub, but is probably not something you would need to do.
Code Quality
The (mainly Java) code seems well documented and commented.
License
Apache License, Version 2.0
Last release
16/7/2015
Last activity
24/7/2015
ERMes
Electronic Resource Management (ERM) Software
Managing
— William Doering, Galadriel Chilton —
http://murphylibrary.uwlax.edu/erm/
murphylibrary.uwlax.edu/erm/
Metadata Retrieval Services
Quality of Documentation
Website not very clear and requires a lot of downloading. There is a semi-active forum for discussion and help and some user generated instructional videos have been provided. No readily visible documentation.
Ease of Adaptation
The code is only available as an Access 2007 file, which makes adapting the code not very easy for most contemporary developers.
Code Quality
Without Access 2007 the code could not be inspected by the reviewer.
License
Attribution Assurance License
Last release
5/1/2014
Europeana Client
Java client for the Europeana Search API. Refactored and Mavenized version of Europeana4j
Seeking
— Sergiu Gordea —
https://github.com/europeana/europeana-client/
https://github.com/europeana/europeana-client/
Metadata Retrieval Services, Linked Open Data
EuropeanaHarvest
A script for harvesting metadata from Wikimedia Commons for the use in Europeana
Gathering, Harvesting
— Wikimedia Sverige —
https://github.com/lokal-profil/EuropeanaHarvest
https://github.com/lokal-profil/EuropeanaHarvest
Metadata Retrieval Services
Europeananp-dbpedia-disambiguation
A simple Python library and web service that allows named entity disambiguation against a label database. The idea is to use a Solr query to filter possible candidates and then use a more detailed analysis of string similarity, number of inlinks and entity type to select the “best” candidate. It contains code to handle (multi-lingual) DBpedia dumps and load them into a Solr backend. It also contains helper code for the annotation of ALTO 2.1 files that are used in the context of the Europeana Newspapers project.
Named Entity Recognition
— KBNL Research —
https://github.com/KBNLresearch/europeananp-dbpedia-disambiguation
https://github.com/KBNLresearch/europeananp-dbpedia-disambiguation
Metadata Retrieval Services, Media Annotation/Tagging
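The candidate selection described for this library (string similarity, number of inlinks, entity type) can be illustrated with a small sketch. This is not the library's own code: the function names, the candidate dictionaries and especially the weights are assumptions chosen for illustration, and the Solr filtering step is replaced by a hard-coded list.

```python
from difflib import SequenceMatcher
from math import log1p


def score(mention, candidate, expected_type=None):
    """Combine string similarity, inlink count and entity type into one score."""
    similarity = SequenceMatcher(
        None, mention.lower(), candidate["label"].lower()
    ).ratio()
    popularity = log1p(candidate["inlinks"])  # dampen very large inlink counts
    type_bonus = 1.0 if expected_type and candidate["type"] == expected_type else 0.0
    # The weights are arbitrary illustration values, not taken from the library.
    return 10.0 * similarity + popularity + 2.0 * type_bonus


def best_candidate(mention, candidates, expected_type=None):
    """Return the highest-scoring candidate, or None if there are none."""
    if not candidates:
        return None
    return max(candidates, key=lambda c: score(mention, c, expected_type))


# Hypothetical candidates, standing in for the results of a Solr label query.
candidates = [
    {"label": "Paris", "inlinks": 50000, "type": "Location"},
    {"label": "Paris Hilton", "inlinks": 3000, "type": "Person"},
    {"label": "Paris, Texas", "inlinks": 800, "type": "Location"},
]
winner = best_candidate("Paris", candidates, expected_type="Location")
print(winner["label"])
```

Taking the logarithm of the inlink count keeps very popular entities from overwhelming the similarity term; a real system would tune such weights on annotated data.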
Europeananp-ner
This tool takes container documents (MPEG21-DIDL, METS), parses all references to ALTO files and tries to find named entities in the pages (with most models: Location, Person, Organisation, Misc). The aim is to keep the physical location on the page available through the whole process to be able to highlight the results in a viewer.
Parsing
— KB Research —
http://europeana-newspapers.eu/
https://github.com/KBNLresearch/europeananp-ner
Metadata Retrieval Services, Semantic Extraction
Examples
http://researchkb.wordpress.com/2014/03/03/ner-newspapers/
License
European Union Public License
Last release
v.0.0.2 March 2014
EuropeanaXMLBuilder
A tool for downloading full record descriptions in the ESE format from any OAI-PMH-compliant interface.
Seeking, Managing, Resource sharing
— PCSS Digital Libraries Team —
http://dl.psnc.pl/software/EXB/site/
http://dl.psnc.pl/software/EXB/site/source-repository.html
Metadata Retrieval Services
License
GNU LESSER GENERAL PUBLIC LICENSE
Last release
Unknown
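For orientation, harvesting from an OAI-PMH interface comes down to issuing `ListRecords` requests and following resumption tokens. The sketch below only builds the request URLs and does not contact any server; the endpoint `https://example.org/oai`, the set name and the `ese` metadata prefix are assumptions, since each repository advertises its own prefixes, and the helper is not part of EuropeanaXMLBuilder.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="ese", set_spec=None,
                     resumption_token=None):
    """Build an OAI-PMH ListRecords request URL."""
    if resumption_token:
        # In OAI-PMH, resumptionToken is an exclusive argument (besides verb).
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint and set, used only for illustration.
first_page = list_records_url("https://example.org/oai", set_spec="newspapers")
next_page = list_records_url("https://example.org/oai", resumption_token="abc123")
print(first_page)
print(next_page)
```

A harvester would fetch the first URL, read the `resumptionToken` element from the response, and repeat with the second form until no token is returned.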
Evergreen
Evergreen is a highly-scalable software for libraries that helps library patrons find library materials, and helps libraries manage, catalog, and circulate those materials, no matter how large or complex the libraries.
Seeking, Managing, Resource sharing
— Various —
http://www.open-ils.org/
http://www.open-ils.org/downloads.php
Collection Management, Search and Browsing
Quality of Documentation
Very clear and extensive documentation. Updated regularly. There is a documentation interest group that meets periodically and the minutes are recorded. The meetings are open for anyone to attend. While the documentation is very clear and organized some may find its vastness overwhelming.
Ease of Adaptation
Evergreen is a huge collection of software packages. To adapt it means a lot of reading, installing and testing. However since the documentation is very thorough and extensive, it should be possible.
Code Quality
The reviewer did not inspect any code, because of the size of the codebase and the several different tools that are available.
License
GNU GPL
Last release
16/6/2015
Last activity
4/9/2011
ExifTool
ExifTool is a platform-independent Perl library plus a command-line application for reading, writing and editing meta information in a wide variety of files.
Adding Metainformation
— Phil Harvey —
http://www.sno.phy.queensu.ca/~phil/exiftool/
http://www.sno.phy.queensu.ca/~phil/exiftool/
Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation
License
GNU General Public License, v.1 or later
Last release
10.20 (June 13, 2016)
eXtensible Catalog
eXtensible Catalog is open source, user-centered, next generation discovery interface software for libraries and consortia. It comprises four software components that can be used independently to address a particular need or combined to provide an end-to-end discovery system to connect library users with resources.
Seeking
— eXtensible Catalog Organization · University of Rochester —
http://www.extensiblecatalog.org/
https://www.drupal.org/project/xc
Collection Management, Search and Browsing, Linked Open Data
Examples
http://catalog.lib.kyushu-u.ac.jp/en/xc/search
License
MIT License (the Java based Toolkits), GNU General Public License (Drupal Toolkit)
Last release
1/28/2014
Last activity
2/26/2014
Fedora Commons
Fedora (Flexible Extensible Digital Object Repository Architecture) was originally developed by researchers at Cornell University as an architecture for storing, managing, and accessing digital content in the form of digital objects inspired by the Kahn and Wilensky Framework. Fedora defines a set of abstractions for expressing digital objects, asserting relationships among digital objects, and linking “behaviors” (i.e., services) to digital objects.
Managing, Storing, Direct Accessing
— various researchers at Cornell University —
http://fedora-commons.org/
http://www.fedora-commons.org/software
Digital Asset Management, Content Retrieval Services, Collection Management
Quality of Documentation
Documentation is not visibly and clearly labeled, but when found it is very thorough and fairly organized. The documentation is hard to navigate and not as clear as some others. The project is ongoing and a new version will be coming out. Explanations about upgrading are provided. No visible community or forum space.
Examples
http://www.fedora-commons.org/about/examples
License
Apache License, Version 2.0.
Last release
15/11/2012
Last activity
actively ongoing
Fedora Migrate
Migrates content from a Fedora3 repository to a Fedora4 one.
Migrating
— Penn State —
Collection Management
License
MIT
Last release
19/12/2015
File Analyzer and Metadata Harvester
This application functions like a digitization Swiss army knife. The application allows a user to analyze the contents of a file system or external drive and generates statistics about the contents of the contained directories. The application can be used to generate checksum values to ensure the bit-level integrity of files after they have been copied to a new device. After a collection of files has been converted from one digital format to another, this application can verify that there is a one-to-one match of before and after files.
Digitizing
— NARA —
http://blogs.archives.gov/online-public-access/?p=6270
https://github.com/Georgetown-University-Libraries/File-Analyzer
Digital Preservation, Metadata Mapping/Conversion/normalisation, Metadata Checker/Validation
Quality of Documentation
Documentation of the File Analyzer and Metadata Harvester has been thoroughly updated in the past few months. The tool's wiki at https://github.com/Georgetown-University-Libraries/File-Analyzer/wiki provides graphics as well as extensive documentation.
License
NARA OPEN SOURCE AGREEMENT VERSION 1.3
Last release
2011
Last activity
5/2014
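The checksum use case mentioned in the description (verifying bit-level integrity after a copy) can be sketched in a few lines. This is a generic illustration rather than the File Analyzer's own implementation; the choice of SHA-256, the function names and the file names are assumptions.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copy_is_intact(original, copy):
    """True when both files have identical checksums."""
    return sha256_of(original) == sha256_of(copy)


with tempfile.TemporaryDirectory() as workdir:
    source = Path(workdir) / "scan_0001.tif"  # hypothetical file name
    target = Path(workdir) / "copy_0001.tif"
    source.write_bytes(b"example image bytes")
    shutil.copyfile(source, target)
    intact = copy_is_intact(source, target)
    target.write_bytes(b"example image bytez")  # simulate a corrupted copy
    corrupted_detected = not copy_is_intact(source, target)

print(intact, corrupted_detected)
```

Reading in chunks keeps memory use constant, which matters for large digitization masters.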
File_MARC
Parse, modify, and create MARC records
Parsing, Modifying
— Dan Scott —
http://pear.php.net/package/File_MARC/
http://pear.php.net/package/File_MARC/download
Collection Management
Quality of Documentation
The documentation for File_MARC is very clear and easy to find on the PEAR website. The documentation is complete and thorough but lacks the detail found in the documentation of some other tools.
License
GNU Lesser General Public License
Last release
10/30/2013
File Rename Tool (FRT)
Dates are printed on newspapers, so issues can be searched for by date later on. If a newspaper is not available in day folders, the File Rename Tool (FRT) can help to bring the images into the right structure and support libraries in renaming and reordering their images according to the Europeana Newspapers project specifications. The main idea of FRT is that images, which may be stored at year level, can be quickly ordered on the basis of issue and publishing date.
Modifying, Managing
— University of Innsbruck —
http://dbis-halvar.uibk.ac.at/dokuwiki/doku.php?id=main:structify
https://github.com/dea-uibk/FRT
Content Retrieval Services, Digital Asset Management
License
MIT
Last release
13/12/2014
Fixity
Fixity is a utility for the documentation and regular review of stored files.
Managing
— AudioVisual Preservation Solutions —
http://www.avpreserve.com/avpsresources/tools/
https://github.com/avpreserve/fixity
Metadata Mapping/Conversion/normalisation, Media Checker/Validation
License
Apache License, Version 2.0.
Last release
1/13/2014, v.0.3
Franken+
The Initiative for Digital Humanities Media and Culture (IDHMC) at Texas A&M University, as part of its Early Modern OCR Project (eMOP), has created a new tool called Franken+ that provides a way to create font training for the Tesseract OCR engine using page images. This is in contrast to Tesseract’s documented method of font training, which involves using a word processing program with a modern font. Franken+ works in conjunction with PRImA’s Aletheia tool and allows users to easily and quickly identify one or more idealized forms of each glyph found on a set of page images. These identified forms are then used to generate a set of Franken-page images matching the page characteristics documented in Tesseract’s training instructions but using a font used in an actual early modern printed document.
Imaging, Analysing
— Texas A&M/Bryan Tarpley —
http://dh-emopweb.tamu.edu/Franken+/
https://github.com/idhmc-tamu/FrankenPlus
Collection Management
Quality of Documentation
Good, webpage has full explanation
License
Apache 2.0
Last release
December 2013
Last activity
11/1/2014
Frog
Frog, formerly known as Tadpole, is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package. Most modules were created in the 1990s at the ILK Research Group (Tilburg University, the Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium). Over the years they have been integrated into a single text processing tool. More recently a dependency parser, a base phrase chunker and a named-entity recognizer module were added.
Processing, Parsing, Named entity recognition
— University of Tilburg —
http://ilk.uvt.nl/frog/
http://software.ticc.uvt.nl/
Semantic Extraction
Quality of Documentation
Poor. Installation and running are covered, but 'we are in the process of writing a reference guide for Frog that explains all options in detail.'
Ease of Adaptation
Limited. No guidelines on how to add other languages.
License
GNU GPL
Last release
1/29/2015
Last activity
1/29/2015
FromThePage
FromThePage is an open-source tool that allows volunteers to collaborate to transcribe handwritten documents.
Collaborating, Transcribing
— Ben W. Brumfield —
http://beta.fromthepage.com/
https://github.com/benwbrum/fromthepage
Social Applications, Contextualisation
Quality of Documentation
The FromThePage home page provides very little documentation and the Github page also has very little documentation. The documentation that is there is clear and visible but severely lacking in detail.
License
GNU AGPL v3.0
Last release
6/1/2015
Last activity
6/1/2015
GATE (General architecture for text engineering)
GATE is over 15 years old and is in active use for all types of computational task involving human language. GATE solves problems concerning text analysis or human language processing.
Processing, Parsing, Named entity recognition
— Various —
http://gate.ac.uk/
http://gate.ac.uk/download/
Alignment Tools, Metadata Mapping/Conversion/normalisation, Media Checker/Validation
Quality of Documentation
Excellent
Examples
http://gate.ac.uk/demos/
License
GNU LESSER GENERAL PUBLIC LICENSE v.3.0
Last release
6/2/2015
Last activity
actively ongoing
Geomapping Web
geomapping service
Georeferencing
— Sergiu Gordea —
https://github.com/europeana/Europeana-Creative/tree/master/geomapping-web
https://github.com/europeana/Europeana-Creative/tree/master/geomapping-web
Geo-spatial Applications
Quality of Documentation
None
Examples
http://62.218.164.177:8080/geomapping/
Getty Vocabularies
The AAT, TGN, ULAN, and CONA contain structured terminology for art and other material culture, archival materials, visual surrogates, and bibliographic materials. Compliant with international standards, they provide authoritative information for catalogers and researchers, and can be used to enhance access to databases and Web sites.
Organizing
— Getty —
http://vocab.getty.edu/
http://vocab.getty.edu/ontology.html
Ontology/Vocabulary Building and Management, Linked Open Data
Quality of Documentation
Good
License
Open Data Commons Attribution License
Last release
v3.1
Last activity
6/5/2015
ggplot
ggplot2 is a plotting system for R, based on the grammar of graphics, which tries to take the good parts of base and lattice graphics and none of the bad parts. It takes care of many of the fiddly details that make plotting a hassle (like drawing legends) as well as providing a powerful model of graphics that makes it easy to produce complex multi-layered graphics.
Visualizing
— Hadley Wickham —
http://ggplot2.org/
https://github.com/hadley/ggplot2
Data Visualization
Quality of Documentation
OK
License
?
Last activity
10 minutes ago!
GIMP
GIMP is the GNU Image Manipulation Program. It is a freely distributed piece of software for such tasks as photo retouching, image composition and image authoring.
Visualizing, Imaging
— GIMP Team —
http://www.gimp.org/
http://www.gimp.org/downloads/
Media Annotation/Tagging
Quality of Documentation
Good
Examples
http://www.gimp.org/screenshots/
License
GPL
Last release
v2.8.14
Last activity
6/4/2015
Global References Index to Biodiversity (GRIB)
It will be a tool to manage the taxonomic literature that is (a) already available in digital form, (b) in the process of being digitised, and (c) for which plans have been created for digitisation and to nominate literature to be digitised.
Bibliographic Management
— Boris Jacob —
http://bhleurope.gbv.de/
http://bhleurope.gbv.de/
Metadata Retrieval Services
Examples
http://gso.gbv.de/DB=1.83/
License
N/A
Last release
Unreleased
Goobi
Goobi is an open source software application for digitisation projects and workflow management in libraries, museums and archives.
Digitization, Managing
— Goobi —
http://www.goobi.org/en/
https://github.com/goobi
Collection Management, Media Checker/Validation, Digital Preservation
Examples
http://www.goobi.org/en/references/
License
GNU/GPL (General Public License)
Last release
12/20/2013
Last activity
7 May 2014 (community) 21 Jun 2015 (github)
Graphite
Graphite is a PHP Library, built on top of ARC2, to make it easy to do stuff with RDF data really quickly, without having to naff around with databases.
Adding Metainformation, Programming
— Christopher Gutteridge —
graphite.ecs.soton.ac.uk
https://github.com/cgutteridge/Graphite
Collection Management
Quality of Documentation
Good on the website
License
LGPL
Last release
3/11/2012
Last activity
7/20/2015
Grid
Grid is the Guardian’s new image management system, which provides a universal and fast experience accessing media that is organised and using it in an affordable way to produce high-quality content. Grid runs as a set of independent micro-services (Scala and Play Framework) exposed as hypermedia APIs (argo) and accessed using a rich Web user interface (AngularJS). Grid relies on Elasticsearch for blazing-fast searching, and AWS services as additional storage and communication mechanisms.
Publishing, managing
— The Guardian —
https://github.com/guardian/grid
https://github.com/guardian/grid
Digital Asset Management, Media Applications, Publication
Quality of Documentation
Extensive
Examples
https://www.youtube.com/watch?v=ZoddCAH9EPE
License
Apache 2.0
Last activity
ongoing
griot
Griot is an open-source iPad application that facilitates engagement with a collection of objects (artifacts, artwork, graphs, or anything else that can be represented visually) through annotations (points of interest on the object itself) and stories (related text-based and multimedia content, presented as a series of pages).
Annotating, Accessing
— Minneapolis Institute of Art —
https://github.com/artsmia/griot
https://github.com/artsmia/griot
Digital Asset Management, Collection Management, Media Applications
Quality of Documentation
Limited
Examples
http://artsmia.github.io/griot/#/
License
MIT
Last activity
2/21/2015
gttext
OCR free software and Ground Truthing tool: the gttext project extracts text from colour images.
Imaging, OCR
http://www.softocr.com/
http://www.softocr.com/downloads
Metadata Retrieval Services
Quality of Documentation
None unless with the download
License
GPL v2
Last release
2/27/2013
Harvester
Extract URLs from text, source code or search engine results. Produces a clean list of URLs.
Gathering
— Digital Methods Initiative —
https://wiki.digitalmethods.net/Dmi/ToolHarvester
Metadata Retrieval Services
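As a rough illustration of what "extract URLs from text and produce a clean list" can mean in practice, here is a minimal Python sketch. It is not the Harvester's own code: the regular expression, the function name `extract_urls` and the clean-up rules (dropping trailing punctuation, removing duplicates) are assumptions made for this example.

```python
import re

# Hypothetical pattern: an http(s) URL runs until whitespace or a common delimiter.
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+")


def extract_urls(text):
    """Return the unique URLs found in text, in order of first appearance."""
    seen = set()
    urls = []
    for match in URL_PATTERN.findall(text):
        url = match.rstrip(".,;:!?")  # drop sentence punctuation glued to the URL
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


if __name__ == "__main__":
    sample = 'See http://ggplot2.org/, and <a href="https://github.com/hadley/ggplot2">code</a>.'
    for url in extract_urls(sample):
        print(url)
```

A real harvester would also have to decide how to treat relative links and URLs without a scheme; this sketch ignores both.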
Heidrun
Heiðrún (a.k.a. Heidrun, pronounced [roughly] hey-droon) is
DPLA’s new metadata aggregation system, which we use to harvest
metadata from Hubs, map it to the DPLA Metadata Application Profile,
enrich it to clean up and add value, and to index it for use in the DPLA
Platform API. Heiðrún is implemented as a Ruby on Rails application
that builds on Krikri, a
Ruby gem for metadata harvesting, mapping, and enrichment. Heiðrún and
Krikri are both released as open source software under the MIT License.
Gathering, Adding Metainformation
— DPLA —
https://digitalpubliclibraryofamerica.atlassian.net/wiki/display/TECH/Heidrun
https://github.com/dpla/heidrun
Ingestion Tool
Quality of Documentation
Good on the website
License
MIT
Last release
22/2/2015
Last activity
7/1/2015
Hydra
Hydra is a repository solution that is being used by institutions worldwide to provide access to their digital content. Hydra provides a versatile and feature rich environment for end-users and repository administrators alike.
Curating, Managing, Preservation
http://projecthydra.org/
https://github.com/projecthydra
Digital Preservation, Digital Asset Management
Quality of Documentation
V Good
Examples
http://projecthydra.org/apps-demos-2-2/
License
Apache 2 license
Last release
7/14/2015
Last activity
ongoing
HyperImage
The HyperImage platform supports the linking of (audio)-visual objects,
texts and mixed-media documents. HyperImage allows any number of
details, or subregions, within an image to be highlighted and described,
and for annotations within a corpus to be linked to each other, making
them accessible in indices. Interim results as well as final versions
can be compiled at any time as an online/offline hypermedia publication.
This makes HyperImage a suitable research environment for digital
humanities and eScience projects, providing a common research and
publication environment for groups as well as individuals.
Annotating, Analyzing
— Dr. Heinz-Günter Kuper, Dr. Jens-Martin Loebel —
http://hyperimage.ws/en/team-en/
http://sourceforge.net/p/hyperimage/hi3repos/ci/master/tree/
Media Annotation/Tagging
Quality of Documentation
Extensive
Examples
http://telota.bbaw.de/cvma/HyperCVMA/StendalDom/
License
Apache 2 license
Last release
6/15/2015
Last activity
ongoing
Hypothesis
Hypothes.is brings community peer review to The Internet. It is a web application which enables rich annotation of web content. The project acts as a community aggregator for annotations and identity provider for annotators. It also serves embed code for an annotation agent designed with modern web browsers in mind.
Crowdsourcing, Annotating
— Hypothesis —
https://hypothes.is/
https://github.com/hypothesis/h
Media Annotation/Tagging, Contextualisation
Quality of Documentation
It is a very simple tool to install and use.
License
mixed open source licenses. 'We typically use the 2-Clause BSD License (aka the “Simplified BSD License” or the “FreeBSD License”).'
Last activity
actively ongoing
ICA-AtoM
ICA-AtoM is a web-based archival description software that is based on the International Council on Archives (‘ICA’) standards. ‘AtoM’ is an acronym for ‘Access to Memory’.
ICA-AtoM is multi-lingual and supports multi-repository collections.
Adding metainformation, Curating
— Artefactual Systems in collaboration with the ICA Program Commission (PCOM) —
https://www.ica-atom.org/
https://github.com/artefactual/atom
Collection Management, Repository Software
Quality of Documentation
The source code is maintained on GitHub (https://github.com/artefactual/atom), where the majority of the additions had already happened by October 2012. A live demo of the software is available at http://demo.ica-atom.org/ and different types of manuals can be found at https://www.ica-atom.org/doc/Main_Page.
Ease of Adaptation
There are approximately 250 institutions worldwide running ICA-AtoM. A list of them can be found at https://www.ica-atom.org/doc/ICA-AtoM_users.
Code Quality
The code is maintained on GitHub. PHP is the main language. There are 8 contributors, but only 4 of them are active. The last AtoM maintenance release (2.0.1) was on 16.12.13.
Examples
http://demo.ica-atom.org/
License
GPL
Last release
v1.3.1
Last activity
actively ongoing
IIPImage
IIPImage is an advanced high-performance feature-rich image server system for web-based streamed viewing and zooming of ultra high-resolution images.
Visualizing, Publishing
— Ruven Pillay —
http://iipimage.sf.net
https://github.com/ruven/iipsrv
Exhibition Management, Media Applications
Quality of Documentation
Documentation for IIPImage is extensive and complete. It is neatly organized and includes useful troubleshooting procedures. All documentation can be found at http://iipimage.sourceforge.net/documentation/. A live demo can be found on the IIPImage website: http://iipimage.sourceforge.net/demo/
Ease of Adaptation
You can use the IIPImage Server as it is. If you want to plug in another tool you will have to code it yourself. There is no plug-in ability. Customization is done on the client side (see IIPMooViewer). Quite a lot of institutions are using the IIPImageServer, some of them can be found here: http://iipimage.sourceforge.net/links/.
Code Quality
There are installation packages available for Debian/Ubuntu, Fedora/Redhat/CentOS, Windows and Mac OS X. Nevertheless it is recommended that you compile the server yourself, if possible, in order to fully optimize it for your system. The server is written in C++.
Examples
http://iipimage.sourceforge.net/demo/
License
GNU General Public License, v. 3
Last release
4/9/2011
IIPMooViewer
IIPMooViewer is a high-performance, lightweight HTML5 Ajax-based JavaScript image streaming and zooming client designed for the IIPImage high-resolution imaging system.
Visualizing
— Ruven Pillay —
http://iipimage.sourceforge.net
https://github.com/ruven/iipmooviewer
Media Applications
Ease of Adaptation
If you have an IIPImage Server up and running it is easy to implement a client application with the IIPMooViewer.
Code Quality
The JavaScript files are hosted on GitHub https://github.com/ruven/iipmooviewer.
License
GNU General Public License
Image Similarity Client
Image similarity search source code
Seeking
— Sergiu Gordea —
https://github.com/europeana/Europeana-Creative/tree/master/image-similarity-client
https://github.com/europeana/Europeana-Creative/tree/master/image-similarity-client
Image Similarity
Examples
http://image-similarity.ait.ac.at/imagedemo/
Image Twinning
Beta Image Twinning plug-in for WP developed as part of Europeana Creative
Visualizing
— AIT —
https://wordpress.org/plugins/image-twinning/
https://wordpress.org/plugins/image-twinning/developers/
Image Similarity
ImageJ
It can display, edit, analyze, process, save and print 8-bit, 16-bit and 32-bit images. It can read many image formats including TIFF, GIF, JPEG, BMP, DICOM, FITS and “raw”. It supports “stacks”, a series of images that share a single window. It is multithreaded, so time-consuming operations such as image file reading can be performed in parallel with other operations.
Visualizing
— Wayne Rasband —
http://rsbweb.nih.gov/ij/docs/intro.html
http://rsbweb.nih.gov/ij/download.html
Media Checker/Validation, Digital Asset Management, Media Applications
ImageMontage
ImageMontage creates montages of images without the need to load them
into a stack first. The source images can be located in one or multiple
directories. The images can also have different sizes. To fit images
into a montage, the macro scales them vertically to the same
user-specified size. Images are added horizontally until they no longer
fit on the row and then a new row is started. This process is repeated
until all source images are rendered. If you see that the generated
montage does not include all source images, change the options to reduce
the size of images and/or increase canvas dimensions, and run the macro
again.
Visualizing
— Lev Manovich, Matias Giachino, Jay Chow. —
http://lab.softwarestudies.com/2014/03/how-to-visualize-4512-instagram-selfies.html?utm_content=buffer9ee7e&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer
http://rsbweb.nih.gov/ij/plugins/image-montage/index.html moved to https://github.com/culturevis
Media Applications, Exhibition Management
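The row-filling procedure described for ImageMontage above can be sketched in a few lines of Python. This is an illustration only, not the macro's actual code (ImageMontage is an ImageJ macro): the function name `layout_montage`, its parameters and the returned tuple layout are all assumptions made for this example.

```python
def layout_montage(sizes, row_height, canvas_width, canvas_height):
    """Place images row by row; return (positions, number_not_placed).

    sizes: list of (width, height) tuples of the source images.
    positions: one (x, y, scaled_width, row_height) tuple per placed image.
    """
    positions = []
    x = y = 0
    for index, (width, height) in enumerate(sizes):
        # Scale every image to the same user-specified height.
        scaled_width = round(width * row_height / height)
        # Start a new row when the image no longer fits on the current one.
        if x > 0 and x + scaled_width > canvas_width:
            x = 0
            y += row_height
        # Stop when the canvas is full; the remaining images are left out.
        if y + row_height > canvas_height:
            return positions, len(sizes) - index
        positions.append((x, y, scaled_width, row_height))
        x += scaled_width
    return positions, 0


if __name__ == "__main__":
    placed, missing = layout_montage([(200, 100), (100, 100), (300, 100)], 50, 200, 100)
    print(placed, missing)
```

A non-zero second return value corresponds to the situation mentioned in the description: the montage does not include all source images, so the image size should be reduced or the canvas enlarged before running again.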
ImagePlot
ImagePlot is a free software tool that visualizes collections of images and video of any size. It is implemented as a macro which works with the open source image processing program ImageJ.
Visualizing
— Software Studies Initiative —
http://lab.softwarestudies.com/p/imageplot.html#features1
https://docs.google.com/document/d/1zkeik0v2LJmi1TOK4OxT7dVKJO7oCmx_fNP8SYdTG-U/edit?hl=en_US moved to https://github.com/culturevis
Media Applications
Internet Archive BookReader
The Internet Archive BookReader is used to view books from the Internet Archive online and can also be used to view other books
Resource Sharing, Visualizing
https://openlibrary.org/dev/docs/bookreader
https://github.com/openlibrary/bookreader
Content Retrieval Services
Quality of Documentation
Documentation can be found here: https://openlibrary.org/dev/docs. Everything necessary seems to be there.
Ease of Adaptation
The easiest way to find out how the Internet Archive BookReader works is to run “git clone https://github.com/openlibrary/bookreader.git”, go to BookReaderDemo/index.html and open it in your browser. Next have a look at BookReaderDemo/BookReaderJSSimple.js.
Code Quality
Nearly all the functionality is in BookReader/BookReader.js. There are some comments in the code, but not many. There are 8 contributors, but only one of them is actively contributing.
Examples
http://www.archive.org/stream/birdbookillustra00reedrich#page/n3/mode/2up
interstitial
interstitial is a tool designed to detect dropped samples in audio digitization processes.
Preserving
— AudioVisual Preservation Solutions —
http://www.avpreserve.com/avpsresources/tools/
https://github.com/avpreserve/interstitial
Digital Preservation, Media Checker/Validation
License
Apache license, v.2.
Islandora
Islandora is an open source framework that combines the Drupal and Fedora open software applications to create a robust digital asset management system that can be fitted to meet the short and long term collaborative requirements of digital data stewardship. Additional open source applications are added to this core stack to create what we call Solution Packs.
Managing, Collaborating, Storing
— The Islandora Foundation —
http://islandora.ca/
https://github.com/Islandora
Digital Asset Management
Quality of Documentation
A Wiki is set up to provide documentation https://wiki.duraspace.org/display/ISLANDORA/Islandora.
Ease of Adaptation
There seems to be a very active user group: https://groups.google.com/forum/#!forum/islandora-dev and there are 63 installations listed: http://islandora.ca/islandora-installations.
Code Quality
Github: https://github.com/islandora hosts all the repositories for Islandora. All of the 58 repositories are public. The code is written mainly in PHP and JavaScript.
Examples
http://sandbox.islandora.ca/
License
GNU-GPL
Last release
16/04/2012
Last activity
actively ongoing
JHOVE
JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects.
Processing, Preserving
— JSTOR & the Harvard University Library —
http://openpreservation.org/technology/products/jhove/
https://github.com/openpreserve/jhove
Digital Preservation, Media Annotation/Tagging
Quality of Documentation
Documentation is at http://jhove.openpreservation.org/
License
LGPLv2
Last release
1.14 (May 12, 2016)
Last activity
ongoing
JHOVE2
Processing, Preserving
jMet2Ont
Modifying
Joomla Europeana Search Component
Seeking
Julius
Data recognition, Transcribing
Kaltura
A standard Drupal Module that integrates the Kaltura video platform capabilities into Drupal, allowing you to instantly enrich your Drupal site with video, audio, and images. Kaltura’s video platform and video module for Drupal is an enterprise grade solution that offers comprehensive powerful functionality and full control over content ownership, without the need to run your own video operation. The module handles every aspect of rich-media, including uploading and importing content, transcoding, content management, syndication, monetization, distribution, and more.
Web-developing, Publishing
— Kaltura —
http://corp.kaltura.com/
https://github.com/kaltura/platform-install-packages#documentation-and-guides
Media Annotation/Tagging, Media Applications
Kamailio
Kamailio is an industrial-strength, free server for realtime communication, based on the Session Initiation Protocol
Communicating
— Kamailio —
http://www.kamailio.org/w/
https://github.com/kamailio/kamailio
SIP
License
GNU GPL v2.0
Last release
10/2/2015
Kartograph
Kartograph is a simple and lightweight framework for building interactive map applications without Google Maps or any other mapping service.
Visualizing, Mapping, Capturing
— Gregor Aisch —
http://kartograph.org/
https://github.com/kartograph/kartograph.py and https://github.com/kartograph/kartograph.js
Geo-spatial Applications, Data Visualization
Quality of Documentation
There are two different versions of this application, each with its own specialty features, which are clearly stated on the main page. Each version has its own clear, descriptive documentation both on the homepage and on GitHub. No technical support is available, but there are active demos.
License
LGPL (Kartograph.js) & AGPL (Kartograph.py)
KAT
Annotating
Koha
Koha is the first free and open source software library automation package (ILS). Development is sponsored by libraries of varying types and sizes, volunteers, and support companies from around the world.
Adding Metainformation
— Various —
http://koha-community.org/
http://git.koha-community.org/gitweb/
Metadata Retrieval Services, Search and Browsing
Quality of Documentation
Very thorough and detailed documentation that is updated periodically and is extremely easy to navigate on the Koha website. Also the documentation is available in multiple languages but mostly just for older versions. Technical support is also available.
License
GPL-2, or later
Last activity
4/13/2012
Korbo
Korbo is a Semantic Web basket manager. It allows users to search, import and augment Linked Data resources. Personal augmented collections created with Korbo, are then republished in the Linked Data cloud. Korbo is part of the Muruca suite.
Enriching
— Net7 —
http://korbo.muruca.org/
Semantic Enrichment
Last release
Not yet production ready but prototype released 1/7/2012
KriKri
A Rails engine for metadata aggregation, enhancement, and quality control.
Adding metainformation
— DPLA —
Metadata Retrieval Services
License
MIT
Last release
24/2/2015
Kuali OLE
Kuali OLE is the first system designed by and for academic and research libraries for managing and delivering intellectual information. A community of partners will deliver an enterprise-ready,
community-source software package to manage and provide access not only
to items in their collections but also to licensed and local digital
content. Kuali OLE (pronounced oh-LAY, for Open Library Environment)
features a governance model in which the entire library community can
collaborate to own the resulting intellectual property.
Managing, accessing
— Kuali OLE https://www.kuali.org/ole/organization —
www.kuali.org/ole
https://wiki.kuali.org/display/OLE/OLE+Published+Documentation
Collection Management
License
ECL 2.0
Last release
5/1/2015
Last activity
on-going
Leaflet-IIIF
Leaflet plugin for viewing IIIF images
Visualizing, Publishing
— Jack Reed/Stanford University —
https://github.com/mejackreed/Leaflet-IIIF
https://github.com/mejackreed/Leaflet-IIIF
Publication, Geo-spatial Applications
Leaflet
Leaflet is an open source JavaScript library for mobile-friendly interactive maps
Visualizing, Mapping
— Vladimir Agafonkin —
http://leafletjs.com/
https://github.com/Leaflet/Leaflet
Data Visualization, Geo-spatial Applications
License
BSD license
Last release
11/18/2013
LinkedTV editorial-tool
In the LinkedTV project (http://linkedtv.eu), the Editor Tool (ET) enables moderating (named) entities and hyperlinks, which are automatically extracted from audiovisual content. This data is created to provide a useful starting point for editorial teams that are responsible for publishing interactive television programmes, which aim to offer contextual information to viewers.
Contextualizing, Extracting metadata
— Jaap Blom, NISV —
https://github.com/beeldengeluid/linkedtv-editortool
https://github.com/beeldengeluid/linkedtv-editortool
Metadata Retrieval Services, Digital Asset Management, Media Annotation/Tagging
Examples
http://linkedtv.eu/
LoCloud_bglink
Background link service for LoCloud
This repository contains the background link service module developed
within the LoCloud project. The module consists of a PHP script that
implements a REST service, calls DBpedia
Spotlight
for the actual processing, and wraps the answer into a suitable
format.
The background link service uses DBpedia Spotlight as a backbone for
performing the linking. In principle, the service can be used in any
language, the only requirement being that a DBpedia spotlight instance for
this particular language is running.
enriching
— LoCloud —
http://www.locloud.eu/
https://github.com/ixa-ehu/locloud_bglink
Contextualisation, Linked Open Data
License
Apache v 2.0
Last activity
11/26/2014
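The last step described for the background link service above, wrapping the DBpedia Spotlight answer into a suitable format, might look roughly like the following. The actual module is a PHP script; this Python sketch only illustrates the idea. The input field names (`Resources`, `@URI`, `@surfaceForm`, `@offset`) are assumed to follow Spotlight's JSON output, and the output record layout and the function name `wrap_spotlight_answer` are hypothetical.

```python
import json


def wrap_spotlight_answer(raw_json):
    """Turn a Spotlight-style JSON answer into a list of simple link records."""
    answer = json.loads(raw_json)
    links = []
    # "Resources" is treated as optional: no key means nothing was linked.
    for resource in answer.get("Resources", []):
        links.append({
            "text": resource["@surfaceForm"],   # the phrase that was linked
            "offset": int(resource["@offset"]),  # character position in the input
            "uri": resource["@URI"],             # the DBpedia resource it points to
        })
    return links


if __name__ == "__main__":
    # Hand-written sample input, not a recorded Spotlight response.
    sample = json.dumps({
        "Resources": [
            {"@URI": "http://dbpedia.org/resource/Bilbao",
             "@surfaceForm": "Bilbao", "@offset": "10"}
        ]
    })
    print(wrap_spotlight_answer(sample))
```

Because the wrapper only reshapes the answer, the same code would work for any language for which a Spotlight instance is running, which matches the language-independence noted in the description.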
LodLive
LodLive project provides a demonstration of the use of Linked Data standards (RDF, SPARQL) to browse RDF resources
Browsing
— Diego Valerio Camarda and Silvia Mazzini (http://www.regesta.com/), Alessandro Antonuccio (http://www.hstudio.it/) —
http://en.lodlive.it/
https://github.com/dvcama/LodLive
Metadata Retrieval Services
License
MIT License
Last release
7/4/2012
Last activity
7/4/2012
LODmilla-frontend
LODmilla - a graph-based Linked Open Data browser. Intended as a prototype tool for generic LOD browsing. This frontend of LODmilla is written in “plain” jQuery, no js frameworks used (yet). Graph visualization is done by jsPlumb.
Browsing
— dsd-sztaki-hu —
https://github.com/dsd-sztaki-hu/LODmilla-frontend
https://github.com/dsd-sztaki-hu/LODmilla-frontend
Search and Browsing, Linked Open Data
Examples
http://munkapad.sztaki.hu/lodmilla/
License
Apache v 2.0
Last activity
2/9/2014
Loris IIIF Image Server
Archiving
— Jon Stroop —
https://github.com/pulibrary/loris
https://github.com/pulibrary/loris
Media Applications
Examples
http://libimages.princeton.edu/osd-demo/
License
GNU General Public License, v. 3 or later
MAchine Readable Cataloging (MARC)
MARC is a Ruby library for reading and writing MAchine Readable Cataloging (MARC) records.
Adding metainformation
— Kevin Clarke, Bill Dueber, William Groppe, Ross Singer, Ed Summers —
http://marc.rubyforge.org/
https://github.com/ruby-marc/ruby-marc/
Collection Management, Metadata Retrieval Services
License
MIT License
Last release
2012
Last activity
2012
Map vectorizer
An open-source map vectorizer
Visualizing
— Mauricio Giraldo Arteaga / New York Public Library Labs —
http://www.nypl.org/collections/labs
https://github.com/NYPL/map-vectorizer
Metadata Retrieval Services, Geo-spatial Applications
MARCXML Toolkit
The MARCXML toolkit is a set of Java programs which allow users to convert to and from the MARC file format (including full character set conversion) and other formats available in the MARCXML architecture.
Converting
http://www.loc.gov/standards/marcxml/
http://www.loc.gov/standards/marcxml/
Metadata Mapping/Conversion/normalisation
Quality of Documentation
No documentation of the source code (marcxml.jar) could be located.
Ease of Adaptation
It is not clear where to find the source code!
Code Quality
A Windows Batch file (marcxml.bat) has to be used to convert to and from the MARC file format. The recommended Java Version is 1.4 which was released in 2002!
License
GNU LGPL, Version 2.1
MDQC
A cross-platform tool designed to expedite metadata quality control across large numbers of digital assets.
Adding metainformation, Validating
— AudioVisual Preservation Solutions —
http://www.avpreserve.com/avpsresources/tools/
https://github.com/avpreserve/mdqc
Metadata Retrieval Services, Collection Management, Metadata Mapping/Conversion/normalisation
License
Apache License, V. 2
MediaInfo
A convenient unified display of the most relevant technical and tag data for video and audio files.
Presenting
— MediaArea —
http://mediainfo.sourceforge.net/en
Ontology/Vocabulary Building and Management, Media Applications
Quality of Documentation
The software is very easy to use. Documentation could not be located, but it is also not necessary.
Ease of Adaptation
Bug reports (http://sourceforge.net/p/mediainfo/bugs/) and feature requests (http://sourceforge.net/p/mediainfo/feature-requests/) can be filed and there is also a forum for all other questions (http://sourceforge.net/p/mediainfo/discussion/). They are all very active.
Code Quality
The code is hosted on sourceforge http://sourceforge.net/p/mediainfo/code/commit_browser and there seems to be development ongoing. Binaries for nearly all operating systems are available: http://mediaarea.net/en/MediaInfo/Download.
License
LGPL license
Last release
0.7.86 (May 31, 2016)
Last activity
actively ongoing
MediaMosa
MediaMosa is open source software to build a Full Featured, Webservice Oriented Media Management and Distribution platform.
Managing, Presenting
— MediaMosa —
www.mediamosa.org
https://github.com/mediamosa
Collection Management, Content Retrieval Services, Media Applications
Quality of Documentation
It is quite easy to find documentation on http://www.mediamosa.org.
Ease of Adaptation
MediaMosa is based on the Representational State Transfer (REST) architecture. This makes it easy to attach your own tools. There is a forum (http://www.mediamosa.org/forum/index.php) for bug reports and feature requests. The community seems to be very helpful.
Code Quality
Most of the code is written in PHP. There is sufficient documentation in the code. There are 9 contributors registered on GitHub. Two of them are committing the majority of the code. You can find the code here: https://github.com/mediamosa/mediamosa.
License
GNU General Public License
Last release
7/4/2012
Mediathread
Mediathread is a Django site for multimedia annotations facilitating
collaboration on video and image analysis. Developed at the Columbia
Center for New Media Teaching and Learning (CCNMTL)
Annotating, Collaborating, Analyzing
— Columbia Center for New Media Teaching and Learning —
http://mediathread.info/content/about-mediathread
https://github.com/ccnmtl/mediathread
Media Annotation/Tagging, Interactive User Interface
License
GNU GPL
Last release
31/1/2015
MediaWiki
MediaWiki is free server-based software which is licensed under the GNU General Public License (GPL). It’s designed to be run on a large server farm for a website that gets millions of hits per day.
MediaWiki is an extremely powerful, scalable software and a feature-rich wiki implementation that uses PHP to process and display data stored in a database, such as MySQL.
Pages use MediaWiki’s wikitext format, so that users without knowledge of XHTML or CSS can edit them easily.
When a user submits an edit to a page, MediaWiki writes it to the
database, but without deleting the previous versions of the page, thus
allowing easy reverts in case of vandalism or spamming. MediaWiki can
manage image and multimedia files, too, which are stored in the
filesystem. For large wikis with lots of users, MediaWiki supports
caching and can be easily coupled with Squid proxy server software.
Processing, Presenting
— Wikimedia —
https://www.mediawiki.org/wiki/MediaWiki
https://www.mediawiki.org/wiki/Download
Collection Management, Linked Open Data
Examples
http://www.transcribe-bentham.da.ulcc.ac.uk/td/Transcribe_Bentham
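The revision behaviour described for MediaWiki above (every edit is stored without deleting earlier versions, so a revert is always possible) can be illustrated with a small append-only structure. This is a conceptual Python sketch only; it does not reflect MediaWiki's PHP code or database schema, and the class and method names are hypothetical.

```python
class PageHistory:
    """Append-only store of page texts: saving never overwrites old versions."""

    def __init__(self):
        self.revisions = []  # every saved text, oldest first

    def save(self, text):
        """Store a new revision and return its id (its position in the list)."""
        self.revisions.append(text)
        return len(self.revisions) - 1

    def current(self):
        """Return the latest text, or None for a page with no revisions."""
        return self.revisions[-1] if self.revisions else None

    def revert_to(self, revision_id):
        """Undo later edits by saving an old text again as a new revision."""
        return self.save(self.revisions[revision_id])


if __name__ == "__main__":
    page = PageHistory()
    good = page.save("Jeremy Bentham was a philosopher.")
    page.save("SPAM SPAM SPAM")
    page.revert_to(good)
    print(page.current())
    print(len(page.revisions))
```

In this sketch a revert is itself recorded as a new revision, so the vandalised version stays in the history rather than being erased.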
MFCS Metadata Form Creation System
The Metadata Form Creation System (MFCS) is WVU Libraries’ answer for providing an easy-to-use interface for librarians, staff, and students for entering metadata and uploading digital items for our digital collections. MFCS is also our archival and preservation system. MFCS is a delivery- and repository-agnostic system.
Processing, Archiving, Preserving
— Michael Bond WVU Libraries —
https://github.com/wvulibraries/mfcs
https://github.com/wvulibraries/mfcs
Collection Management, Digital Asset Management, Digital Preservation
Examples
http://www.youtube.com/watch?v=8RVqZNPsf8A
License
WVU Open Source License
Last release
20/5/2014
MINT ingestion tool
MINT services compose a web based platform that was designed and developed to facilitate aggregation initiatives for cultural heritage content and metadata in Europe.
Collecting, Adding metainformation, Aggregating
— Katerina Komninou, Nasos Drosopoulos, Vasilis Tzouvaras, Anna Christaki, Arne Stabenau, Costas Pardalis, Fotis Xenikoudakis, Nikos Simou —
http://mint.image.ece.ntua.gr/
https://github.com/mint-ntua/Mint-Athena
Metadata Mapping/Conversion/normalisation, Collection Management, Metadata Retrieval Services
Quality of Documentation
The first release of MINT was authored 2 years ago: https://github.com/mint-ntua/Mint-Athena. A more recent version is developed by the DM2E (Digitised Manuscripts to Europeana) project: https://github.com/DM2E/dm2e-mint. A user manual is available at http://mint.image.ece.ntua.gr/mint2/documentation/, although some screenshots would make it easier to understand.
Ease of Adaptation
Mint was designed to operate on different aggregation schemes. Hence it is easy to plug-in your own schema.
Code Quality
The code is mainly written in Java. There are enough comments in the code, and JavaDoc can be generated. The main contributions come from EU-funded projects. It is hard to say whether there is also a user community outside of these projects.
Examples
http://oreo.image.ece.ntua.gr:8080/mint/Login_input.action
License
GNU Affero General Public License
Last release
11/30/2011
Mirador
An open-source, web-based ‘multi-up’ viewer that supports zoom-pan-rotate functionality, ability to display/compare simple images, and images with annotations
Presenting
— Stanford University —
http://projectmirador.org/
https://github.com/IIIF/mirador
Media Applications, Media Annotation/Tagging
Quality of Documentation
The quality of the documentation for developers is excellent. Everything you need can be found here: https://github.com/IIIF/mirador/blob/master/README.md. There is no tutorial for users available but this is also not necessary.
Ease of Adaptation
Mirador can connect to repositories that provide an IIIF-compliant Metadata API.
Code Quality
The project started in August 2013. At the moment there are 4 contributors. JavaScript is the language of choice. A “first-time-setup” https://github.com/openseadragon/openseadragon#first-time-setup is quite easy to do. Just install GRUNT (http://gruntjs.com/getting-started) beforehand. On the command line, in the mirador folder type “grunt server” and open in your browser http://localhost:8000.
Examples
http://projectmirador.org/demo/#0e503169-f07e-4b0b-b1c9-8f08066b1547
License
Apache License, Version 2.0
Last release
15/14/2015
mozjpeg
Modern JPEG encoder designed to reduce the size and load time of webpages that carry a lot of pictures. It was announced in March 2014 and is today supported by big websites like Facebook and used in tools like ImageOptim. Mozjpeg supports optimized Huffman tables, custom quantization matrices and modern techniques like trellis quantization while maintaining the same values for the structural similarity index (SSIM).
Encoding
— Josh Aas, Mozilla research —
https://github.com/mozilla/mozjpeg
https://github.com/mozilla/mozjpeg
License
custom (with Copyleft)
Last release
5/18/2015 (v3.1)
Last activity
5/18/2015
Muse Open Source
Muse Open Source is a free software platform for publishing native iPad apps for digital libraries with cultural heritage content.
Publishing
— Glimworm IT —
http://muse-opensource.org/
https://github.com/jonathanrcarter/muse-opensource/wiki
Media Applications, Exhibition Management, Social Applications
Quality of Documentation
The link to the documentation (wiki) does not work!
Ease of Adaptation
The software is based on the Appcelerator Titanium Platform, which creates iOS, Android, and mobile web apps from a single code base. It depends heavily on the Appcelerator Platform.
Code Quality
The Muse Open Source platform seems to be developed by two people. There has been no activity in the last half year.
Last release
5/30/2013
Last activity
5/30/2013
Music 21
Music21 is a set of tools for helping
scholars and other active listeners answer questions about music quickly
and simply. If you’ve ever asked yourself a question like, “I wonder
how often Bach does that” or “I wish I knew which band was the
first to use these chords in this order,” or “I’ll bet we’d know more
about Renaissance counterpoint (or Indian ragas or post-tonal pitch structures or the form of minuets) if I could write a program to automatically write more of them,” then music21 can help you with your work.
Seeking
— MIT —
http://web.mit.edu/music21/
https://code.google.com/p/music21/downloads/list
Data Visualization
License
LGPL
Last activity
2014
Neatline
The Scholars’ Lab designed Neatline as a suite of plugins for the open-source Omeka framework, which provides a powerful platform for content management and web publication. Through Neatline, you can create rich representations of places, objects, events, narratives, and documents.
Managing, Publishing, Presenting
— Adam Soroka —
http://neatline.org/
https://github.com/scholarslab/Neatline
Exhibition Management, Social Applications, Media Annotation/Tagging
Quality of Documentation
Neatline's documentation is extremely organized and thorough. It includes visible demos of the application and its capabilities. There is no technical support, but the website is updated periodically.
Examples
http://neatline.org/neatline-in-action/
License
Apache License v 2.0
Last release
2012
Last activity
actively ongoing
Neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
Machine Learning
— Stanford University —
http://cs.stanford.edu/people/karpathy/deepimagesent/
https://github.com/karpathy/neuraltalk
Media Applications, Media Checker/Validation
Examples
http://cs.stanford.edu/people/karpathy/deepimagesent/
License
BSD License
Last release
November 2014
Nodel
Nodel is an open source digital media control system built on Java and Python.
It uses a series of nodes that perform “actions” or respond to
“signals” to control various devices on a network, allowing quick and
easy management of digital media devices.
Nodel is typically used to control digital media in galleries, museums, corporate meeting rooms and digital signage.
In short, Nodel can manage any programmable device across a wide range of platforms.
Managing
— Museum Victoria —
http://nodel.io/
https://github.com/museumvictoria/nodel
Collection Management, Exhibition Management
Examples
https://raw.githubusercontent.com/museumvictoria/nodel/gh-pages/docs/White_Paper-Nodel.pdf
License
Mozilla Public License 2.0/MIT License
Last release
5/20/2014
NotreDAM
NotreDAM is a multi-user, web-based Digital Asset Management platform
Managing
www.notredam.org
http://code.google.com/p/notredam/
Collection Management, Digital Asset Management, Metadata Retrieval Services
Quality of Documentation
The documentation for NotreDAM is very well organized and thorough. There are demos available as well as an active forum.
Examples
http://notredam.org/demo/
Numishare
Numishare is an open source suite of applications for managing digital cultural heritage artifacts, with a particular focus on coins and medals.
Managing
— Ethan Gruber, American Numismatic Society —
http://numishare.blogspot.com/
https://github.com/ewg118/numishare
Collection Management, Metadata Mapping/Conversion/normalisation, Exhibition Management
Quality of Documentation
The documentation for Numishare is very minimal and not easy to find. There also appear to be no regular updates and no active forum for discussion.
Examples
http://numismatics.org/search/ http://numismatics.org/ocre/
License
Apache License 2.0
Last release
December 2010
Last activity
26/10/2012 - constant development
Nuxeo Platform
Nuxeo provides a modular, extensible Java-based open source software platform for enterprise content management, and packaged applications for document management, digital asset management and case management.
Managing
— Nuxeo —
http://www.nuxeo.com
http://nuxeo.github.io/
Collection Management, Content Retrieval Services, Digital Asset Management
Quality of Documentation
The documentation for Nuxeo is very thorough and visible. There is also a very active forum that people actually use and comment on, and there are regular blog posts.
License
LGPL
Last release
10/30/2013
Last activity
actively ongoing
Ocrad.js
Optical character recognition program that can convert scanned images of text back into text.
Data Recognition
— Kevin Kwok —
http://antimatter15.github.io/ocrad.js/demo.html
https://github.com/antimatter15/ocrad.js
Content Retrieval Services, Digital Asset Management, Image Similarity
Examples
https://github.com/antimatter15/ocrad.js/tree/master/examples
License
GNU GPL
Last release
January 2014
ol3-cesium
OpenLayers - Cesium integration library. Create your map using OpenLayers 3, and visualize it on a globe with Cesium.
Visualizing
— OpenLayers —
http://blog.klokantech.com/2014/11/ol3-cesium-third-dimension-for.html
https://github.com/openlayers/ol3-cesium
Geo-spatial Applications
Examples
http://openlayers.org/ol3-cesium/examples/
License
Need to review their license
Last release
January 2015
ol3
A high-performance, feature-packed library for all your mapping needs
Visualizing
— OpenLayers —
http://openlayers.org/
https://github.com/openlayers/ol3
Geo-spatial Applications
Examples
http://openlayers.org/en/v3.2.1/examples/
License
Apache 2.0, BSD, MIT
Last release
13/2/2015
Omeka Contribution Plugin
Makes an Omeka site into one that accepts public contributions. The plugin provides a form to collect stories, images, or other files from the public and manages those contributions in your Omeka archive as items.
Publishing
— Center for History and New Media, George Mason University —
http://omeka.org/codex/Plugins/Contribution
https://github.com/omeka/plugin-Contribution
Exhibition Management, Interactive User Interface, Social Applications
License
GNU GPL
Last release
v 3.0.1 August 2014
Omeka S
A multisite reworking of Omeka on newer tech standards and aiming at interoperability with more systems
Publishing
— Roy Rosenzweig Center for History and New Media, George Mason University —
Exhibition Management
Quality of Documentation
In progress
Last release
v 0.4.0-alpha https://github.com/omeka/omeka-s/releases/tag/v0.4.0-alpha
Omeka
A Collection/Exhibition Management System
Publishing
— Roy Rosenzweig Center for History and New Media, George Mason University —
http://omeka.org/
https://github.com/omeka/Omeka
Exhibition Management, Collection Management
License
GNU GPL
Last release
2.4.1
Last activity
2016-05-25
Open Exhibits
Multitouch and multiuser software
Collaborating
— Ideum —
http://openexhibits.org/
https://github.com/ideum
Interactive User Interface, Exhibition Management
License
Simplified BSD License
Last release
v.3.0, 13 November, 2013
Open Licenses Service
Data on more than 100 open licenses, including all OSI-approved open source licenses and all Open Definition conformant open data and content licenses. Provided as easy-to-use, machine-readable JSON, which is perfect if you need to drop a license chooser into your app.
Seeking
— Open Knowledge Foundation —
http://licenses.opendefinition.org/
http://licenses.opendefinition.org/
Linked Open Data, Content Retrieval Services
License
MIT License
Last activity
1/2012
Open Monograph Press
Open Monograph Press is an open source software platform for managing the editorial workflow required to see monographs, edited volumes, and scholarly editions through internal and external review, editing, cataloguing, production, and publication. OMP can also operate as a press website with catalog, distribution, and sales capacities.
Managing, Editing, Cataloguing, Producing, Publishing
— Public Knowledge Project —
https://pkp.sfu.ca/omp/
https://pkp.sfu.ca/omp/omp_documentation/
Collection Management
Examples
https://pkp.sfu.ca/omp/omp_demo/
Open SKOS Client Ruby
A Ruby client for searching and retrieving SKOS concepts from an OpenSKOS instance over its RESTful API
Retrieving
— Europeana —
https://github.com/europeana/openskos-client-ruby
https://github.com/europeana/openskos-client-ruby
Linked Open Data
License
European Union Public License 1.1
Last release
25/6/2014
OpenSeadragon
An open-source, web-based viewer for zoomable images, implemented in pure JavaScript.
Editing, Browsing
http://openseadragon.github.io/
https://github.com/openseadragon/openseadragon
Media Applications
Quality of Documentation
The quality of the documentation for developers is excellent. Everything you need can be found here: https://github.com/IIIF/openseadragon/blob/master/README.md. There is no tutorial for users available but this is also not necessary.
Ease of Adaptation
Plugins can be used to display your image's scale in real-world measurements; enhance OpenSeadragon; provide coordinate conversion, pan, and zoom methods in a simplified coordinate system; and provide hooks into an OpenSeadragon.Viewer and/or OpenSeadragon.MouseTracker for overriding or extending the default user-input event handling behavior. An issue tracker is available on GitHub: https://github.com/openseadragon/openseadragon/issues.
Code Quality
The project started in January 2013. At the moment there are 23 contributors. JavaScript is the language of choice. A “first-time-setup” https://github.com/openseadragon/openseadragon#first-time-setup is quite easy to do. Just install GRUNT (http://gruntjs.com/getting-started) beforehand. On the command line, in the openseadragon folder type “grunt connect watch” and open in your browser http://localhost:8000/test/demo/basic.html.
License
BSD license http://openseadragon.github.io/license/
Last release
12/2013
Palette-server
palette-server is a small Flask based HTTP-pony to extract colours from an image.
Data Recognition, Extracting Data
— Cooper Hewitt —
http://labs.cooperhewitt.org/2013/giv-do/
https://github.com/cooperhewitt/palette-server
Content Retrieval Services, Search and Browsing, Image Search
License
N/A
Last release
2/2013
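A rough illustration of what extracting colours from an image can mean in practice. This is a minimal sketch under stated assumptions, not palette-server's actual method: the function name `dominant_colours`, the bucket width `step`, and the six-pixel sample are all hypothetical, and the technique shown (coarse quantization plus counting) is only one simple way to pick out dominant colours.

```python
from collections import Counter


def dominant_colours(pixels, count=3, step=32):
    """Return the `count` most common colours after coarse quantization.

    pixels: iterable of (r, g, b) tuples with 0-255 channels.
    step:   bucket width; each channel is snapped to the start of its bucket.
    (Hypothetical helper, not part of palette-server.)
    """
    buckets = Counter(
        tuple((channel // step) * step for channel in pixel)
        for pixel in pixels
    )
    # Format each winning bucket as a hex colour string.
    return ["#%02x%02x%02x" % colour for colour, _ in buckets.most_common(count)]


# Made-up 6-pixel "image": four reddish pixels and two bluish ones.
sample = [(250, 10, 10), (245, 5, 20), (255, 0, 0), (240, 15, 15),
          (10, 10, 250), (0, 20, 240)]
print(dominant_colours(sample, count=2))  # ['#e00000', '#0000e0']
```

A larger `step` merges more shades into one bucket, which gives a shorter and more stable palette at the cost of colour precision.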
pdf2htmlEX
pdf2htmlEX renders PDF files in HTML using modern Web technologies. It aims to provide accurate rendering while remaining optimized for Web display.
Conversioning
— Lu Wang —
http://coolwanglu.github.io/pdf2htmlEX/
https://github.com/coolwanglu/pdf2htmlEX
Semantic Extraction, Publication
License
GPLv3+
Last release
22/07/2015
Last activity
22/07/2015
PDFMiner
PDFMiner is a tool for extracting information from PDF documents.
Machine Learning, Extracting Data
— Yusuke Shinyama —
https://euske.github.io/pdfminer/
https://github.com/euske/pdfminer/
Content Retrieval Services, Semantic Extraction
Examples
http://pdf2html.tabesugi.net:8080/
License
MIT/X License
Last release
31/12/2007 - initial release
Last activity
5/4/2015
Pencilcase
Pencil Case is a web app which offers tools for designers & developers. It showcases over 750 resources in design, development, learning, productivity, collaboration, publishing, testing, and more, all tracked by popularity in real time.
Seeking
— Pencil Case —
http://pencilcaseapp.com/
Collection Management, Digital Asset Management
Last release
23/02/2013 (terms & guidelines)
Last activity
Copyright 2015
Plumi
Plumi is a free open-source video-sharing app based on Plone
Resource Sharing
— EngageMedia in collaboration with Unweb.me —
http://blog.plumi.org/
https://github.com/plumi/plumi.app
Exhibition Management, Media Applications
Examples
http://blog.plumi.org/sites-using-plumi/ http://demo.plumi.org/
License
As Plumi pulls together a range of different products, different licenses apply to different elements of the software. However, most are covered by either the GNU GPL or the Zope Public License.
Last release
4.5.2 (June 2015)
Last activity
2015
Popcorn.js
Popcorn.js is Mozilla’s HTML5 video and media library for the open web. It allows web developers, filmmakers, artists, designers and others to easily create timeline-based web productions. Popcorn.js helps simplify media API and implementation differences between browsers and includes a powerful event system and a rich plugin architecture and plugins.
Web-developing
— Mozilla —
http://popcornjs.org/
https://github.com/mozilla/popcorn-js
Media Applications
Quality of Documentation
Excellent
Examples
http://popcornjs.org/demos
License
MIT License
Last release
1.5.6
Posest
Posest is a library for 3D camera pose estimation implemented in C.
Posest is a C/C++ Library for Robust 6DoF Pose Estimation from 3D-2D Correspondences
Analyzing
— Manolis Lourakis (FORTH) The development of posest has been partially supported by the EC FP7 programme under grant no. 270138 DARWIN —
http://users.ics.forth.gr/~lourakis/posest/
Media Applications
Quality of Documentation
Rather weak
License
GNU GPL
Last release
1.1
Public Domain Calculator
Public Domain Calculator - determine what is public domain and what’s not.
Filtering
— Open Knowledge Foundation —
http://publicdomainworks.net/
https://github.com/okfn/pdcalc
Search and Browsing, Content Retrieval Services, Metadata Retrieval Services
License
Open Source
Last release
2014
Pundit
Pundit is a semantic web annotation tool. It allows users to create structured data in their annotations by creating semantic relations between different kinds of items, such as portions of text in a web page, images, Linked Data entities, or entries from a custom vocabulary. Annotations can be private or public and can be consumed by external applications via a REST API.
Annotating
— Net7 —
http://thepund.it
https://github.com/net7/pundit2
Semantic Enrichment, Data Annotation/Curation, Linked Open Data
Examples
http://thepund.it/try-pundit/
License
Pundit Code is divided into Pundit Server Code and Pundit Client code (http://thepund.it/license/)
Last release
2.0.51 beta
Last activity
2015
PyBossa
PyBossa is an open source platform for crowd-sourcing online (volunteer) assistance to perform tasks that require human cognition, knowledge or intelligence (e.g. image classification, transcription, information, location etc).
Crowdsourcing
— Shuttleworth OKFN Scifabric —
http://www.pybossa.com
https://github.com/PyBossa/pybossa
Digital Asset Management, Media Annotation/Tagging, Social Applications
Quality of Documentation
Excellent
License
GNU Affero General Public License
Last release
v0.2.2 (11/05/2015)
Last activity
2015
pymarc
Pymarc is a Python library for working with bibliographic data encoded in MARC21. It should work under Python 2.x and 3.x. It provides an API for reading, writing and modifying MARC records. It was mostly designed to be an emergency eject seat, for getting your data assets out of MARC and into some kind of saner representation.
Organizing, Bibliographic Management
— Gabriel Farrell, Mark Matienzo, Geoffrey Spear, Ed Summers —
http://pypi.python.org/pypi/pymarc
https://github.com/edsu/pymarc
Data Annotation/Curation
License
BSD
Last release
v3.0.4 (27/07/2015)
Last activity
2015
Question2Answer
A Q2A site helps your online community to share knowledge. People with questions get the answers they need. The community is enriched by commenting, voting, notifications, points and rankings.
Consulting
— Gideon Greenspan —
http://www.question2answer.org/
https://github.com/q2a/question2answer
Collection Management
Examples
http://demo.question2answer.org/
License
GNU GPL v2
Last release
1.7.1 (27/07/2015)
Last activity
2015
Quick and Dirty RDF browser
This tool returns all the triples in the file and verbosely lists each of their predicate and object values. Quick and easy. This is good for reading everything about a particular resource. The tool does not seem to support content negotiation.
Browsing
— Christopher Gutteridge & University of Southampton —
http://graphite.ecs.soton.ac.uk/browser/
http://graphite.ecs.soton.ac.uk/download.php/browser.php
Content Retrieval Services, Semantic Extraction
Examples
http://education.data.gov.uk/id/school/118217 http://eprints.ecs.soton.ac.uk/id/eprint/10053 http://id.southampton.ac.uk/building/59 http://data.totl.net/playingcards/ http://dbpedia.org/resource/Southampton
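The listing behaviour described for the Quick and Dirty RDF browser (every triple shown, with its predicate and object values spelled out) can be pictured roughly as follows. This is only a sketch under stated assumptions: the function `describe`, the grouping by subject, and the sample triples with their `ex:` names are made up for illustration and are not taken from the tool itself.

```python
from collections import defaultdict


def describe(triples):
    """Group (subject, predicate, object) triples by subject and list them.

    Hypothetical helper: returns one text block in which each subject is
    followed by its indented predicate/object pairs, in input order.
    """
    by_subject = defaultdict(list)
    for subject, predicate, obj in triples:
        by_subject[subject].append((predicate, obj))

    lines = []
    for subject, pairs in by_subject.items():
        lines.append(subject)
        for predicate, obj in pairs:
            lines.append(f"  {predicate} -> {obj}")
    return "\n".join(lines)


# Made-up sample data; a real tool would parse these from an RDF file.
sample_triples = [
    ("ex:thing1", "rdfs:label", "First thing"),
    ("ex:thing1", "rdf:type", "ex:Thing"),
    ("ex:thing2", "rdfs:label", "Second thing"),
]
print(describe(sample_triples))
```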
Razuna DAM
Open source digital asset management
Managing
— Razuna —
http://razuna.org/
https://github.com/razuna/razuna
Digital Asset Management
License
GNU Affero Public License v.3 or later
Last release
10/2013
RDF.rb
RDF.rb is a pure-Ruby library for working with Resource Description Framework (RDF) data.
Managing
— Arto Bendiken, Ben Lavender, Gregg Kellogg —
https://rubygems.org/gems/rdf
https://github.com/ruby-rdf/rdf
Collection Management
License
Public Domain
Last release
11/8/2013
RDF Refine
A Google Refine extension for exporting RDF
Validating, Extracting Data
— Fadi Maali and Richard Cyganiak —
refine.deri.ie
https://github.com/fadmaa/grefine-rdf-extension
Data Visualization, Collection Management
Examples
http://refine.deri.ie/showcases
Recline Chrome CSV Viewer
A Chrome extension which allows you to view, search, graph and map CSV files in the browser
Seeking, Visualizing
— Rufus Pollock —
https://chrome.google.com/webstore/detail/recline-csv-viewer/ibfcfelnbfhlbpelldnngdcklnndhael
https://github.com/rgrp/chrome-csv-viewer
Data Visualization
License
MIT license
Last release
5/1/2013
Recline.js
A simple but powerful library for building data applications in pure Javascript and HTML.
Programming
— Max Ogden, Rufus Pollock —
http://reclinejs.com/
https://github.com/okfn/recline
Infrastructure
Examples
http://reclinejs.com/demos/
License
MIT license
Last release
sept. 2012
Last activity
actively ongoing
RecordManager
A metadata record management system written in PHP, intended to be used in conjunction with VuFind or another Solr-based discovery interface.
Managing
https://github.com/KDK-Alli/RecordManager
https://github.com/KDK-Alli/RecordManager/wiki
Collection Management
License
GNU General Public License, Version 2
ReSearcher
ReSearcher is an award-winning integrated suite of open source products for locating and managing electronic information resources, designed for use by students and researchers in academic libraries.
Managing, Locating
— Simon Fraser University Library —
http://researcher.sfu.ca/
Search and Browsing, Digital Asset Management
License
GNU GPL
Last activity
actively ongoing
ResourceSpace
ResourceSpace is a web-based, open source digital asset management system (DAM)
Managing
— Montala (under contract with Oxfam) —
http://www.resourcespace.org
http://svn.montala.net/svn/resourcespace
Collection Management, Digital Asset Management
License
BSD style license
Last release
10/4/2012
Last activity
11/23/2012
Retailer
Retailer is a platform for web applications written in XSLT.
Essentially it is a kind of XML transforming web proxy, able to present a RESTful API as another API.
Web-developing, Presenting
— Conal Tuohy —
https://github.com/Conal-Tuohy/Retailer
https://github.com/Conal-Tuohy/Retailer
Metadata Retrieval Services
Examples
https://github.com/Conal-Tuohy/Retailer/blob/master/etc/trove.xsl
Roundware
Roundware is a flexible, distributed framework which collects, stores, organizes and re-presents audio content.
Seeking, Archiving, Presenting
— http://www.roundware.org/category/team/ —
http://www.roundware.org/
https://github.com/hburgund/roundware-server
Digital Asset Management
Examples
http://www.roundware.org/category/examples/
ruby-marc
A Ruby library to parse, create and manage MARC records
Parsing, Producing, Managing
— University of Michigan —
https://github.com/ruby-marc/ruby-marc
https://github.com/ruby-marc/ruby-marc
Digital Asset Management
Quality of Documentation
Besides a README file (https://github.com/ruby-marc/ruby-marc/blob/master/README.md), there is no documentation available.
Code Quality
Github: https://github.com/ruby-marc/ruby-marc hosts all the Ruby scripts. There are 5 contributors, but just one is really active. The project started in 2005 and still seems to be maintained.
Sanddragon Image Service
The Sanddragon Image Service is a WCF (Windows Communication Foundation) web service which implements the IIIF image API, deployed on Microsoft’s IIS (Internet Information Server)
Visualizing
— British Library —
http://sanddragon.bl.uk/#home
https://github.com/britishlibrary/SanddragonImageService
Exhibition Management, Media Applications
Examples
http://sanddragon.bl.uk/#imageservice
License
BSD-3 Licence
Last release
2013
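Because the Sanddragon Image Service implements the IIIF image API, a client requests an image by building a URL of the form `{base}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}`. The sketch below shows that URL pattern only; the helper name `iiif_image_url`, the `example.org` base URL, and the identifier are hypothetical, and the accepted keyword values (for instance the quality keyword, `native` in the 1.x API versus `default` in later versions) depend on which API version a given server implements.

```python
from urllib.parse import quote


def iiif_image_url(base, identifier, region="full", size="full",
                   rotation="0", quality="default", fmt="jpg"):
    """Build a IIIF Image API request URL (hypothetical helper).

    The identifier is percent-encoded so that characters such as spaces
    or slashes cannot be mistaken for path separators.
    """
    ident = quote(identifier, safe="")
    return f"{base.rstrip('/')}/{ident}/{region}/{size}/{rotation}/{quality}.{fmt}"


# Assumed server and identifier, for illustration only:
# a 512x512 region from the top-left corner, scaled to 256 pixels wide.
url = iiif_image_url("http://example.org/iiif", "page 1",
                     region="0,0,512,512", size="256,")
print(url)  # http://example.org/iiif/page%201/0,0,512,512/256,/0/default.jpg
```

A zooming viewer typically issues many such requests with different region and size values as the user pans and zooms.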
Sanddragon Item Viewer
The Sanddragon Item Viewer is an item navigation control, enabling navigation between a set of images working with the Sanddragon widget.
Managing, Browsing
— British Library —
http://sanddragon.bl.uk/#home
https://github.com/britishlibrary/SanddragonItemViewer
Exhibition Management
Examples
http://sanddragon.bl.uk/#itemviewer
License
BSD-3 Licence
Last release
2013
Sanddragon Widget
The Sanddragon Widget is an image viewer based on OpenSeadragon that has been modified to work with an IIIF server, such as the Sanddragon Image Service.
Managing, Browsing
— British Library —
http://sanddragon.bl.uk/#home
https://github.com/britishlibrary/SanddragonWidget
Exhibition Management
Examples
http://sanddragon.bl.uk/#widget
License
BSD-3 Licence
Last release
2013
scikit-learn
scikit-learn is a Python module for machine learning built on top of SciPy
Machine Learning
— David Cournapeau, Google —
http://scikit-learn.org/stable/
https://github.com/scikit-learn/scikit-learn
Metadata Retrieval Services, Data Visualisation
ScoreToChart
A visualization of the number of notes played in each score
Visualizing
— David Oliveri —
https://github.com/davideoliveri/ScoreToChart/
https://github.com/davideoliveri/ScoreToChart/
Data Visualization
Examples
http://davideoliveri.github.io/ScoreToChart/
License
Unknown
Last release
2014
Last activity
27/8/2014
shannon-server
shannon-server is a small WSGI-compliant httpony to calculate the Shannon Entropy for a given image.
Measuring
— Cooper Hewitt —
http://labs.cooperhewitt.org/2013/default-sort-or-what-would-shannon-do/
https://github.com/cooperhewitt/shannon-server
Digital Asset Management, Image Search
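For readers unfamiliar with the measure that shannon-server reports, Shannon entropy is H = sum of p * log2(1/p) over the distinct values, where p is the share of samples holding that value. The sketch below computes it over a flat list of pixel intensities. It is an assumption-laden illustration, not shannon-server's code: the function name `shannon_entropy` and the tiny sample lists are hypothetical, and a real service would first decode the image into pixel values.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Return the Shannon entropy, in bits, of a sequence of values.

    Hypothetical helper: `values` could be the grey-level intensities
    of an image's pixels. An empty sequence yields 0.
    """
    counts = Counter(values)
    total = sum(counts.values())
    # p * log2(1/p) for each distinct value, with p = n / total.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


print(shannon_entropy([7, 7, 7, 7]))      # 0.0 (a flat image carries no information)
print(shannon_entropy([0, 0, 255, 255]))  # 1.0 (two equally likely levels)
```

Higher values indicate a more varied distribution of pixel values, which is why entropy can serve as a rough "visual busyness" score when sorting images.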
Shelf.io
Build virtual shelves for book/cd/record/film collections using linked data
Managing
— Harvard Library Innovation Lab —
http://shelf.io/
https://github.com/harvard-lil/shelfio
Data Visualization, Linked Open Data, Social Applications
License
MIT License/GPL License
Last release
2012
Shred.js
Javascript framework to enable annotating of diverse media from diverse sources
Annotating
— Columbia Center for New Media Teaching and Learning —
Media Annotation/Tagging
License
unknown
Last release
26/8/2014
sigil
Sigil is a free, open source, multi-platform e-book editor, designed for editing books in EPUB format.
Editing
— Strahinja Marković, John Schember —
http://sigil-ebook.com/
https://github.com/user-none/Sigil
Publication
License
GNU GPL v3
Last release
3/2/2015
SobekCM
SobekCM allows users to discover online resources via semantic and full-text searches, as well as a variety of different browse mechanisms. For each digital resource in the repository there are a plethora of display options, which may be selected by an appropriately authenticated user. This repository includes online metadata editing and online submissions in support of institutional repositories.
Browsing, Visualizing, Editing
— Mark Sullivan —
http://ufdc.ufl.edu/sobekcm
https://github.com/MarkVSullivan/SobekCM-Web-Application
Content Retrieval Services, Collection Management, Search and Browsing
License
GNU General Public License
SOCH
SOCH (Swedish Open Cultural Heritage) is a web service used to search and retrieve data from any organisation holding information or media relating to Swedish cultural heritage. The Swedish name for SOCH is “K-samsök”, which can be translated as “Cultural cross-search”.
Browsing, Retrieving
— K-Samsök —
http://www.ksamsok.se/in-english/
http://www.ksamsok.se/api/kodexempel/
Metadata Retrieval Services, Linked Open Data
Solr
Enterprise search platform
Browsing
— James Casbon —
http://lucene.apache.org/solr/
http://www.apache.org/dyn/closer.cgi/lucene/solr/3.6.1
Search and Browsing, Repository Software
License
Apache License Version 2.0
Spira
Spira is a framework for using the information in RDF.rb repositories as model
objects. It gives you the ability to work in a resource-oriented way without
losing access to the statement-oriented nature of linked data, if you so choose.
It can be used either to access existing RDF data in a resource-oriented way,
or to create a new store of RDF data based on simple defaults.
Managing
— RubyGems —
https://rubygems.org/gems/spira
https://github.com/ruby-rdf/spira
Linked Open Data
License
Unlicense
Last release
28/1/2014
Spotlight
Spotlight is a plugin for Blacklight that enables libraries
and other cultural heritage institutions to build online exhibits from
content in their repositories to better highlight their digital
collections. It extends Blacklight by providing a self-service,
forms-based user interface that allows exhibit-builders, such as
librarians or faculty, to customize the search interface and homepage,
and to build media-rich feature pages to better contextualize their
collections.
Editing, Web-Developing
— Chris Beer —
https://github.com/sul-dlss/spotlight
https://github.com/sul-dlss/spotlight
Contextualisation, Exhibition Management, Collection Management
Examples
https://www.youtube.com/watch?v=_A7vTbbiF4g&feature=youtu.be
License
Apache 2.0
Last release
6/30/2014
Stack View
Library collection visualization tool. Color codes and organizes books based on popularity and use.
Visualizing
— Harvard Library Innovation Lab —
http://librarylab.law.harvard.edu/blog/stack-view/
https://github.com/harvard-lil/stackview
Collection Management, Data Visualisation
License
MIT License/ GPL
Last activity
2012
Stacklife
StackLife is a community-based wayfinding tool for navigating the vast
resources of the combined Harvard Library System. It enables
researchers, teachers, scholars, and students to find what they need and
help others learn from them and their paths.
Browsing
— Harvard Library Innovation Lab —
http://stacklife.harvard.edu/explainer.php
https://github.com/harvard-lil/stacklife
Collection Management
Examples
http://stacklife.harvard.edu/item/the-elements-of-graphic-design/612B1DC6-D389-24A7-C6EC-A73298255537
License
MIT
Last release
17/4/2013
Stanford Named Entity Recognizer (NER)
Stanford NER is a Java implementation of a Named Entity Recognizer.
Named Entity Recognition
http://nlp.stanford.edu/software/CRF-NER.shtml
http://nlp.stanford.edu/software/CRF-NER.shtml#Download
Metadata Mapping/Conversion/normalisation, Metadata Retrieval Services
Examples
http://nlp.stanford.edu:8080/ner/
License
GNU General Public License (v2 or later)
Last release
11/12/2013
Stanford Natural Language Processing Group (NLP)
Stanford CoreNLP provides a set of natural language analysis tools that take raw English text as input and give the base forms of words and their parts of speech; identify whether they are names of companies, people, etc.; normalize dates, times, and numeric quantities; mark up the structure of sentences in terms of phrases and word dependencies; and indicate which noun phrases refer to the same entities. Stanford CoreNLP is an integrated framework, which makes it very easy to apply a bunch of language analysis tools to a piece of text. Starting from plain text, you can run all the tools on it with just two lines of code. Its analyses provide the foundational building blocks for higher-level and domain-specific text understanding applications.
Analyzing
— Stanford —
http://nlp.stanford.edu/software/corenlp.shtml
http://nlp.stanford.edu/software/corenlp.shtml#Download
Ontology/Vocabulary Building and Management
Examples
http://nlp.stanford.edu/software/corenlp.shtml#Demo
License
GNU GPL v2 or later
StoryMap.JS
StoryMapJS is a free tool to help you tell stories on the web that highlight the locations of a series of events.
Locating, Visualizing
— Northwestern University Knight Lab —
http://storymap.knightlab.com/
http://storymap.knightlab.com/advanced.html
Geo-spatial Applications, Media Applications, Data Visualisation
Examples
http://apps.washingtonpost.com/g/page/world/map-how-isis-is-carving-out-a-new-country/1095/
License
Mozilla Public License v 2.0
Structify
The Structify tool is a graphical tool for viewing and editing METS files and creating or correcting structural metadata. It requires Java and is available for Windows, Linux and OSX.
Editing
— University of Innsbruck —
http://dbis-halvar.uibk.ac.at/dokuwiki/doku.php?id=main:structify
http://dbis-halvar.uibk.ac.at/dokuwiki/doku.php?id=main:structify#download
Publication, Data Annotation/Curation, Collection Management
SubjectsPlus
SubjectsPlus is a free and open source tool to help you manage several interrelated parts of your library website.
Managing
— Joyner Library East Carolina University/ University of Miami Libraries —
http://www.subjectsplus.com/
https://github.com/subjectsplus/SubjectsPlus
Collection Management, Digital Asset Management
License
GNU GPL
Last release
19/12/2014
Supplejack
Supplejack is DigitalNZ’s tool for aggregating, searching and sharing metadata records. Supplejack is used to collect metadata about millions of items, from hundreds of data sources, across many different data formats including HTML, RSS, XML, OAI-PMH and RDF/XML. It transforms messy data, creates a unified search index and makes consistent metadata widely available via an open API data service. It was built using open source technology by DigitalNZ and the National Library of New Zealand.
Seeking, Organizing, Resource Sharing
— DigitalNZ —
http://www.digitalnz.org/developers/supplejack
https://github.com/DigitalNZ
Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation
License
GNU GPL
Last activity
16/2/2015
Tabula
Tabula is a tool for liberating data tables trapped inside PDF files.
Extracting Data
— Manuel Aristarán —
http://tabula.nerdpower.org/
https://github.com/jazzido/tabula
Semantic Extraction
TAL (TV Application Layer)
The TV Application Layer (TAL) is an open source library for building applications for Connected TV devices.
Programming
— BBC Future Media Platforms —
http://fmtvp.github.io/tal/getting-started/introducing-tal.html
https://github.com/fmtvp/tal
Media Applications
License
Apache 2.0
Last release
14/3/2015
TAP
TAP is a collection of free and open-source tools which support the creation and delivery of mobile tours. Currently TAP consists of authoring tools built on top of the content management system Drupal, a native iOS mobile application, and a web-based mobile application built upon the jQuery Mobile library.
Programming
— IMA Museums —
http://tapintomuseums.org/TAP
https://github.com/IMAmuseum
Exhibition Management
Examples
http://tapintomuseums.org/case-studies
Telemeta
Telemeta is free and open source web audio archiving software which introduces useful and secure methods to organize, back up, index, transcode, analyse, share and publish any digitized audio or video file with extensive metadata in accordance with open web standards. It is dedicated to collaborative media archiving projects, research laboratories, libraries and digital humanities.
Archiving, Indexing, Resource Sharing, Analyzing
— Guillaume Pellerin, Thomas Fillon —
http://telemeta.org
https://github.com/Parisson/Telemeta
Collection Management, Digital Preservation, Media Annotation/Tagging, Metadata Mapping/Conversion/normalisation, Interactive User Interface
Examples
http://archives.crem-cnrs.fr/
License
CeCILL-B (GPLv2 compatible)
Last release
20/2/2015
Last activity
continuous
TemaTres
TemaTres is an open source vocabulary server: a web application for managing and exploiting vocabularies, thesauri, taxonomies and formal representations of knowledge.
Adding Metainformation
— Diego Ferreyra —
http://www.vocabularyserver.com/
http://sourceforge.net/projects/tematres/
Ontology/Vocabulary Building and Management, Collection Management
Examples
http://r020.com.ar/tematres/demo/index.php?setLang=en
License
GPLv2 license
Last release
08/2013
Tesseract
Tesseract is probably the most accurate open source OCR engine available. Combined with the Leptonica Image Processing Library it can read a wide variety of image formats and convert them to text in over 60 languages.
Converting
— Ray Smith —
https://github.com/tesseract-ocr/tesseract
https://github.com/tesseract-ocr/tesseract
Collection Management, Digital Asset Management, Ontology/Vocabulary Building and Management, Digital Preservation
License
Apache License 2.0
Last release
23/10/2012
The Guide on the Side software
A freely available tool that allows librarians to quickly and easily create online, interactive tutorials based on the principles of authentic and active learning.
Web-Developing
— Leslie Sult, Justin Spargur, Gregory Hagedon, and Ginger Bidwell at the University of Arizona Libraries —
http://code.library.arizona.edu/gots/
https://github.com/ualibraries/Guide-on-the-Side
Social Applications, Collection Management, Linked Open Data
Examples
http://www.library.arizona.edu/applications/quickHelp/tutorial/searching-the-ua-library-catalog
License
GNU
Last release
2012
Last activity
actively ongoing
Timeline JS
TimelineJS is an open-source tool that enables you to build visually-rich interactive timelines and is available in 40 languages.
Visualizing
— Northwestern University Knight Lab —
http://timeline.knightlab.com/
https://github.com/NUKnightLab/TimelineJS
Exhibition Management, Data Annotation/Curation, Media Applications
Quality of Documentation
Very well written documentation for users. Also very well written documentation for programmers (found on the GitHub page). Frequent documentation updates and live demo.
Ease of Adaptation
This is a web application that can be plugged into websites. Instructions for embedding it are available on the first page of the project’s custom website, and more extensive information exists on the project’s GitHub page. There is a list of known installations.
Code Quality
Excellent code quality and package structure. The project is of academic standard and follows almost every practice for openness to collaboration and explicitness of description.
Examples
http://timeline.knightlab.com/
License
Mozilla Public License, v. 2.0
Last release
October 2013
Last activity
actively ongoing
TimeMapper
TimeMapper is a simple JavaScript app for creating and sharing timelines and timemaps, built using ReclineJS.
Visualizing, Resource Sharing
— Rufus Pollock —
http://timemapper.okfnlabs.org/
https://github.com/okfn/timemapper
Exhibition Management
Quality of Documentation
The documentation describes the three steps needed to get the application to work and produce the desired results for the user. This information can be found on the custom website of the project. All information relevant to programmers can be found on the GitHub page of the project. It covers how to install and configure the tool, but the text could be of higher quality.
Ease of Adaptation
Because this software serves one specific purpose, its ease of adaptation depends on the need it is meant to cover. In our case, TimeMapper’s ease of adaptation is bounded by JS support and the availability of a Google account on the user’s side, and by the willingness to use JS on the programmer’s side. There is no documentation for adapting or extending the tool.
Code Quality
Very well written documentation for users. Also very well written documentation for programmers (found on the Github page). Frequent documentation updates and live demo.
Examples
http://timeliner.okfnlabs.org/view/?url=https://docs.google.com/spreadsheet/ccc?key=0Al6mO9_3Hr2PdGZnRjEwUWxOekhreTNNZEFEMWRZbkE
Traject
Tools for reading MARC records, transforming them with indexing rules, and indexing to Solr.
Browsing, Modifying, Indexing
— Jonathan Rochkind (Johns Hopkins Libraries), Bill Dueber (University of Michigan Libraries) —
https://github.com/traject-project/traject/
https://github.com/traject-project/traject/
Collection Management, Metadata Mapping/Conversion/normalisation
Quality of Documentation
There is no custom website for the tool. The docs on GitHub contain all the information necessary for the tool’s operation. Since this is not a full software application but could be considered a large-scale script performing a specific function, the lack of a custom website cannot be considered a substantial drawback in the project’s evaluation, as long as all the necessary information exists. The tutorials for users and programmers are very well written and descriptive.
Ease of Adaptation
The adaptation needed for this tool’s specific functionality is part of its operation, namely setting the indexing rules, for which the documentation is quite extensive.
Code Quality
The package structure for the tool is clear and there are numerous tests for users and programmers to consult. The number of contributors is small, but the support and update frequency is adequate for a tool of that size. The tool is written in JRuby.
Examples
https://github.com/traject-project/traject/blob/master/test/test_support/demo_config.rb
License
MIT License
Last activity
ongoing
UniversalViewer
The Universal Viewer is an open source project to enable cultural heritage
institutions to present their digital artifacts in an IIIF-compliant and highly customisable user interface.
Visualizing, Publishing
— Edward Silverton —
https://github.com/UniversalViewer/universalviewer
https://github.com/UniversalViewer/universalviewer
Media Applications, Digital Asset Management, Publication
Quality of Documentation
Rather thorough documentation can be found on the wiki: https://github.com/UniversalViewer/UniversalViewer/wiki
Examples
http://universalviewer.io/examples/
License
MIT License
Last release
15/1/2016
Last activity
ongoing
UNT Digital Collections Opensearch
A Drupal 7 search module for Opensearch compatible resources.
Web-Developing
— University of North Texas —
https://github.com/unt-libraries/untdl_opensearch
https://github.com/unt-libraries/untdl_opensearch
Metadata Retrieval Services, Collection Management
License
https://github.com/unt-libraries/untdl_opensearch/blob/master/LICENSE
Ushahidi
Make smart decisions with a data management system that rapidly collects
data from the crowd and visualizes what happened, when and where.
Managing, Browsing, Visualizing
— Ushahidi —
http://www.ushahidi.com/blog/product/ushahidi/
http://www.ushahidi.com/get-help/
Data Visualization, Geo-spatial Applications, Social Applications
License
GNU LGPL
Last release
v 3.0 beta 6
VIVO
VIVO enables the discovery of researchers across institutions. Participants in the network include institutions with local installations of VIVO or those with research discovery and profiling applications that can provide semantic web-compliant data. The information accessible through VIVO’s search and browse capability will reside and be controlled locally, within institutional VIVOs or other semantic web-compliant applications.
Networking
— Cornell University —
http://vivoweb.org/about
https://wiki.duraspace.org/display/VIVO/VIVO
Semantic Extraction
License
Atlassian Confluence Open Source Project License
Last activity
30/6/2014
VoxForge
VoxForge was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines.
Archiving
http://www.voxforge.org/home
http://www.voxforge.org/home/dev
Ontology/Vocabulary Building and Management
VuFind
Library resource portal.
Archiving
— Villanova University's Falvey Memorial Library —
http://vufind.org/
http://vufind.org/downloads.php
Collection Management
Quality of Documentation
The official website describes the software extensively. The documentation for use and deployment is well written. User and developer manuals are present, and there is also a live demo showing how the tool is used. The tool is even presented at conferences, so continuous feedback is given on its use and possible improvements. The project is present on GitHub. Highlight: used by more than 120 organizations worldwide, mostly universities and public libraries.
Ease of Adaptation
There is a support mailing list dating back to the time of the tool’s release, and its archive is open for access. A list of known installations is publicly available. A JIRA issue tracker is used. Several companies offer commercial support for organizations using the tool. Many configuration options and adaptations are possible.
Code Quality
GitHub-hosted project with a very clear and well-organized package structure. Written in PHP, with explanatory comments on the code’s methods. Test files demonstrating functionality are included in the GitHub repository. Official statistics on April 10th, 2014 show 21 contributors with 3077 commits. Oldest change is 2 years ago, last change 16 hours ago.
License
GPL
Last release
8/26/2013
Wagtail
Content management system originally designed for the Royal College of Art, focused on flexibility and user experience. Light, fast, agile, portable.
Managing
— Torchbox —
http://wagtail.io
https://github.com/torchbox/wagtail/
Collection Management
Wandora
Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.
Managing, Publishing, Extracting Data
— Wandora Team —
http://www.wandora.org
http://www.wandora.org/wiki/Download
Collection Management, Metadata Retrieval Services, Metadata Mapping/Conversion/normalisation
Examples
http://kokoelmat.fng.fi/
License
GNU GPL v3
Last release
2012
Last activity
2012-06-21
Wayback
Tool for archiving websites
Archiving
— the IIPC consortium —
http://netpreserve.org/netpreserve.org/tools/openwayback
https://github.com/iipc/openwayback
Digital Preservation
License
Apache License, v.2
WEKA
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.
Machine Learning, Data Mining
— The University of Waikato —
http://www.cs.waikato.ac.nz/ml/weka/
http://www.cs.waikato.ac.nz/ml/weka/downloading.html
Metadata Retrieval Services
Wikipedia JS
Wikipedia.JS is a small JavaScript library for accessing information in Wikipedia articles such as dates, places, abstracts and more.
Locating
— Open Knowledge Foundation Labs (Rufus Pollock) —
http://okfnlabs.org/wikipediajs/
https://github.com/okfn/wikipediajs/
Metadata Retrieval Services
WorldCat Search
worldcat is a Python module that works with OCLC’s WorldCat Affiliate web
services (see http://www.worldcat.org/affiliate/). worldcat currently works
with the WorldCat Search API, the xID (xISBN, xISSN, and xOCLCNUM) APIs, and
lookups using the WorldCat Registry API.
Seeking
— OCLC —
https://www.worldcat.org/advancedsearch
https://github.com/anarchivist/worldcat
Search and Browsing
License
GNU Lesser General Public License v2
Last release
Unreleased prototype
xDams Platform OS
xDams is a web-based platform
for creating, managing and publishing finding aids compliant with the EAD and EAC-CPF schemas.
Producing, Managing, Publishing
— http://www.regesta.com/ —
http://www.xdams.org/
https://github.com/xdamsorg/xDams-core/
Metadata Retrieval Services, Collection Management, Exhibition Management
Quality of Documentation
Nearly everything is written in Italian, even https://github.com/xdamsorg/xDams-core/blob/master/README.md.
Code Quality
The comments in the code are also in Italian.
Examples
http://www.youtube.com/user/xdamsplatform
License
GNU GPL v3
Last release
7/10/2012
Last activity
7/10/2012
xEAC
xEAC is an XForms-based prosopographical information management system for creating and publishing Encoded Archival Context (EAC-CPF) records. A companion tool is EADitor, for editing Encoded Archival Description (EAD) records.
Managing, Producing, Publishing
— Ethan Gruber, American Numismatic Society —
Collection Management, Publication
Quality of Documentation
The installation instructions (http://wiki.numismatics.org/xeac:generic_installation) are quite straightforward: just configure and deploy the application on Tomcat. You also have to set up an Apache Solr instance. The user manual is a little sparse, but there are at least some screenshots, and the application does not seem to be too complicated.
Ease of Adaptation
The software seems to be closely tailored to the needs of the American Numismatic Society.
Code Quality
xEAC is based on Orbeon Forms (http://www.orbeon.com/), a solution for building and deploying web forms.
XTF
The eXtensible Text Framework (XTF) is a powerful open source platform for providing access to digital content.
Direct Accessing
— California Digital Library (CDL) —
http://xtf.cdlib.org/
http://xtf.cdlib.org/download/
Content Retrieval Services
Quality of Documentation
There is a lot of documentation on the website (http://xtf.cdlib.org/documentation/) for the user, the developer and the system administrator.
Ease of Adaptation
The change log shows steady development since at least 2006. Unfortunately, no new versions have been released since July 2012.
Code Quality
Building XTF from the source code is quite simple; Apache Ant is used for the build. The Java code looks good.
Examples
http://xtf.cdlib.org/xtf/
License
Mozilla Public License 1.0 (MPL), BSD License, Apache Software License
Last release
7/27/2012