Cairo project


CAIRO project - Curating Artistic Research Output

Our key output will be a post-graduate teaching and learning module which will offer researchers the skills and knowledge to effectively self-archive or to communicate their needs to third parties in order to negotiate appropriate levels of service.

The project wiki is here:

The project itself probably did not produce any software (it did produce, e.g., a syllabus), but it lists the following tools:

Digital preservation tools

  • Checking tools (format identification, file validation, error checking, checksum)

  • DROID (DROID is designed to meet the fundamental requirement of any digital repository to be able to identify the precise format of all stored digital objects, and to link that identification to a central registry of technical information about that format and its dependencies. DROID uses internal signatures to identify and report the specific file format and version of digital files. These signatures are stored in an XML signature file, generated from information recorded in the PRONOM technical registry. New and updated signatures are regularly added to PRONOM, and DROID can be configured to automatically download updated signature files. DROID requires Java 1.6 or 1.7 Standard Edition (SE). )

  • JHOVE (JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE is implemented as a Java application, written to conform to J2SE 1.4 using the Sun SDK 1.4.1, and can be used from the command line or through a Swing GUI.)

  • Preservation action tools

  • PLATO (PLANETS) (The planning tool Plato is a decision support tool that implements a solid preservation planning process and integrates services for content characterisation, preservation action and automatic object comparison in a service-oriented architecture to provide maximum support for preservation planning endeavours. The software itself is a J2EE web application relying on open frameworks such as Java Server Faces and AJAX for the presentation layer and Enterprise Java Beans for the backend.)

  • PRONOM (PRONOM is an on-line information system about data file formats and their supporting software products. Originally developed to support the accession and long-term preservation of electronic records held by the National Archives, PRONOM is now being made available as a resource for anyone requiring access to this type of information.)

  • JISC 'file formats for preservation' - only manuals, best practices, etc.
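
The "checksum" and "error checking" category in the list above can be illustrated with a minimal fixity check. This is only a sketch using the Python standard library; it is not how DROID or JHOVE work internally, and the function names `file_checksum` and `verify_fixity` are made up for this example.

```python
import hashlib
import tempfile
from pathlib import Path


def file_checksum(path: Path, algorithm: str = "sha256", chunk_size: int = 65536) -> str:
    """Return the hex digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path: Path, expected: str, algorithm: str = "sha256") -> bool:
    """Compare the current checksum with the one recorded at ingest time."""
    return file_checksum(path, algorithm) == expected.lower()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        sample = Path(tmp) / "sample.bin"  # hypothetical archived file
        sample.write_bytes(b"archived content")
        recorded = file_checksum(sample)   # stored alongside the object at ingest

        print("unchanged file passes:", verify_fixity(sample, recorded))
        sample.write_bytes(b"archived c0ntent")  # simulate silent corruption
        print("corrupted file passes:", verify_fixity(sample, recorded))
```

A repository would typically store the recorded digest in its metadata and re-run the comparison periodically.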

Library of Congress - a registry of 44 tools

NDIIPP - National Digital Information Infrastructure and Preservation Program: a Library of Congress programme concerning private collections. The site has tutorials on how to archive your own data (images, audio, video, e-mail, websites, etc.) at a very general level (e.g. without naming any tools). There is a link to the Viewshare platform.

Viewshare (Viewshare is a free platform for generating and customizing views (interactive maps, timelines, facets, tag clouds) that allow users to experience your digital collections). Steps:

  • create account

  • download data (xls, csv, xml (MODS), OAI), augment the data, add collection data

  • generate view of the collection data (part of webpage)

  • Embed and Share your Views (embed the view in own webpage or manage access to the view on Viewshare).

NDSA - National Digital Stewardship Alliance

The mission of the National Digital Stewardship Alliance is to establish, maintain, and advance the capacity to preserve our nation's digital resources for the benefit of present and future generations.

The project has several working groups that develop best practices (selection and acquisition of digital collections, digital formats and best practices, development and maintenance of tools, innovation, outreach). Report: lists a set of tools for archiving websites, blogs, etc.

AQuA project

A list of ~50 useful tools, including e.g. DROID and JHOVE

A list of examined collections and identified issues, with a short description of how to resolve them (using the tools above); the database can be searched by keyword

Open Planets Foundation (OPF)

The Open Planets Foundation (OPF) addresses core digital preservation challenges by engaging with its members and the community to develop practical and sustainable tools and services to ensure long-term access to digital content.


  • FIDO Format Identification for Digital Objects is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows.

  • jpylyzer JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to the format's specifications. Additionally jpylyzer is able to extract the technical characteristics of each image.
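
The idea behind signature-based format identification, as used by tools such as DROID and FIDO, can be sketched by matching the leading bytes of a file against a small table of known signatures. This is an illustration only: the table below is a tiny hand-written sample, not the PRONOM signature file, and `identify` is a hypothetical helper. Checking the 12-byte JP2 signature box is also far less than the full validation jpylyzer performs.

```python
from typing import Optional

# A tiny, hand-written sample of well-known leading-byte signatures.
# Real tools load a far richer signature set from the PRONOM registry.
SIGNATURES = [
    ("JPEG 2000 Part 1 (JP2)", bytes.fromhex("0000000c6a5020200d0a870a")),
    ("PNG", b"\x89PNG\r\n\x1a\n"),
    ("PDF", b"%PDF-"),
    ("GIF", b"GIF87a"),
    ("GIF", b"GIF89a"),
]


def identify(data: bytes) -> Optional[str]:
    """Return the first format whose signature matches the start of the data."""
    for name, magic in SIGNATURES:
        if data.startswith(magic):
            return name
    return None


if __name__ == "__main__":
    print(identify(b"%PDF-1.4\n..."))
    print(identify(b"plain text without a known signature"))
```

Matching only the first bytes identifies a format family at best; version detection and conformance checking need format-specific parsing.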

Tool registry

So far the registry contains 3 tools.

Database of collections, issues and solutions - sizeable; AQuA is a subset of this database.


DigiBIC is a network of European leaders in research, innovation support, industry and finance working together to deploy the latest technology and tools to creative industries. Over 100 companies have deployed technologies (many free downloads) to help their business, so why not have a look at the DigiBIC technology catalogue and see if there's anything for you?

-> a catalogue of tools for various domains: a list with very short descriptions plus a link to each home page. Some of the tools are paid, and not all of them are software.

SPAR project

SPAR stands for "Système de Préservation et d'Archivage Réparti", meaning "Distributed Archiving and Preservation System". Functions:

  • data replication and monitoring of possible corruptions

  • file format transformations

  • digital signatures

  • external access

Concerning the metadata, the SPAR system uses the most advanced standards:

  • Dublin Core for the descriptive information, i.e. the description of the object that is archived,

  • MIX to code the technical metadata for image files,

  • textMD to code the technical metadata for text files,

  • ODRL to code the usage license of the digital objects,

  • PREMIS for the provenance information, i.e. the documentation of the history of the data-objects
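
As an illustration of the descriptive layer in the list above, a simple Dublin Core record can be built with the Python standard library. This is a sketch only: the Dublin Core element namespace is the standard one, but the wrapping `record` element, the sample values and the helper `build_dc_record` are assumptions made for this example and say nothing about how SPAR actually packages its metadata.

```python
import xml.etree.ElementTree as ET
from typing import Dict

# Namespace of the Dublin Core Metadata Element Set 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def build_dc_record(fields: Dict[str, str]) -> ET.Element:
    """Wrap simple Dublin Core elements in a (hypothetical) <record> element."""
    record = ET.Element("record")
    for name, value in fields.items():
        element = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        element.text = value
    return record


if __name__ == "__main__":
    sample = build_dc_record(
        {
            "title": "Sample scanned page",  # made-up values for illustration
            "creator": "Unknown",
            "format": "image/tiff",
        }
    )
    print(ET.tostring(sample, encoding="unicode"))
```

Technical, rights and provenance metadata (MIX, textMD, ODRL, PREMIS) would live in separate sections using their own schemas.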

The system was written for the BnF (the French National Library) by a commercial company and is probably not available.


With SIARD (Software Independent Archiving of Relational Databases), the Swiss Federal Archives (SFA) provide a sustainable solution for the long-term preservation of relational databases. This includes an open format for archiving of relational databases as well as a freeware package - "SIARD Suite" - for converting relational databases into the SIARD format. SIARD format and SIARD Suite offer a unique archival solution to preserve and access database content including metadata and relations over the long-term.

The open archiving format SIARD released in 2008 is used in the SFA and the Swiss government agencies. In May 2008, it was accepted as the official format of the European PLANETS project for the archiving of relational databases.

SIARD Suite is based on international standards such as XML, SQL:1999 and UNICODE. At present the application supports the following databases: Oracle, Microsoft SQL Server, MySQL and Microsoft Access.

Prerequisite is the installation of Java SE 1.5 or higher.
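
The general idea of moving relational content into a software-independent XML form can be sketched as follows. This is not the SIARD format and does not use SIARD Suite: the element names (`table`, `columns`, `rows`, `cell`) are invented for the example, and SQLite is used only because it ships with Python (it is not one of the databases listed above).

```python
import sqlite3
import xml.etree.ElementTree as ET


def export_table(conn: sqlite3.Connection, table: str) -> ET.Element:
    """Dump one table (column names plus rows) into a simple XML tree.

    The table name is interpolated into the query, so it must come from a
    trusted source such as the database catalogue, never from user input.
    """
    cursor = conn.execute(f'SELECT * FROM "{table}"')
    columns = [description[0] for description in cursor.description]

    root = ET.Element("table", name=table)
    columns_el = ET.SubElement(root, "columns")
    for column in columns:
        ET.SubElement(columns_el, "column", name=column)

    rows_el = ET.SubElement(root, "rows")
    for row in cursor:
        row_el = ET.SubElement(rows_el, "row")
        for column, value in zip(columns, row):
            cell = ET.SubElement(row_el, "cell", column=column)
            cell.text = "" if value is None else str(value)
    return root


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE works (id INTEGER PRIMARY KEY, title TEXT)")
    connection.executemany(
        "INSERT INTO works (title) VALUES (?)", [("First work",), ("Second work",)]
    )
    print(ET.tostring(export_table(connection, "works"), encoding="unicode"))
```

A real archival export would also have to capture data types, keys and relations between tables, which this sketch ignores.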

How useful will it be if it only covers relational databases?

The software is available free of charge but is closed source; support, training and application development can be purchased.

DPE - Digital Preservation Europe

DPE fosters collaboration and synergies between many existing national initiatives across the European Research Area. DPE addresses the need to improve coordination, cooperation and consistency in current activities to secure effective preservation of digital materials. DPE's project partners lead work to:

  • raise the profile of digital preservation;

  • promote the ability of Member States acting together to add value to digital preservation activities across Europe;

  • use cross-sectoral cooperation to avoid redundancy and duplication of effort;

  • ensure auditable and certificated standards for digital preservation processes are selected and introduced;

  • facilitate skills development through training packages;

  • enable relevant research coordination and exchange;

  • develop and promote a research agenda roadmap; and

  • help both citizens and specialist professionals recognise the central role that digital preservation plays in their lives and work

There are registries on the webpage:

  • Registry of Trainers

  • Registry of Training Materials

  • Registry of Competence Centres

  • Registry of Repositories

  • Registry of Online Resources

  • Registry of Research Projects

No direct information

SHAMAN – Sustaining Heritage Access through Multivalent ArchiviNg

The overall aim of the SHAMAN Integrated Project is to develop a next generation Digital Preservation (DP) framework. It is furthermore developing corresponding reference implementations of exemplar preservation tools for analysing, ingesting, managing, accessing and reusing information objects and data across digital archives based on standardized reference architecture.

The SHAMAN Project addresses the need of Digital Preservation from the information life-cycle management perspective, for maintaining the information accessible beyond the bounds of the media technological change, while ensuring authenticity and integrity during all information and object lifecycle. As prosperity for future generations is linked to keeping value of today’s digital assets by granting proper access to digital content in the future, Digital Preservation has been seen in SHAMAN Project as communication with the future.

The project proposes an approach based on analysing context and capabilities (object metadata + operations/workflow + definition of actors) and on using OWL (ontologies) to define the world.

Planets, Preservation and Long-term Access through Networked Services

The Planets project ended on 31 May 2010. Planets results will be maintained and developed by a follow-on organisation called the Open Planets Foundation (OPF).

CASPAR - Cultural, Artistic and Scientific knowledge for Preservation, Access and Retrieval

CASPAR intends to:

  • Implement, extend, and validate the OAIS reference model (ISO 14721:2003)

  • Enhance the techniques for capturing Representation Information and other preservation related information for content objects

  • Design virtualisation services supporting long term digital resource preservation, despite changes in the underlying computing (hardware and software) and storage systems, and the Designated Communities.

  • Integrate digital rights management, authentication, and accreditation as standard features of CASPAR.

  • Research more sophisticated access to and use of preserved digital resources including intuitive query and browsing mechanisms

  • Develop case studies to validate the CASPAR approach to digital resource preservation across different user communities and assess the conditions for a successful replication.

  • Actively contribute to the relevant standardisation activities in areas addressed by CASPAR.

  • Raise awareness about the critical importance of digital preservation among the relevant user-communities and facilitate the emergence of a more diverse offer of systems and services for preservation of digital resources


  • REPINF - Representation Information Toolkit

  • VIRT - Virtualisation

  • REG - Registry

  • PACK - Packaging

  • PDS - Preservation Data Stores

  • FIND - Finding

  • KM - Knowledge Manager

  • POM - Preservation Orchestration Manager

  • DAMS - Data Access Manager and Security

  • DRM - Digital Rights Manager

  • AUTH – Authenticity

The download page is unavailable. The last release was in 2009; the software is written in Java.

NESTOR - Network of Expertise in long-term STOrage of digital Resources in Germany

nestor is a cooperation association including partners from different fields, but all connected in some way with the subject of "digital preservation".

APARSEN - Alliance Permanent Access to the Records of Science in Europe Network

APARSEN is a Network of Excellence that brings together an extremely diverse set of practitioner organisations and researchers in order to bring coherence, cohesion and continuity to research into barriers to the long-term accessibility and usability of digital information and data, exploiting our diversity by building a long-lived Virtual Centre of Digital Preservation Excellence.

One of the results is to be an Interoperability Framework for Persistent Identifiers.

ENSURE - Enabling kNowledge Sustainability Usability and Recovery for Economic value

Guaranteeing long term usability for spiraling amounts of data produced or controlled by organizations with commercial interests is quickly becoming a major problem. Guided by real world use cases in health care, finance and clinical trials, ENSURE extends the state of the art in digital preservation, which to-date has primarily focused on relatively homogeneous cultural heritage data information through innovative solutions considering:

  • Cost and Value: Evaluate the cost and benefit of different quality solutions, enabling a business to choose the most cost effective solution.

  • Preservation Lifecycle Management: Build on industry standard lifecycle management approaches to manage the preservation lifecycle, meet regulatory compliance, allow changes in the preservation approach to reflect environmental changes, address evolution of ontologies and manage the quality of digital objects over time.

  • Content-aware Long Term Data Protection: Provide data protection over long periods of time, addressing changes to personally identifiable information, new and evolving regulations, and manage user identities over the decades.

  • Utilize Emerging ICT: Evaluate the costs, risks and benefits and demonstrate how to use emerging, commonly available Information Technology to enable scalable solutions for digital preservation, in particular considering Cloud Storage and virtualization techniques.

The project is in progress and focuses on economic aspects (cost). The latest deliverable describes the architecture, an extension of OAIS.


The project is developing ways to describe these file formats in a way to make the comparison of the information contained within files in different formats possible. This is done with two formal languages, called the Extensible Characterisation Definition Language (XCDL) and the Extensible Characterisation Extraction Language (XCEL), which describe formats and the information contained within individual files.

Tools that extract the properties of data stored in a given format and compare those properties. This makes it possible, for example, to verify whether an automatic conversion between file formats was correct.

The software is available on the project website (an installer for Win32, C++ sources to build on Linux/Mac).
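
The comparison step described above can be sketched in a few lines: given the properties extracted from the source file and from the converted file, report every property whose value changed. This does not use XCDL or XCEL; plain dictionaries stand in for the extracted descriptions, and the property names and `compare_properties` helper are assumptions made for this example.

```python
from typing import Any, Dict, FrozenSet, List, Tuple


def compare_properties(
    source: Dict[str, Any],
    converted: Dict[str, Any],
    ignore: FrozenSet[str] = frozenset(),
) -> List[Tuple[str, Any, Any]]:
    """Return (property, before, after) for every property that differs.

    A property missing on one side is reported with None for that side.
    """
    differences = []
    for key in sorted(set(source) | set(converted)):
        if key in ignore:
            continue
        before, after = source.get(key), converted.get(key)
        if before != after:
            differences.append((key, before, after))
    return differences


if __name__ == "__main__":
    # Hypothetical properties extracted before and after a TIFF -> PNG conversion.
    tiff_props = {"format": "TIFF", "width": 2480, "height": 3508, "bits_per_sample": 8}
    png_props = {"format": "PNG", "width": 2480, "height": 3500, "bits_per_sample": 8}

    # The format is expected to change, so it is excluded from the comparison.
    for name, before, after in compare_properties(tiff_props, png_props, frozenset({"format"})):
        print(f"{name}: {before} -> {after}")
```

An empty result suggests the conversion preserved the compared properties; it says nothing about properties that were never extracted.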

SCAPE - SCAlable Preservation Environments

The SCAPE project will develop scalable services for planning and execution of institutional preservation strategies on an open source platform that orchestrates semi-automated workflows for large-scale, heterogeneous collections of complex digital objects. SCAPE will enhance the state of the art of digital preservation in three ways: by developing infrastructure and tools for scalable preservation actions; by providing a framework for automated, quality-assured preservation workflows and by integrating these components with a policy-based preservation planning and watch system. These concrete project results will be validated within three large-scale Testbeds from diverse application areas.

SCAPE Components are implemented as Taverna workflows that follow a set of conventions.

The project is in progress; Java.


A research project (STREP) in the seventh framework programme. The PROTAGE approach to digital preservation is based on pro-active autonomous software agents that are independent of hardware and software technologies. This represents a shift of focus in digital preservation from information systems to preservation-friendly digital objects. The idea is to link these digital objects to long-term digital preservation processes by using agent-based software technology. The PROTAGE project will, based on the latest research on digital preservation strategies and on autonomous systems, build and validate flexible and extensible software agents for long-term digital preservation and access that can cooperate with and be integrated in existing and new preservation systems.

The project ended in 2010. The client is written in Java and is available. The server is a test instance in Girona; its software is not available. Agents hold plans for carrying out specific actions and can query one another for plans; an acquired plan is evaluated according to the trust value assigned to the agent it came from. A plan can then be executed in the system.
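
The trust-based evaluation of acquired plans can be sketched as below. This is a loose illustration of the idea, not PROTAGE code: the `Plan` class, the `select_plan` function, the numeric trust scale and the threshold are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Plan:
    action: str              # what the plan achieves, e.g. a format conversion
    offered_by: str          # name of the agent that supplied the plan
    steps: Tuple[str, ...]   # ordered steps the plan would execute


def select_plan(
    plans: List[Plan], trust: Dict[str, float], min_trust: float = 0.5
) -> Optional[Plan]:
    """Pick the plan whose offering agent has the highest trust value.

    Plans from agents that are unknown or below min_trust are rejected.
    """
    candidates = [p for p in plans if trust.get(p.offered_by, 0.0) >= min_trust]
    if not candidates:
        return None
    return max(candidates, key=lambda p: trust[p.offered_by])


if __name__ == "__main__":
    offers = [
        Plan("convert DOC to PDF", "agent-a", ("open file", "export as PDF")),
        Plan("convert DOC to PDF", "agent-b", ("open file", "print to PDF")),
    ]
    trust_values = {"agent-a": 0.4, "agent-b": 0.9}  # made-up trust scores
    chosen = select_plan(offers, trust_values)
    print(chosen.offered_by if chosen else "no trusted plan")
```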

Which operations are currently available (the manual mentions format conversions)?

DigCurV - Digital Curator Vocational Education Europe Project

DigCurV, a project funded by the European Commission’s Leonardo da Vinci programme to establish a curriculum framework for vocational training in digital curation launched today.


KEEP (Keeping Emulation Environments Portable) is developing emulation services (KEEP Emulation Services) to enable accurate rendering of both static and dynamic digital objects: text, sound, and image files; multimedia documents, websites, databases, videogames etc.

The overall aim of the project is to facilitate universal access to our cultural heritage by developing flexible tools for accessing, manipulating and storing a wide range of digital objects using emulation tools either to reproduce the original environment in which they were created or to enable those objects to be migrated accurately to another environment.

In addition to the development of a KEEP Emulation Framework, within which 3rd party emulators are hosted, the project is also supporting the development of a Virtual Machine which will permit other environments to operate independently of the actual software and hardware environments.

Tools Framework - a database of packaged tools (a tool that needs no installation + an XML file describing it). Workflows composed of these tools can be defined through a GUI. The database also stores tool ratings.

Emulation Framework - stores and can run VMs to emulate an old environment in which archival resources (e.g. applications, games) can be rendered.

The project is in progress.

Magazzini Digitali

Magazzini Digitali is a young and dynamic company. We combine technological expertise, particularly in the field of Java, with communication and usability to achieve a common objective (goal): to create beautiful and easy to use solutions that improve the business of our customers. Our consulting activities begin with gathering the customer's requirements and verifying the feasibility and possible solutions, in order to propose innovative solutions that scale over time. After the online publishing we offer solutions and guarantee all the customer assistance and the evolving maintenance services to follow our customers throughout time.

Turkish National Library


International Network for a Digital Cultural Heritage e-Infrastructure is a European Union FP7 project which aims to establish a network of common interest made up of experts and researchers in the field of e-infrastructures and digital cultural heritage at Euro-Mediterranean level.


ATHENA, presented as a Network of Best Practice within the eContentplus Programme, takes its origins from the existing MINERVA network.

MINERVA Technical Guidelines for Digital Cultural Content Creation Programmes - an attempt at standardisation, but it does not point to specific tools

Report on existing standards applied by European museums

ATHENA Digitisation: standards landscape for European museums, archives, libraries - the report lists and describes the standards in use (e.g. JPG, HTML, DOC, PDF). Implementation plan and access to content of museums through Europeana - the report addresses the conversion of metadata from different institutions


MINT services compose a web based platform that was designed and developed to facilitate aggregation initiatives for cultural heritage content and metadata in Europe. It is employed from the first steps of such workflows, corresponding to the ingestion, mapping and aggregation of metadata records, and proceeds to implement a variety of remediation approaches for the resulting repository. The platform offers a user and organization management system that allows the deployment and operation of different aggregation schemes (thematic or cross-domain, international, national or regional) and corresponding access rights. Registered organizations can upload (http, ftp, oai-pmh) their metadata records in xml or csv serialization in order to manage, aggregate and publish their collections.
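
When records are supplied over OAI-PMH, harvesting starts with a `ListRecords` request against the provider's base URL. The sketch below only builds such request URLs following the OAI-PMH 2.0 argument rules; the base URL `https://repository.example.org/oai` and the helper name are placeholders, and nothing here reflects MINT's own implementation.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(
    base_url: str,
    metadata_prefix: str = "oai_dc",
    set_spec: Optional[str] = None,
    resumption_token: Optional[str] = None,
) -> str:
    """Build an OAI-PMH ListRecords request URL.

    In OAI-PMH 2.0 a resumptionToken is an exclusive argument: when it is
    present, metadataPrefix and set must not be sent again.
    """
    if resumption_token is not None:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec:
            params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    base = "https://repository.example.org/oai"  # placeholder endpoint
    print(list_records_url(base))
    print(list_records_url(base, set_spec="museum"))
    print(list_records_url(base, resumption_token="token-from-previous-response"))
```

A harvester would fetch each URL, parse the returned XML and keep requesting with the returned resumption token until none is given.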

The test instance does not work.

Carolina Digital Repository

Technologies used: iRODS (storage grid), Fedora Commons (object, model and services provider).


TextGrid has, since its start in 2006, established the infrastructure for a respective virtual research environment. In continuous exchange with the scientific community, TextGrid has developed a variety of tools and services available for free download in a stable version. Together with the TextGrid Repository, the Virtual Research Environment TextGrid offers researchers in the humanities sustainable editing, storing and publishing of their data in a thoroughly tested and safe environment.

The project offers a set of services (authorisation; creating, storing and deleting objects; PIDs; publication; workflow) and TextGridLab, an Eclipse-based client.

UCL - University College London ????


The grand vision for DARIAH is to facilitate long-term access to, and use of, all European Arts and Humanities (A+H) digital research data. The DARIAH infrastructure will be a connected network of people, information, tools, and methodologies for investigating, exploring and supporting work across the broad spectrum of the digital humanities. The core strategy of DARIAH is to bring together national, regional, and local endeavours to form a cooperative infrastructure where complementarities and new challenges are clearly identified and acted upon.

DARIAH uses TextGrid.

Fedora Commons

Fedora (Flexible Extensible Digital Object Repository Architecture) was originally developed by researchers at Cornell University as an architecture for storing, managing, and accessing digital content in the form of digital objects inspired by the Kahn and Wilensky Framework. Fedora defines a set of abstractions for expressing digital objects, asserting relationships among digital objects, and linking "behaviors" (i.e., services) to digital objects. The Fedora Commons refers to the community surrounding the Fedora Repository Project. This community joins together with common needs, use cases, and projects. The Fedora Commons community is very active in producing additional tools, applications, and utilities that augment the Fedora repository. Many of these creations are available to the entire community as open source.

Server: Java + a relational database of your choice + Tomcat.

In a Fedora repository, all content is managed as data objects, each of which is composed of components ("datastreams") that contain either the content or metadata about it (e.g. Dublin Core metadata, a file in DOC format and in PDF format). Each datastream can be either managed directly by the repository or left in an external, web-accessible location to be delivered through the repository as needed. A data object can have any number of data and metadata components, mixing the managed and external datastreams in any pattern desired.

Each object can assert relationships to any number of other objects, providing a way to represent complex information as a web of significant meaningful entities without restricting the parts to a single context.

Each data object is represented by an XML file that is managed in the file system, which contains information about how to find all of the components of the object, as well as important information needed to ensure its long-term durability. The system keeps an audit trail of actions that have affected the object, any formal policies that may be asserted about the object and its use, and things like checksums, all within that XML file. As long as both the XML files and the content files that are managed by the repository are backed up properly, the entire running instance of the repository can be reconstructed from the XML files. There is no dependence upon any software to do so, no relational database that cannot be completely reconstructed from the files.
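
The object model described above (a data object made of managed or external datastreams, with checksums and an audit trail) can be sketched with plain Python data classes. This is a conceptual illustration only: it is not the Fedora API or its XML object format, and the class names, field names and sample identifiers are assumptions.

```python
import hashlib
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import List, Optional


@dataclass
class Datastream:
    ds_id: str
    mime_type: str
    content: Optional[bytes] = None  # set for datastreams managed by the repository
    location: Optional[str] = None   # set for external, web-accessible datastreams

    @property
    def checksum(self) -> Optional[str]:
        """SHA-256 of managed content; external content has no stored checksum here."""
        if self.content is None:
            return None
        return hashlib.sha256(self.content).hexdigest()


@dataclass
class DataObject:
    pid: str
    datastreams: List[Datastream] = field(default_factory=list)
    audit_trail: List[str] = field(default_factory=list)

    def add_datastream(self, datastream: Datastream) -> None:
        """Attach a datastream and record the action in the audit trail."""
        self.datastreams.append(datastream)
        stamp = datetime.now(timezone.utc).isoformat()
        self.audit_trail.append(f"{stamp} added datastream {datastream.ds_id}")


if __name__ == "__main__":
    obj = DataObject(pid="demo:1")  # hypothetical identifier
    obj.add_datastream(Datastream("DC", "text/xml", content=b"<dc>...</dc>"))
    obj.add_datastream(
        Datastream("PDF", "application/pdf", location="https://example.org/report.pdf")
    )
    for ds in obj.datastreams:
        print(ds.ds_id, "managed" if ds.content is not None else "external", ds.checksum)
    print(len(obj.audit_trail), "audit entries")
```

Serialising such an object, including its audit trail and checksums, to a single XML file is what would make the repository reconstructable from files alone.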


dArceo is composed of several services which provide functionality for long-term preservation, primarily images, text and a/v documents. One of the most important aspects of the dArceo is the migration function, which has been implemented using the transformation approach of the OAIS model. Functions:

  • Data storage and versioning

  • Metadata management.

  • OAI-PMH Repository.

  • Data manipulation (migration, conversion, data delivery)

  • Data monitoring

  • Common space of the data manipulation functions


