Reccomendations for metadata and data formats for
online availability and long-term preservation, version
Succeed is supported by the European Union under FP7-ICT and coordinated by Universidad de Alicante.
3. RELATED WORK – ONGOING AND EMERGING ACTIVITIES
This chapter presents ongoing and emerging technologies and formats that are used by
various consortia, projects and initiatives in order to enhance digitization-related
activities. The chapter covers semantic technologies, OCR and linguistic resources,
as well as tools packaging issues.
3.1 Semantic technologies
It is already common understanding that semantic technologies play an important role
in digitization-related activities. Various projects and initiatives (re)designs their data,
so that it is possible to expose them with semantic technologies. The idea of Linked Open
Data (LOD) has been especially investigated in this area. The following subsections
provide an overview of the applications and usages of LOD in the cultural heritage
context and digitization.
Digitised Manuscripts To Europeana
Digitized Manuscripts To Europeana (DM2E) is an EU-funded Europeana satellite
project. Its primary aims are to enable as many content providers as possible to get their
data into Europeana and to stimulate the creation of new tools and services for reuse of
Europeana Data in the Digital Humanities. Being coordinated by Humboldt-Universität
zu Berlin, the duration of the project is three years from 2012 to 2015.
The DM2E model is a specialization of the Europeana Data Model
(EDM) for the
domain of handwritten manuscripts. The EDM has been developed within the
Europeana v1.0 project as an RDF-based data model for describing rich metadata
records for Europeana, the European digital library. It can handle huge metadata record
collections represented by heterogeneous metadata standards that must be accessible via
the same platform. The EDM covers Cultural Heritage Objects (CHOs) that are collected
and delivered to Europeana by diverse cultural heritage institutions. The model is as
generic as possible and can be specialized for domain-specific descriptions like it is the
case in DM2E.
In May the project finished the first operational version of its DM2E model (v1.0), a
specialization of the EDM for handwritten manuscripts. The ontology has been
developed within work package 2 with a lot of input from others in the project.
Especially results of extensive mapping workshops with the data providers of DM2E
were integrated into the model. Metadata of diverse formats like TEI, EAD and MARC21
was analyzed and used to create new classes and properties that specialize the current
ERIC stands for European Resarch Infrastructure Consortium