Practise of CRM-based Data Integration

Presenters: George Bruseker ICS-FORTH (GR), Achille Felicetti, PIN (IT), Franco Nicollucci PIN (IT), Mark Fichtner GNM (DE)

25 September, 09.30 - 18.00 CIDOC CRM (incl. lunch break - 1 hour) 

This workshop will offer a hands-on introduction to the process of data integration from heterogeneous CH data sources.

Increasingly CH professionals working in the museum sector are faced with the challenge of bringing together a holistic view of the information available from different specialists and institutions with a field from multiple data sources, for the purposes of research or greater institutional knowledge sharing. Whether the goal is data integration for research or for intra- or inter-institutional data integration and harmonization, the skills necessary to plan and execute a data integration project are in demand.

Adopting the CIDOC CRM standard ontology, this workshop will present the overall workflow for a data integration project, highlighting the basic steps and the skills and tools needed to achieve this end. It will provide a high level picture of the data integration process, the CIDOC CRM as standard, how to plan and go about cleaning and mapping data.

For a focus, the workshop will concentrate on archaeological excavation data and how it can be mapped into a common expression using CIDOC CRM and its archaeological extension, CRMarchaeo.

While explaining the general picture of how to go about the data integration process, the workshop will also introduce participants to the basic use of practical software tools for realizing such a project including: the X3ML Toolkit, the WissKi and the ResearchSpace platform.

George Bruseker (Greece)

George Bruseker has a PhD in philosophy from the University of Athens (Greece). He has worked with major cultural heritage institutions in Europe and the Gulf in key information management roles, including the British School at Athens in Greece and Qatar Museums in Doha. He presently works as a researcher at the Foundation for Research and Technology Hellas (FORTH) in the Centre for Cultural Information (CCI) of the Institute of Computer Science. CCI focuses on developing information systems for cultural heritage (CH) institutions. The subject of George’s research is information management, focussing on the development and application of formal ontologies for the generation of semantically encoded datasets. As part of this research, George is a member of the CIDOC Conceptual Reference Model (CRM) Special Interest Group (SIG) which maintains and develops this ISO standard formal ontology for CH data. Currently, he is part of the Parthenos project, a European Union funded research infrastructure (RI) project to build cross-domain support for managing RI related information. A particular focus of his research is to understand and build sustainable systems for supporting the life cycle of information at an intra and inter institutional level, with a special interest in CH and humanities data.

Achille Felicetti (Italy)

Achille Felicetti has a degree in Archaeology and a diploma in computer programming. He coordinated the teams working on the creation of semantic tools, such as the AMA mapping tool and the SAD semantic query system used in EPOCH and COINS projects, as well as the semantic annotation tools of 3D-COFORM. He coordinated the development of the ARIADNE platform and portal for interoperability of archaeological information and the definition of terminological tools and vocabularies for standardization of archaeological information. He is currently coordinator of the team in charge of the definition and application of the CIDOC CRM CRMarchaeo extension for the encoding of excavation information, and of the CRMtex extension for the modelling of inscriptions and ancient texts. He is responsible for the development of annotation and NLP tools for knowledge extraction from textual archaeological documentation within the EOSCpilot initiative. Achille is also system administrator of the VAST-LAB laboratory at PIN and is interested in new technologies research and web applications development. He is the creator of various tool sets for Digital Libraries building and for the management of cultural heritage datasets using semantic technologies. He is the author of many papers dealing with the conceptual modelling and application of semantic technologies to the standardisation, interoperability and management of archaeological documentation.

Franco Niccolucci (Italy)

Director of VAST-LAB at PIN, University of Florence, Italy. A former professor at the same University until 2007, he has then directed the Science and Technology in Archaeology Research Center at the Cyprus Institute, Nicosia, Cyprus until 2013, when he returned to Italy to direct the laboratory at PIN he had created and directed since 2001.
Since 2004, Prof. Niccolucci has participated in and led a number of important European projects on the applications of Information Technology to Cultural Heritage, incl. ARIADNE, a research infrastructure with 28 partners to manage and integrate archaeological documentation from all European countries, which has presently registered about two millions archaeological datasets and made them available on-line; and PARTHENOS, clustering all the most important European actors in the domain of digital humanities and cultural heritage and providing a policy strategic framework and tools for heritage-related digital activities. He also participates in E-RIHS, the European Research Infrastructure on Heritage Science, where he coordinates the development of DIGILAB, a European registry and repository for heritage-related scientific data. Prof Niccolucci has also led a number of projects concerning museum organization and communication, inc. CREATIVE CH, focused on the support to small museums as a factor of economic and social development.
Niccolucci’s current scientific interests focus on the documentation of archaeological research. He has contributed to the creation of two CIDOC CRM extensions, CRMarchaeo for archaeological data and CRMba for architecture and archaeological standing structures.
Niccolucci is the author of about 100 articles on scientific journals, chapter in books and conference papers. They range from analysing the foundations of archaeological documentation and contributing to developing the CRM, to critically evaluating the use and impact of visual technologies in archaeological research. On the latter topic, he has developed, co-chaired and curated the London Charter for the Visualization of Cultural Heritage. His most recent editorial work has concerned the special issue of the International Journal of Digital Libraries (IJDL) titled “Extending, Mapping and Focusing the CIDOC CRM” where he contributed and was the guest editor, and a special issue of the Journal of Computing and Cultural Heritage (JOCCH) on “Digital Infrastructures for Cultural Heritage”, which he co-edited with two colleagues.
He has chaired a number of international conferences on the above subjects starting in 2004 with CAA2004 and continuing with the VAST series he initiated in 2000 and continued to chair until 2012.

Mark Fichtner (Germany)

Mark Fichtner studied computer science at the Friedrich Alexander University Erlangen-Nürnberg with main focus on artificial intelligence. After finishing his diploma in 2008 he was working in the DFG funded research project “Wissenschaftliche KommunikationsInfrastruktur” (Scientific CommunicationInfrastruktur, WissKI, for the Zoologisches Forschungsmuseum Alexander Koenig in Bonn. After the end of the project he switched to the Germanisches Nationalmuseum in 2012, where he made the IT-integration in the research project “The early Dürer”. Since December 2012 he is permanent employee in the IT department of the Germanisches Nationalmuseum and second head of the department. He was coordinator in the second phase of the DFG-funded WissKI project from 2014 to 2016. He specialized in semantic web technology and practical deployment of the CIDOC CRM in the domain of cultural heritage. Since 2008 he constantly contributes to the implementation of the CIDOC CRM in OWL (Erlangen CRM, at the University and thereafter.