Ontology matching and alignment

We pursue our work on ontology matching and alignment support with contributions to evaluation, data interlinking and multilingual matching.


Participants : Cássia Trojahn dos Santos [Contact] , Jérôme Euzenat, Jérôme David.

Evaluating ontology matching algorithms requires confronting them with test ontologies and comparing the results. Since 2004, we have run the Ontology Alignment Evaluation Initiative (OAEI), which organises evaluation campaigns for assessing the degree of achievement of actual ontology matching algorithms [4]. This year, 16 teams entered the campaign, which consisted of 5 sets of tests. The participating systems and evaluation results were presented at the 6th Ontology Matching workshop, held in Bonn, DE [17], [9].
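As an illustration of what comparing results can mean in practice, the sketch below scores an alignment returned by a matcher against a reference alignment using precision, recall and F-measure. This is a minimal sketch of a standard measure, not the OAEI evaluation code: the `Correspondence` type, the `evaluate` function and the entity names are hypothetical, and correspondences are compared by strict equality only.

```python
from typing import Dict, NamedTuple, Set


class Correspondence(NamedTuple):
    """One correspondence of an alignment (hypothetical, simplified type)."""
    entity1: str
    entity2: str
    relation: str = "="


def evaluate(found: Set[Correspondence],
             reference: Set[Correspondence]) -> Dict[str, float]:
    """Score a found alignment against a reference alignment."""
    # A found correspondence counts as correct only if it is in the reference.
    correct = found & reference
    precision = len(correct) / len(found) if found else 0.0
    recall = len(correct) / len(reference) if reference else 0.0
    total = precision + recall
    # F-measure is the harmonic mean of precision and recall.
    f_measure = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f_measure": f_measure}


if __name__ == "__main__":
    # Toy data, invented for the example.
    reference = {
        Correspondence("a:Paper", "b:Article"),
        Correspondence("a:Author", "b:Writer"),
        Correspondence("a:Review", "b:Report"),
        Correspondence("a:Chair", "b:Chairman"),
    }
    found = {
        Correspondence("a:Paper", "b:Article"),
        Correspondence("a:Author", "b:Writer"),
        Correspondence("a:Paper", "b:Report"),
    }
    print(evaluate(found, reference))
```

With this toy data, two of the three found correspondences are correct and two of the four reference correspondences are retrieved.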

The main activities carried out in 2011 were related to the automation and execution of the OAEI 2011 campaign, in the framework of the SEALS project (see § 8.2.1). This involved the following main tasks:

This work has been used in the OAEI 2011 evaluation campaign. More information on OAEI can be found on the OAEI web site.

Ontology matching for linked data

Participants : Zhengjie Fan, Jérôme Euzenat [Contact] , Jérôme David.

The web of data uses semantic web technologies to publish data on the web in such a way that they can be interpreted and connected together. It is thus critical to be able to establish links between these data, both for the web of data and for the semantic web that it helps to feed.

In the context of the Datalift project (see § 8.1.1), we are developing a data interlinking module. Based on our analysis of the relationships between ontology matching and data interlinking [13], our goal is to generate data interlinking scripts from ontology alignments. For that purpose, we have integrated two existing technologies within the Datalift platform: the Alignment API, for taking advantage of the EDOAL language, and Silk, developed by Freie Universität Berlin, for processing linking scripts. So far, we have demonstrated the ability to process simple scripts.
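To make the idea of deriving an interlinking script from an alignment more concrete, here is a minimal sketch under strong assumptions. It uses neither the Alignment API, EDOAL, nor the Silk script language: the data classes, the `build_link_spec` function, the dictionary layout, the `levenshtein` metric name and the threshold are all hypothetical. It only shows how a class correspondence can select the resources to compare and how property correspondences can tell which values to compare.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class ClassCorrespondence:
    """Hypothetical: two classes found equivalent by a matcher."""
    source_class: str
    target_class: str


@dataclass(frozen=True)
class PropertyCorrespondence:
    """Hypothetical: two properties found equivalent by a matcher."""
    source_property: str
    target_property: str


def build_link_spec(cls: ClassCorrespondence,
                    props: List[PropertyCorrespondence],
                    threshold: float = 0.9) -> Dict[str, Any]:
    """Turn correspondences into a simplified, tool-neutral linking spec."""
    return {
        "link_type": "owl:sameAs",
        # The class correspondence restricts which resources are candidates.
        "source_restriction": f"?a rdf:type <{cls.source_class}>",
        "target_restriction": f"?b rdf:type <{cls.target_class}>",
        # Each property correspondence yields one value comparison.
        "comparisons": [
            {
                "source_path": f"?a/<{p.source_property}>",
                "target_path": f"?b/<{p.target_property}>",
                "metric": "levenshtein",  # assumed string metric
            }
            for p in props
        ],
        "threshold": threshold,
    }


if __name__ == "__main__":
    # Toy URIs, invented for the example.
    spec = build_link_spec(
        ClassCorrespondence("http://ex.org/a#City", "http://ex.org/b#Town"),
        [PropertyCorrespondence("http://ex.org/a#name", "http://ex.org/b#label")],
    )
    print(spec)
```

A real module would serialise such a specification into the script language of the chosen linking tool rather than a Python dictionary.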

This work is part of the PhD of Zhengjie Fan, co-supervised with François Scharffe (LIRMM), within the Datalift project.

Multilingual ontology matching

Participants : Cássia Trojahn dos Santos [Contact] , Jérôme David, Jérôme Euzenat, Giuseppe Pirrò.

We have participated in the creation of a benchmark for multilingual ontology matching, the MultiFarm dataset. This dataset is composed of a set of ontologies translated into different languages and the corresponding alignments between these ontologies. It is based on the OntoFarm dataset, which has been used successfully for several years in the Ontology Alignment Evaluation Initiative. By translating the ontologies of the OntoFarm set into eight different languages (Chinese, Czech, Dutch, French, German, Portuguese, Russian, and Spanish), we created a comprehensive set of realistic test cases. We plan to include this new dataset in the OAEI 2012 campaign.
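The way translations multiply the available test cases can be sketched by enumerating the language pairs between which alignments can be evaluated. This is only an illustration: the `language_pairs` function is hypothetical, and counting the original language of the ontologies (assumed here to be English) as a ninth language is an assumption not stated above.

```python
from itertools import combinations
from typing import List, Tuple

# The eight target languages listed in the text.
TRANSLATED = ["Chinese", "Czech", "Dutch", "French",
              "German", "Portuguese", "Russian", "Spanish"]

# Assumption: the source ontologies are in English and also take part.
ORIGINAL = "English"


def language_pairs(languages: List[str]) -> List[Tuple[str, str]]:
    """Return every unordered pair of distinct languages."""
    return list(combinations(sorted(languages), 2))


if __name__ == "__main__":
    pairs = language_pairs(TRANSLATED + [ORIGINAL])
    print(len(pairs))  # 36 pairs with nine languages
    print(len(language_pairs(TRANSLATED)))  # 28 pairs among the translations
```

Each language pair can then be combined with pairs of ontologies to form cross-lingual matching tasks.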

Finally, in the context of the Cameleon project (see § 8.3.1), we have been working on the creation of multilingual comparable corpora, using a set of multilingual aligned ontologies as seed. These resources will be exploited for populating and enriching ontologies as well as for cross-lingual ontology alignment.