Section: Other Grants and Activities
ISO TC37 SC4
Theme: Standardisation of language resources
Description: The standardisation of language resources is an essential aspect of natural language processing since it allows one both to reuse linguistic data such as lexica or grammars from one application to another and to deploy interoperable linguistic components in complex processing chains (e.g., a man-machine dialogue system).
ISO committee TC 37/SC 4 (Language Resource Management) has been launched in 2002 to cover all standardisation needs in the domain of language resources. Under the responsibility of Laurent Romary (chair) and Prof. Key-Sun Choi (Secretary), the committee aims at providing ways to ensure a high level of interoperability in applications related to human language technology. Beyond the administrative responsibility of the group, the team is more specifically involved in, and has contributed to the following work items:
lexical data representation;
feature structure representation;
representation of data categories for language resources.
Administrative context: ISO (International Organisation for Standardisation)
Remarks: Our participation to ISO/TC 37/SC 4 occurs in the context of the INRIA corporate action ``Syntax'', and more particularly, the Technolangue/RNIL, RNTL/Outilex, and (since August 2006) the MLIF projects.
Theme: Lexical Semantics (Frame Semantics, Cognitive Linguistics, Lexicography)
Description: The FR.FrameNet project aimed at exploring methodologies, resources and techniques for the construction of a French semantic lexicon in the FrameNet family. The final purpose was to establish conditions under which a real-scale project of lexicon construction could be carried on, depending on available funding and partnerships, with an optimal cost/coverage ratio. More detail on this project (which finished at the end of 2006) can be found elsewhere in this document.
Administrative context: France-Berkeley Fund
Person(s) in charge: Laurent Romary (Nancy), Charles Fillmore (Berkeley)
Period: start 2005-07-01 / end 2006-06-30
Contact: Guillaume Pitel
Partner(s): ICSI, ATILF, Saarbrücken University, NLT team (ICSI), Strasbourg University
Delph-In: Deep Linguistic Processing with HPSG
Theme: multilingual grammar development
Description: Delph-In is a consortium of researchers committed to open-source development of grammars, parsers, and other linguistic resources in the framework of Head-Driven Phrase Structure Grammar (HPSG). The members of the project use the Linguistic Knowledge Builder (LKB) platform and the grammars are developed in accordance with the principles of the Grammar Matrix, in particular with respect to semantic representation using Minimal Recursion Semantics (MRS).
At present, grammar development projects are underway for the following languages: English, Catalan, Danish, French, German, Modern Greek, Japanese, Korean, Norwegian, Portuguese, Spanish, Swedish. LED is responsible for the development of the French grammar.
Administrative context: Open international collaboration network
Web site: http://www.delph-in.net
Person(s) in charge: Hans Uszkoreit (DFKI) and Dan Flickinger (Stanford)
Period: start 2005-08-01
Contact: Jesse Tseng
Partner(s): DFKI, Stanford University, Saarland University, University of Washington, University of Oslo, NTNU (Trondheim, Norway), Cambridge University, NTT (Japan)
Description: On January 1st, 2005, Nancy became the fourth host of the TEI consortium (Text Encoding Initiative - http://www.tei-c.org ). This resulted from the wish of ATILF, Loria (LED) and INIST to act together in their contribution to standardisation activities for the encoding of textual information. In this context, LED is more particularly active in the domains of spoken corpora, terminology and dictionary encoding techniques.
Administrative context: Text Encoding Initiative
Web site: http://www.tei-c.org
Period: start 2005-08-01
Contact: Laurent Romary
Partner(s): ATILF, INIST
Passepartout: Interactive Television
Theme: Linguistic and multimedia resources
Description: Digitisation of society is always accelerating. A key factor in this acceleration is now software technology. This project focuses on the convergence of digital systems and applications in home media-centres in compliance with the ITEA roadmap ``The Road towards Convergence'' thus matching the vision of industries, institutions, SME and government partners. New technologies are expected to emerge from this project, that propel the European software industries on to convergence, over terminals and network towards the final goal of ambient intelligence.
The project aims at coupling home media-centres to home networks for rendering scalable content from high definition television (HDTV) to lower definitions in a seamless fashion. Integral to the content will be reactive access and interactivity of high-resolution graphics using ISO and W3C standards for object oriented TV. With the project's goal to make a step towards ambient intelligence through mass personalisation of reactive content (RAMPEG), implementation shall use the most practical elements of MPEG-4 and MPEG-7 with W3C standards such as SMIL, related content synthesis and syndication in XML. Implications will stretch far beyond infrastructure and basic services but will also affect content, human system interaction and engineering.
Implementation will be based on content access using a PVR media-centre as server to new generations of access networks, including Blu-ray optical storage and WIMAX wireless technology. These networks will support the creation of home media-centres that move beyond current STB and PVR-DVD players using MPEG-2 technology, to create true mass-customisation device for family entertainment, with the goals of content packaging and personalisation to match the cultural and linguistic needs of the states of the EU and their economies.
Administrative context: ITEA
Web site: http://www.citi.tudor.lu/passepartout
Period: start 2005-01-01 / end 2007-03-31
Contact: Samuel Cruz-Lara
Partner(s): Cybercultus, Centre de Recherche Publique Henry Tudor, INT, Thomson, RTL, Philips, Telvent, Universidad Politécnica de Madrid, Universidad de Vigo, CharToon, Stoneroos, ETRI, VTT Electronics, V2, CWI, Technische Universiteit Eindhoven, Gradient