Team Alpage

Overall Objectives
Scientific Foundations
Application Domains
New Results
Contracts and Grants with Industry
Other Grants and Activities

Section: New Results

Large scale corpus processing

Participant : Éric Villemonte de La Clergerie.

In the context of the PASSAGE action, we have continued the explore the use of distributed computing for processing of large corpora, largely using GRID 5000 and a local cluster at INRIA Rocquencourt. We use more and more such resources also for the post-parsing phases and the ambition is to use them for machine-learning phases.

GRID5000 and the local cluster were specially useful for the parsing evaluation campaign (October and November 2009), even such real life experiments tend to show that scripts in such complex environments are never robust enough.


Logo Inria