Team classic

Section: Application Domains

Computational linguistics

The aim is to propose and study new language models that could bridge the gap between two families of approaches: models oriented towards the statistical analysis of large corpora, and grammars oriented towards the description of syntactic features as understood by academic experts. A new model is currently under construction that combines ideas from variable-order Markov chains and from lossless compression schemes of the Lempel-Ziv family; it should derive syntactic patterns from as few observations as possible. (Note: this application was not part of the project we submitted to create the team. It is handled by Thomas Mainguy, who began a thesis on corpus linguistics in September 2010 under the supervision of Olivier Catoni.)
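To give a concrete feel for the Lempel-Ziv ingredient mentioned above, here is a minimal sketch of textbook LZ78 incremental parsing applied to a sequence of words. It is not the model under construction, whose details are not given here; the function name `lz78_phrases` and the example sentence are hypothetical and chosen only for illustration. The sketch shows how repeated word contexts get extended one token at a time, which is the kind of variable-length context a variable-order Markov chain can condition on.

```python
def lz78_phrases(tokens):
    """Split a token sequence into LZ78 phrases (illustrative sketch only).

    Each phrase is the shortest prefix of the remaining input that has not
    yet been recorded as a phrase, so the set of phrases grows like a tree
    of progressively longer contexts.
    """
    seen = set()      # phrases recorded so far, stored as tuples of tokens
    phrases = []      # phrases in order of discovery
    current = ()      # phrase currently being extended
    for token in tokens:
        current = current + (token,)
        if current not in seen:
            # First time this sequence appears: record it and start afresh.
            seen.add(current)
            phrases.append(current)
            current = ()
    if current:
        # Input ended in the middle of an already-known phrase.
        phrases.append(current)
    return phrases


sentence = "the cat sat on the mat the cat ran".split()
for phrase in lz78_phrases(sentence):
    print(" ".join(phrase))
```

On this toy sentence the second and third occurrences of "the" are extended into the two-word phrases "the mat" and "the cat", so frequently repeated contexts are the ones that grow longer as more text is observed.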


