Bibliography

Major publications by the team in recent years
[1]
D. Fišer, B. Sagot.
Constructing a poor man's wordnet in a resource-rich world, in: Language Resources and Evaluation, 2015, vol. 49, no 3, pp. 601-635. [ DOI : 10.1007/s10579-015-9295-6 ]
https://hal.inria.fr/hal-01174492
[2]
P. Lopez, L. Romary.
HUMB: Automatic Key Term Extraction from Scientific Articles in GROBID, in: SemEval 2010 Workshop, Uppsala, Sweden, ACL SigLex event, July 2010, pp. 248-251.
https://hal.inria.fr/inria-00493437
[3]
C. Ribeyre, É. Villemonte de La Clergerie, D. Seddah.
Because Syntax does Matter: Improving Predicate-Argument Structures Parsing Using Syntactic Features, in: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, United States, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, June 2015.
https://hal.archives-ouvertes.fr/hal-01174533
[4]
L. Romary.
TEI and LMF crosswalks, in: JLCL - Journal for Language Technology and Computational Linguistics, 2015, vol. 30, no 1.
https://hal.inria.fr/hal-00762664
[5]
B. Sagot.
The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French, in: 7th international conference on Language Resources and Evaluation (LREC 2010), Valletta, Malta, May 2010.
https://hal.inria.fr/inria-00521242
[6]
B. Sagot, É. Villemonte de La Clergerie.
Error mining in parsing results, in: The 21st International Conference of the Association for Computational Linguistics (ACL 2006), Sydney, Australia, July 2006, pp. 329-336.
https://hal.inria.fr/hal-02270412
[7]
D. Seddah, B. Sagot, M. Candito, V. Mouilleron, V. Combet.
The French Social Media Bank: a Treebank of Noisy User Generated Content, in: COLING 2012 - 24th International Conference on Computational Linguistics, Mumbai, India, M. Kay, C. Boitet (editors), December 2012.
http://hal.inria.fr/hal-00780895
[8]
R. Tsarfaty, D. Seddah, Y. Goldberg, S. Kübler, Y. Versley, M. Candito, J. Foster, I. Rehbein, L. Tounsi.
Statistical Parsing of Morphologically Rich Languages (SPMRL): What, How and Whither, in: Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, Los Angeles, United States, Association for Computational Linguistics, 2010, pp. 1–12.
[9]
R. Tsarfaty, D. Seddah, S. Kübler, J. Nivre.
Parsing Morphologically Rich Languages: Introduction to the Special Issue, in: Computational Linguistics, March 2013, vol. 39, no 1, 8 p. [ DOI : 10.1162/COLI_a_00133 ]
https://hal.inria.fr/hal-00780897
[10]
É. Villemonte de La Clergerie.
Improving a symbolic parser through partially supervised learning, in: The 13th International Conference on Parsing Technologies (IWPT), Nara, Japan, November 2013.
https://hal.inria.fr/hal-00879358

Publications of the year

Articles in International Peer-Reviewed Journals

[11]
D. Reineke, L. Romary.
Bridging the gap between SKOS and TBX, in: edition - Die Fachzeitschrift für Terminologie, November 2019, vol. 19, no 2.
https://hal.inria.fr/hal-02398820
[12]
L. Romary, C. Riondet.
Towards multiscale archival digital data, in: Umanistica digitale, 2019. [ DOI : 10.6092/issn.2532-8816/9045 ]
https://hal.inria.fr/hal-01586389

Invited Conferences

[13]
M. Fabre, Y. Dupont, É. Villemonte de La Clergerie.
Syntactic Parsing versus MWEs: What can fMRI signal tell us, in: PARSEME-FR 2019 consortium meeting, Blois, France, PARSEME-FR 2019, June 2019.
https://hal.inria.fr/hal-02272288
[14]
L. Romary, M. Khemakhem, F. Khan, J. Bowers, N. Calzolari, M. George, M. Pet, P. Bański.
LMF Reloaded, in: AsiaLex 2019: Past, Present and Future, Istanbul, Turkey, June 2019, https://arxiv.org/abs/1906.02136.
https://hal.inria.fr/hal-02118319
[15]
G. Walther, B. Sagot.
Morphological complexities, in: 16th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, Florence, Italy, August 2019.
https://hal.inria.fr/hal-02266999

International Conferences with Proceedings

[16]
F. Alva-Manchego, L. Martin, C. Scarton, L. Specia.
EASSE: Easier Automatic Sentence Simplification Evaluation, in: EMNLP-IJCNLP 2019 - Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (demo session), Hong Kong, China, November 2019, pp. 49-54.
https://hal.inria.fr/hal-02272950
[17]
H. Bohbot, F. Frontini, F. Khan, M. Khemakhem, L. Romary.
Nénufar: Modelling a Diachronic Collection of Dictionary Editions as a Computational Lexical Resource, in: ELEX 2019: smart lexicography, Sintra, Portugal, October 2019.
https://hal.inria.fr/hal-02272978
[18]
J. Bowers, M. Khemakhem, L. Romary.
TEI Encoding of a Classical Mixtec Dictionary Using GROBID-Dictionaries, in: ELEX 2019: Smart Lexicography, Sintra, Portugal, October 2019.
https://hal.inria.fr/hal-02264033
[19]
J. Bowers, L. Romary.
TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language, in: 6th International Conference on Language Documentation and Conservation (ICLDC), Honolulu, United States, February 2019.
https://hal.inria.fr/hal-02075475
[20]
B. Crabbé, M. Fabre, C. Pallier.
Variable beam search for generative neural parsing and its relevance for the analysis of neuro-imaging signal, in: EMNLP-IJCNLP 2019 - Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Hong Kong, China, November 2019.
https://hal.inria.fr/hal-02272303
[21]
M. Dinarelli, L. Grobol.
Hybrid Neural Networks for Sequence Modelling: The Best of Three Worlds, in: TALN-RECITAL 2019 - 26ème Conférence sur le Traitement Automatique des Langues Naturelles, Toulouse, France, Conférence sur le Traitement Automatique des Langues Naturelles (TALN-RECITAL), ATALA, July 2019.
https://hal.archives-ouvertes.fr/hal-02157160
[22]
M. Dinarelli, L. Grobol.
Seq2Biseq: Bidirectional Output-wise Recurrent Neural Networks for Sequence Modelling, in: CICLing 2019 - 20th International Conference on Computational Linguistics and Intelligent Text Processing, La Rochelle, France, April 2019.
https://hal.inria.fr/hal-02085093
[23]
L. Foppiano, L. Romary, M. Ishii, M. Tanifuji.
Automatic Identification and Normalisation of Physical Measurements in Scientific Literature, in: DocEng '19 - ACM Symposium on Document Engineering 2019, Berlin, Germany, ACM Press, September 2019, pp. 1-4. [ DOI : 10.1145/3342558.3345411 ]
https://hal.inria.fr/hal-02294424
[24]
S. Gabay, L. Rondeau Du Noyer, M. Khemakhem.
Selling autograph manuscripts in 19th c. Paris: digitising the Revue des Autographes, in: IX Convegno AIUCD, Milan, Italy, AIUCD, January 2020.
https://hal.archives-ouvertes.fr/hal-02388407
[25]
L. Grobol.
Neural Coreference Resolution with Limited Lexical Context and Explicit Mention Detection for Oral French, in: Second Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC19), Minneapolis, United States, June 2019.
https://hal.inria.fr/hal-02151569
[26]
G. Jawahar, B. Sagot, D. Seddah.
What does BERT learn about the structure of language?, in: ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, July 2019.
https://hal.inria.fr/hal-02131630
[27]
A. F. Khan, H. Bohbot, F. Frontini, M. Khemakhem, L. Romary.
Historical Dictionaries as Digital Editions and Connected Graphs: the Example of Le Petit Larousse Illustré, in: Digital Humanities 2019, Utrecht, Netherlands, July 2019.
https://hal.inria.fr/hal-02111199
[28]
B. Muller, B. Sagot, D. Seddah.
Enhancing BERT for Lexical Normalization, in: The 5th Workshop on Noisy User-generated Text (W-NUT), Hong Kong, China, November 2019.
https://hal.inria.fr/hal-02294316
[29]
P. J. Ortiz Suárez, B. Sagot, L. Romary.
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures, in: 7th Workshop on the Challenges in the Management of Large Corpora (CMLC-7), Cardiff, United Kingdom, P. Bański, A. Barbaresi, H. Biber, E. Breiteneder, S. Clematide, M. Kupietz, H. Lüngen, C. Iliadi (editors), Leibniz-Institut für Deutsche Sprache, July 2019.
https://hal.inria.fr/hal-02148693
[30]
M. Regnault, S. Prévost, É. Villemonte de La Clergerie.
Challenges of language change and variation: towards an extended treebank of Medieval French, in: TLT 2019 - 18th International Workshop on Treebanks and Linguistic Theories, Paris, France, August 2019.
https://hal.inria.fr/hal-02272560
[31]
M. Regnault.
Adapting a Metagrammar for Contemporary French to Medieval French, in: TALN-RECITAL 2019 - 26e édition de la conférence TALN (Traitement Automatique des Langues Naturelles) et 21e édition de la conférence jeunes chercheur·euse·s RECITAL, Toulouse, France, July 2019.
https://hal.inria.fr/hal-02147686
[32]
L. Romary.
The place of lexicography in (computer) science, in: The Future of Academic Lexicography: Linguistic Knowledge Codification in the Era of Big Data and AI, Leiden, Netherlands, F. Steurs, D. Geeraerts, N. Schiller, M. Klamer, I. Kosem (editors), November 2019.
https://hal.inria.fr/hal-02358218
[33]
L. Rondeau Du Noyer, S. Gabay, M. Khemakhem, L. Romary.
Scaling up Automatic Structuring of Manuscript Sales Catalogues, in: TEI 2019: What is text, really? TEI and beyond, Graz, Austria, September 2019.
https://hal.inria.fr/hal-02272962
[34]
B. Sagot.
Development of a morphological and syntactic lexicon of Old French, in: 26ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN), Toulouse, France, July 2019.
https://hal.inria.fr/hal-02148701

Conferences without Proceedings

[35]
S. Bassett, L. Wessels, S. Krauwer, B. Maegaard, H. Hollander, F. Admiraal, L. Romary, F. Uiterwaal.
Connecting the Humanities through Research Infrastructures, in: 4th Digital Humanities in the Nordic Countries (DHN 2019), Copenhagen, Denmark, March 2019.
https://hal.inria.fr/hal-02047512
[36]
B. Caron, M. Courtin, K. Gerdes, S. Kahane.
A Surface-Syntactic UD Treebank for Naija, in: TLT 2019, Treebanks and Linguistic Theories, Syntaxfest, Paris, France, August 2019.
https://hal.archives-ouvertes.fr/hal-02270530
[37]
A. Chagué, V. Le Fourner, M. Martini, É. Villemonte de La Clergerie.
Deux siècles de sources disparates sur l'industrie textile en France : comment automatiser les traitements d'un corpus non-uniforme ?, in: Colloque DHNord 2019 "Corpus et archives numériques", Lille, France, MESHS Lille Nord de France, October 2019.
https://hal.inria.fr/hal-02448921
[38]
X. Chen, K. Gerdes.
The relation between dependency distance and frequency, in: Quasy 2019, Quantitative Syntax 2019, Syntaxfest, Paris, France, August 2019.
https://hal.archives-ouvertes.fr/hal-02270528
[39]
C. Dong, Y. Li, K. Gerdes.
Character-level Annotation for Chinese Surface-Syntactic Universal Dependencies, in: Depling 2019 - International Conference on Dependency Linguistics, Paris, France, August 2019.
https://hal.archives-ouvertes.fr/hal-02270535
[40]
K. Gerdes, S. Kahane, X. Chen.
Rediscovering Greenberg's Word Order Universals in UD, in: UDW, Universal Dependencies Workshop 2019, Syntaxfest, Paris, France, August 2019.
https://hal.archives-ouvertes.fr/hal-02270531
[41]
G. Jawahar, D. Seddah.
Contextualized Diachronic Word Representations, in: 1st International Workshop on Computational Approaches to Historical Language Change 2019 (co-located with ACL 2019), Florence, Italy, August 2019.
https://hal.archives-ouvertes.fr/hal-02194763
[42]
M. Khemakhem, I. Galleron, G. Williams, L. Romary, P. J. Ortiz Suárez.
How OCR Performance can Impact on the Automatic Extraction of Dictionary Content Structures, in: 19th annual Conference and Members' Meeting of the Text Encoding Initiative Consortium (TEI) - What is text, really? TEI and beyond, Graz, Austria, September 2019.
https://hal.archives-ouvertes.fr/hal-02263276
[43]
P. J. Ortiz Suárez, L. Romary, B. Sagot.
Preparing the Dictionnaire Universel for Automatic Enrichment, in: 10th International Conference on Historical Lexicography and Lexicology (ICHLL), Leeuwarden, Netherlands, June 2019.
https://hal.inria.fr/hal-02131598
[44]
J. C. Rosales Nunez, D. Seddah, G. Wisniewski.
A Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content, in: The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa'19), Turku, Finland, September 2019.
https://hal.archives-ouvertes.fr/hal-02270524

Scientific Books (or Scientific Book chapters)

[45]
Proceedings of the Fifth International Conference on Dependency Linguistics (Depling, SyntaxFest 2019), August 2019.
https://hal.inria.fr/hal-02450315
[46]
PARTHENOS (editor)
Share - Publish - Store - Preserve. Methodologies, Tools and Challenges for 3D Use in Social Sciences and Humanities, PARTHENOS and consortium 3D-SHS and LIA MAP-ISTI, Marseille, France, May 2019.
https://hal.archives-ouvertes.fr/hal-02155055
[47]
J. Edmond, F. Fischer, L. Romary, T. Tasovac.
9. Springing the Floor for a Different Kind of Dance: Building DARIAH as a Twenty-First-Century Research Infrastructure for the Arts and Humanities, in: Digital Technology and the Practices of Humanities Research, Open Book Publishers, February 2020, pp. 207-234. [ DOI : 10.11647/OBP.0192.09 ]
https://hal.inria.fr/hal-02464622
[48]
J. Edmond, L. Romary.
3. Academic Publishing, in: Digital Technology and the Practices of Humanities Research, Open Book Publishers, February 2020, pp. 49-80. [ DOI : 10.11647/OBP.0192.03 ]
https://hal.inria.fr/hal-02464616
[49]
K. Gerdes, S. Kahane, R. Bawden, J. Beliao, É. Villemonte de La Clergerie, I. Wang.
Annotation tools for syntax, in: Rhapsodie: A Prosodic and Syntactic Treebank for Spoken French, June 2019.
https://hal.inria.fr/hal-02450311
[50]
S. Kahane, K. Gerdes, R. Bawden.
The microsyntactic annotation, in: Rhapsodie: A Prosodic and Syntactic Treebank for Spoken French, June 2019.
https://hal.inria.fr/hal-02450018
[51]
S. Kahane, P. Pietrandrea, K. Gerdes.
The annotation of list structures, in: Rhapsodie: A Prosodic and Syntactic Treebank for Spoken French, June 2019.
https://hal.inria.fr/hal-02450034
[52]
L. Romary, J. Edmond.
A Tangential View on Impact for the Arts and Humanities through the Lens of the DARIAH-ERIC, in: Stay Tuned To The Future - Impact of the Research Infrastructures for Social Sciences and Humanities, B. Maegaard, R. Pozzo (editors), Leo S. Olschki Editore, 2019.
https://hal.inria.fr/hal-02094713

Other Publications

[53]
A. Bertino, L. Foppiano, L. Romary, P. Mounier.
Leveraging Concepts in Open Access Publications, March 2019, working paper or preprint.
https://hal.inria.fr/hal-01981922
[54]
J. Bowers.
Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec, February 2019, working paper or preprint.
https://hal.inria.fr/hal-02004005
[55]
J. Bowers.
Pathways and patterns of metaphor & metonymy in Mixtepec-Mixtec body-part terms, February 2020, working paper or preprint.
https://hal.inria.fr/hal-02075731
[56]
Y. Dupont.
Un corpus libre, évolutif et versionné en entités nommées du français, July 2019, TALN 2019 - Traitement Automatique des Langues Naturelles, Poster.
https://hal.archives-ouvertes.fr/hal-02448590
[57]
M. Fabre, S. Bhattasali, C. Pallier, J. Hale.
Modeling Conventionalization and Predictability in Multi-Word Expressions at Brain-level, September 2019, CRCNS 2019, Poster.
https://hal.inria.fr/hal-02272435
[58]
M. Fabre, B. Crabbé, C. Pallier.
Variable beam search for generative neural parsing and its fit with neuro-imaging signal, September 2019, CRCNS 2019, Poster.
https://hal.inria.fr/hal-02272475
[59]
K. Gerdes, B. Guillaume, S. Kahane, G. Perrier.
Pourquoi se tourner vers le SUD : L'importance de choisir un schéma d'annotation en dépendance surface-syntaxique, November 2019, LIFT 2019 - Journées scientifiques "Linguistique informatique, formelle & de terrain".
https://hal.inria.fr/hal-02449922
[60]
L. Martin, B. Muller, P. J. Ortiz Suárez, Y. Dupont, L. Romary, É. Villemonte de La Clergerie, D. Seddah, B. Sagot.
CamemBERT: a Tasty French Language Model, October 2019, https://arxiv.org/abs/1911.03894 - Web site: https://camembert-model.fr.
https://hal.inria.fr/hal-02445946
[61]
L. Martin, B. Sagot, É. Villemonte de La Clergerie, A. Bordes.
Controllable Sentence Simplification, October 2019, https://arxiv.org/abs/1910.02677 - Code and models: https://github.com/facebookresearch/access.
https://hal.inria.fr/hal-02445874
[62]
A. Málaga Sabogal, S. Troubetzkoy.
Unique ergodicity for infinite area Translation Surfaces, August 2019, https://arxiv.org/abs/1908.04019 - working paper or preprint.
https://hal.archives-ouvertes.fr/hal-02265283
[63]
C. Rochereau, B. Sagot, E. Dupoux.
Modeling German Verb Argument Structures: LSTMs vs. Humans, December 2019, https://arxiv.org/abs/1912.00239 - working paper or preprint.
https://hal.archives-ouvertes.fr/hal-02417640
[64]
L. Romary, D. Biabiany, K. Illmayer, M. Puren, C. Riondet, D. Seillier, L. Tadjou.
SSK by example - Make your Arts and Humanities research go standard, May 2019, DARIAH Annual Event, Poster.
https://hal.inria.fr/hal-02151788
[65]
L. Romary.
The TEI as a modeling infrastructure: TEI beyond the TEI realms, July 2019, Ringvorlesung Digital Humanities.
https://hal.inria.fr/hal-02265036
[66]
A. Srivastava, B. Muller, D. Seddah.
Unsupervised Learning for Handling Code-Mixed Data: A Case Study on POS Tagging of North-African Arabizi Dialect, October 2019, EurNLP - First annual EurNLP, Poster.
https://hal.archives-ouvertes.fr/hal-02270527

References in notes
[67]
A. Abeillé, L. Clément, F. Toussenel.
Building a Treebank for French, in: Treebanks: Building and Using Parsed Corpora, Kluwer, Dordrecht, 2003, pp. 165-187.
[68]
M. J. Aranzabe, A. D. De Ilarraza, I. Gonzalez-Dios.
Transforming complex sentences using dependency trees for automatic text simplification in Basque, in: Procesamiento del lenguaje natural, 2013, vol. 50, pp. 61–68.
[69]
S. Bhattasali, M. Fabre, W.-M. Luh, H. Al Saied, M. Constant, C. Pallier, J. R. Brennan, R. N. Spreng, J. Hale.
Localising Memory Retrieval and Syntactic Composition: An fMRI Study of Naturalistic Language Comprehension, in: Language, Cognition and Neuroscience, 2018, vol. 34, no 4, pp. 1-20. [ DOI : 10.1080/23273798.2018.1518533 ]
https://hal.archives-ouvertes.fr/hal-01930201
[70]
O. Bonami, B. Sagot.
Computational methods for descriptive and theoretical morphology: a brief introduction, in: Morphology, 2017, vol. 27, no 4, pp. 1-7. [ DOI : 10.1017/CBO9781139248860 ]
https://hal.inria.fr/hal-01628253
[71]
A. Bouchard-Côté, D. Hall, T. Griffiths, D. Klein.
Automated Reconstruction of Ancient Languages using Probabilistic Models of Sound Change, in: Proceedings of the National Academy of Sciences, 2013, vol. 110, pp. 4224–4229.
[72]
J. Bowers, L. Romary.
Bridging the Gaps between Digital Humanities, Lexicography, and Linguistics: A TEI Dictionary for the Documentation of Mixtepec-Mixtec, in: Dictionaries: Journal of the Dictionary Society of North America, 2018, vol. 39, no 2, pp. 79-106.
https://hal.inria.fr/hal-01968871
[73]
J. C. K. Cheung, G. Penn.
Utilizing Extra-sentential Context for Parsing, in: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, Cambridge, Massachusetts, EMNLP '10, 2010, pp. 23–33.
[74]
M. Constant, M. Candito, D. Seddah.
The LIGM-Alpage Architecture for the SPMRL 2013 Shared Task: Multiword Expression Analysis and Dependency Parsing, in: Fourth Workshop on Statistical Parsing of Morphologically Rich Languages, Seattle, United States, October 2013, pp. 46-52.
https://hal.archives-ouvertes.fr/hal-00932372
[75]
J. Devlin, M. Chang, K. Lee, K. Toutanova.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2-7, 2019, Volume 1 (Long and Short Papers), 2019, pp. 4171–4186.
https://www.aclweb.org/anthology/N19-1423/
[76]
Y. Fang, M. Chang.
Entity Linking on Microblogs with Spatial and Temporal Signals, in: TACL, 2014, vol. 2, pp. 259–272.
https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/323
[77]
K. Gerdes, B. Guillaume, S. Kahane, G. Perrier.
SUD or Surface-Syntactic Universal Dependencies: An annotation scheme near-isomorphic to UD, in: Universal Dependencies Workshop 2018, Brussels, Belgium, November 2018.
https://hal.inria.fr/hal-01930614
[78]
J. Hewitt, C. D. Manning.
A Structural Probe for Finding Syntax in Word Representations, in: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, 2019.
https://nlp.stanford.edu/pubs/hewitt2019structural.pdf
[79]
J. E. Hoard, R. Wojcik, K. Holzhauser.
An automated grammar and style checker for writers of Simplified English, in: Computers and Writing: State of the Art, 1992, pp. 278–296.
[80]
D. Hovy, T. Fornaciari.
Increasing In-Class Similarity by Retrofitting Embeddings with Demographic Information, in: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, 2018, pp. 671–677.
http://aclweb.org/anthology/D18-1070
[81]
D. Hruschka, S. Branford, E. Smith, J. Wilkins, A. Meade, M. Pagel, T. Bhattacharya.
Detecting Regular Sound Changes in Linguistics as Events of Concerted Evolution, in: Current Biology, 2015, vol. 25, no 1, pp. 1–9.
[82]
G. Jawahar, B. Muller, A. Fethi, L. Martin, É. Villemonte de La Clergerie, B. Sagot, D. Seddah.
ELMoLex: Connecting ELMo and Lexicon features for Dependency Parsing, in: CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 2018. [ DOI : 10.18653/v1/K18-2023 ]
https://hal.inria.fr/hal-01959045
[83]
M. Khemakhem, L. Foppiano, L. Romary.
Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields, in: electronic lexicography, eLex 2017, Leiden, Netherlands, September 2017.
https://hal.archives-ouvertes.fr/hal-01508868
[84]
M. Khemakhem, L. Foppiano, L. Romary.
Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields, in: electronic lexicography, eLex 2017, Leiden, Netherlands, September 2017.
https://hal.archives-ouvertes.fr/hal-01508868
[85]
S. Kübler, M. Scheutz, E. Baucom, R. Israel.
Adding Context Information to Part Of Speech Tagging for Dialogues, in: NEALT Proceedings Series, M. Dickinson, K. Muurisep, M. Passarotti (editors), 2010, vol. 9, pp. 115-126.
[86]
A.-L. Ligozat, C. Grouin, A. Garcia-Fernandez, D. Bernhard.
Approches à base de fréquences pour la simplification lexicale, in: TALN-RÉCITAL 2013, 2013, 493 p.
[87]
L. Martin, S. Humeau, P.-E. Mazaré, A. Bordes, É. Villemonte de La Clergerie, B. Sagot.
Reference-less Quality Estimation of Text Simplification Systems, in: 1st Workshop on Automatic Text Adaptation (ATA), Tilburg, Netherlands, November 2018.
https://hal.inria.fr/hal-01959054
[88]
H. Martínez Alonso, D. Seddah, B. Sagot.
From Noisy Questions to Minecraft Texts: Annotation Challenges in Extreme Syntax Scenarios, in: 2nd Workshop on Noisy User-generated Text (W-NUT) at CoLing 2016, Osaka, Japan, December 2016.
https://hal.inria.fr/hal-01584054
[89]
M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark, K. Lee, L. Zettlemoyer.
Deep Contextualized Word Representations, in: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1-6, 2018, Volume 1 (Long Papers), 2018, pp. 2227–2237.
https://www.aclweb.org/anthology/N18-1202/
[90]
J. Pyssalo.
System PIE: the Primary Phoneme Inventory and Sound Law System for Proto-Indo-European, University of Helsinki, 2013.
[91]
L. Rello, R. Baeza-Yates, S. Bott, H. Saggion.
Simplify or help?: text simplification strategies for people with dyslexia, in: Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility, ACM, 2013, 15 p.
[92]
L. Rello, R. Baeza-Yates, L. Dempere-Marco, H. Saggion.
Frequent words improve readability and short words improve understandability for people with dyslexia, in: IFIP Conference on Human-Computer Interaction, Springer, 2013, pp. 203–219.
[93]
C. Ribeyre, M. Candito, D. Seddah.
Semi-Automatic Deep Syntactic Annotations of the French Treebank, in: The 13th International Workshop on Treebanks and Linguistic Theories (TLT13), Tübingen, Germany, Proceedings of TLT 13, Tübingen Universität, December 2014.
https://hal.inria.fr/hal-01089198
[94]
L. Romary, M. Khemakhem, F. Khan, J. Bowers, N. Calzolari, M. George, M. Pet, P. Bański.
LMF Reloaded, in: AsiaLex 2019: Past, Present and Future, Istanbul, Turkey, June 2019.
https://hal.inria.fr/hal-02118319
[95]
L. Romary, P. Lopez.
GROBID - Information Extraction from Scientific Publications, in: ERCIM News, January 2015, vol. 100.
https://hal.inria.fr/hal-01673305
[96]
A. M. Rush, R. Reichart, M. Collins, A. Globerson.
Improved Parsing and POS Tagging Using Inter-sentence Consistency Constraints, in: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea, EMNLP-CoNLL '12, 2012, pp. 1434–1444.
[97]
B. Sagot, H. Martínez Alonso.
Improving neural tagging with lexical information, in: 15th International Conference on Parsing Technologies, Pisa, Italy, September 2017, pp. 25-31.
https://hal.inria.fr/hal-01592055
[98]
B. Sagot, D. Nouvel, V. Mouilleron, M. Baranes.
Extension dynamique de lexiques morphologiques pour le français à partir d'un flux textuel, in: TALN - Traitement Automatique du Langage Naturel, Les Sables-d'Olonne, France, June 2013, pp. 407-420.
https://hal.inria.fr/hal-00832078
[99]
B. Sagot, M. Richard, R. Stern.
Annotation référentielle du Corpus Arboré de Paris 7 en entités nommées, in: Traitement Automatique des Langues Naturelles (TALN), Grenoble, France, G. Antoniadis, H. Blanchon, G. Sérasset (editors), Actes de la conférence conjointe JEP-TALN-RECITAL 2012, June 2012, vol. 2 - TALN.
https://hal.inria.fr/hal-00703108
[100]
B. Sagot.
DeLex, a freely-available, large-scale and linguistically grounded morphological lexicon for German, in: Language Resources and Evaluation Conference, Reykjavik, Iceland, European Language Resources Association, May 2014.
https://hal.inria.fr/hal-01022288
[101]
B. Sagot.
External Lexical Information for Multilingual Part-of-Speech Tagging, Inria Paris, June 2016, no RR-8924.
https://hal.inria.fr/hal-01330301
[102]
B. Sagot.
Extracting an Etymological Database from Wiktionary, in: Electronic Lexicography in the 21st century (eLex 2017), Leiden, Netherlands, September 2017, pp. 716-728.
https://hal.inria.fr/hal-01592061
[103]
C. Scarton, M. De Oliveira, A. Candido Jr, C. Gasperin, S. M. Aluísio.
SIMPLIFICA: a tool for authoring simplified texts in Brazilian Portuguese guided by readability assessments, in: Proceedings of the NAACL HLT 2010 Demonstration Session, Association for Computational Linguistics, 2010, pp. 41–44.
[104]
Y. Scherrer, B. Sagot.
A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging for closely related languages, in: Language Resources and Evaluation Conference, Reykjavik, Iceland, European Language Resources Association, May 2014.
https://hal.inria.fr/hal-01022298
[105]
S. Schuster, É. Villemonte de La Clergerie, M. Candito, B. Sagot, C. D. Manning, D. Seddah.
Paris and Stanford at EPE 2017: Downstream Evaluation of Graph-based Dependency Representations, in: EPE 2017 - The First Shared Task on Extrinsic Parser Evaluation, Pisa, Italy, Proceedings of the 2017 Shared Task on Extrinsic Parser Evaluation, September 2017, pp. 47-59.
https://hal.inria.fr/hal-01592051
[106]
D. Seddah, M. Candito.
Hard Time Parsing Questions: Building a QuestionBank for French, in: Tenth International Conference on Language Resources and Evaluation (LREC 2016), Portorož, Slovenia, Proceedings of the 10th edition of the Language Resources and Evaluation Conference (LREC 2016), May 2016.
https://hal.archives-ouvertes.fr/hal-01457184
[107]
D. Seddah, B. Sagot, M. Candito, V. Mouilleron, V. Combet.
The French Social Media Bank: a Treebank of Noisy User Generated Content, in: COLING 2012 - 24th International Conference on Computational Linguistics, Mumbai, India, M. Kay, C. Boitet (editors), December 2012.
https://hal.inria.fr/hal-00780895
[108]
D. Seddah, B. Sagot, M. Candito.
The Alpage Architecture at the SANCL 2012 Shared Task: Robust Pre-Processing and Lexical Bridging for User-Generated Content Parsing, in: SANCL 2012 - First Workshop on Syntactic Analysis of Non-Canonical Language, an NAACL-HLT'12 workshop, Montréal, Canada, June 2012.
https://hal.inria.fr/hal-00703124
[109]
M. Shardlow.
A survey of automated text simplification, in: International Journal of Advanced Computer Science and Applications, 2014, vol. 4, no 1, pp. 58–70.
[110]
A. Søgaard, Y. Goldberg.
Deep multi-task learning with low level tasks supervised at lower layers, in: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Berlin, Germany, 2016, pp. 231–235.
[111]
É. Villemonte de La Clergerie, B. Sagot, D. Seddah.
The ParisNLP entry at the CoNLL UD Shared Task 2017: A Tale of a #ParsingTragedy, in: Conference on Computational Natural Language Learning, Vancouver, Canada, Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, August 2017, pp. 243-252. [ DOI : 10.18653/v1/K17-3026 ]
https://hal.inria.fr/hal-01584168
[112]
É. Villemonte de La Clergerie.
Jouer avec des analyseurs syntaxiques, in: TALN 2014, Marseilles, France, ATALA, July 2014.
https://hal.inria.fr/hal-01005477
[113]
G. Walther, B. Sagot.
Speeding up corpus development for linguistic research: language documentation and acquisition in Romansh Tuatschin, in: Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Vancouver, Canada, Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, August 2017, pp. 89-94. [ DOI : 10.18653/v1/W17-2212 ]
https://hal.inria.fr/hal-01570614