Team Parole

Members
Overall Objectives
Scientific Foundations
Application Domains
Software
New Results
Contracts and Grants with Industry
Dissemination
Bibliography

Bibliography

Major publications by the team in recent years

[1]
M. Abbas, K. Smaïli, D. Berkani.
Multi-category support vector machines for identifying Arabic topics, in: Journal of Research in Computing Science, 2009, vol. 41.
[2]
A. Bonneau, Y. Laprie.
Selective acoustic cues for French voiceless stop consonants, in: The Journal of the Acoustical Society of America, 2008, vol. 123, p. 4482-4497
http://hal.inria.fr/inria-00336049/en/.
[3]
C. Cerisara, S. Demange, J.-P. Haton.
On noise masking for automatic missing data speech recognition: a survey and discussion, in: Computer Speech and Language, 2007, vol. 21, no 3, p. 443-457.
[4]
C. Cerisara, D. Fohr.
Multi-band automatic speech recognition, in: Computer Speech and Language, April 2001, vol. 15, no 2, p. 151-174.
[5]
C. Cerisara, L. Rigazio, J.-C. Junqua.
$ \alpha$ -Jacobian environmental adaptation, in: Speech Communication, January 2004, vol. 42, no 1, p. 25–41, Special Issue on Adaptation Methods for Automatic Speech Recognition.
[6]
K. Daoudi, D. Fohr, C. Antoine.
Dynamic Bayesian Networks for Multi-Band Automatic Speech Recognition, in: Computer Speech and Language, 2003, vol. 17, p. 263-285.
[7]
J.-P. Haton, C. Cerisara, D. Fohr, Y. Laprie, K. Smaïli.
Reconnaissance Automatique de la Parole. Du signal à son interprétation, Dunod, 2006
http://hal.inria.fr/inria-00105908/en/.
[8]
D. Langlois, A. Brun, K. Smaïli, J.-P. Haton.
Événements impossibles en modélisation stochastique du langage, in: Traitement Automatique des Langues, Jul 2003, vol. 44, no 1, p. 33-61.
[9]
C. Lavecchia, K. Smaïli, D. Langlois, J.-P. Haton.
Using inter-lingual triggers for Machine translation, in: Eighth conference INTERSPEECH 2007, Antwerp/Belgium, 08 2007
http://hal.inria.fr/inria-00155791/en/.
[10]
S. Ouni, Y. Laprie.
Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion, in: Journal of the Acoustical Society of America (JASA), 2005, vol. 118 (1), p. 444–460
http://hal.archives-ouvertes.fr/hal-00008682/en/, PACS numbers: 43.70.h, 43.70.Bk, 43.70.Aj [DOS].
[11]
I. Zitouni, K. Smaïli, J.-P. Haton.
Statistical Language Modeling Based on Variable-Length Sequences, in: Computer Speech and Language, Jan 2003, vol. 17, no 1, p. 27-41.

Publications of the year

Articles in International Peer-Reviewed Journal

[12]
J. Cai, G. Bouselmi, Y. Laprie, J.-P. Haton.
Efficient likelihood evaluation and dynamic Gaussian selection for HMM-based speech recognition, in: Computer Speech & Language / Computer Speech and Language, 2009, vol. 23, no 2, p. 147-256
http://hal.inria.fr/inria-00432533/en/.
[13]
C. Cerisara.
Automatic discovery of topics and acoustic morphemes from speech, in: Computer Speech & Language / Computer Speech and Language, 2009, vol. 23, no 2, p. 220-239
http://hal.inria.fr/inria-00330698/en/.
[14]
S. Demange, C. Cerisara, J.-P. Haton.
Missing data mask estimation with frequency and temporal dependencies, in: Computer Speech & Language / Computer Speech and Language, 2009, vol. 23, no 1, p. 25-41
http://hal.inria.fr/inria-00338397/en/.
[15]
E. Didiot, I. Illina, D. Fohr, O. Mella.
A wavelet-based parameterization for speech/music discrimination, in: Computer Speech & Language / Computer Speech and Language, 2010, vol. 24, no 2, p. 341-357
http://hal.archives-ouvertes.fr/hal-00435076/en/.
[16]
P. Kral, C. Cerisara.
Dialogue act recognition approaches, in: Computing And Informatics, 2009
http://hal.inria.fr/inria-00431396/en/.
[17]
L. Sprenger-Charolles, P. Colé, A. Kipffer-Piquard, F. Pinton, C. Billard.
Reliability and prevalence of an atypical development of phonological skills in French-speaking dyslexics, in: Reading and Writing, 2009, vol. 22, no 7, p. 811-842
http://hal.archives-ouvertes.fr/hal-00414110/en/.

Invited Conferences

[18]
S. Ouni, Y. Laprie.
Studying pharyngealisation using an articulograph, in: International Workshop on Pharyngeals and Pharyngealisation, Royaume-Uni Newcastle, 2009
http://hal.archives-ouvertes.fr/hal-00431829/en/.

International Peer-Reviewed Conference/Proceedings

[19]
M. Abbas, K. Smaïli, D. Berkani.
Multi-category support vector machines for identifying Arabic topics, in: 10th International Conference on Intelligent Text Processing and Computational Linguistics - CICLing 2009, Mexique Mexico, 2009, vol. 41
http://hal.inria.fr/inria-00403102/en/.
[20]
M. Aron, A. Toutios, M.-O. Berger, E. Kerrien, B. Wrobel-Dautcourt, Y. Laprie.
Registration of Multimodal Data for Estimating the Parameters of an Articulatory Model, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taiwan Taipei, 2009
http://hal.inria.fr/inria-00350298/en/.
[21]
A. Bonneau, J. Busset, B. Wrobel-Dautcourt.
Contextual effects on protrusion and lip opening for /i,y/, in: 10th Annual Conference of the International Speech Communication Association - Interspeech 2009, Royaume-Uni Brighton, ISCA, 2009
http://hal.inria.fr/inria-00433381/en/.
[22]
M. Cadot, A. Lelu.
Massive Pruning for Building an Operational Set of Association Rules: Metarules for Eliminating Conflicting and Redundant Rules., in: International Conference on Information, Process, and Knowledge Management - eKnow09, Mexique Cancun, A. Kusiak, Sang-goo. Lee (editors), 2009, p. 90-98
http://hal.inria.fr/inria-00337067/en/.
[23]
J. Cai, Y. Laprie, J. Busset, F. Hirsch.
Articulatory Modeling Based on Semi-polar Coordinates and Guided PCA Technique, in: 10th Annual Conference of the International Speech Communication Association - INTERSPEECH 2009, Royaume-Uni Brighton, 2009
http://hal.inria.fr/inria-00433067/en/.
[24]
C. Cerisara, O. Mella, D. Fohr.
JTrans, an open-source software for semi-automatic text-to-speech alignment, in: Proceedings of the 10th Annual Conference of the International Speech Communication Association - Interspeech 2009, Royaume-Uni Brighton, 2009
http://hal.inria.fr/inria-00431398/en/.
[25]
I. Jemaa, O. Rekhis, K. Ouni, Y. Laprie.
An Evaluation of Formant Tracking methods on an Arabic Database, in: 10th Annual Conference of the International Speech Communication Association - INTERSPEECH 2009, Royaume-Uni Brighton, 2009
http://hal.inria.fr/inria-00433057/en/.
[26]
K. Meftouh, K. Smaïli, M. T. Laskri.
Comparative study of Arabic and french statistical language models, in: International Conference On agents and Artificial Intelligence - ICAART'09, Portugal Porto, INSTICC, 2009
http://hal.inria.fr/inria-00352927/en/.
[27]
B. Potard, Y. Laprie.
A robust variational method for the acoustic-to-articulatory problem, in: 10th Annual Conference of the International Speech Communication Association - INTERSPEECH 2009, Royaume-Uni Brighton, 2009
http://hal.inria.fr/inria-00433053/en/.
[28]
S. Raybaud, D. Langlois, K. Smaïli.
Efficient Combination of Confidence Measures for Machine Translation, in: 10th Annual Conference of the International Speech Communication Association - INTERSPEECH 2009, Royaume-Uni Brighton, 2009
http://hal.inria.fr/inria-00417546/en/.
[29]
S. Raybaud, C. Lavecchia, D. Langlois, K. Smaïli.
New Confidence Measures for Statistical Machine Translation, in: International Conference On Agents and Artificial Intelligence - ICAART 09, Portugal Porto, 2009
http://hal.inria.fr/inria-00333843/en/.
[30]
S. Raybaud, C. Lavecchia, D. Langlois, K. Smaïli.
Word- and sentence-level confidence measures for machine translation, in: 13th Annual Meeting of the European Association for Machine Translation - EAMT 09, Espagne Barcelona, 2009
http://hal.inria.fr/inria-00417541/en/.
[31]
F. Stouten, D. Fohr, I. Illina.
Detection of OOV words by combining acoustic confidence measures with linguistic features, in: The eleventh biannual IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), Italie Merano, 2009, p. 1-4
http://hal.archives-ouvertes.fr/hal-00435087/en/.

National Peer-Reviewed Conference/Proceedings

[32]
J. Busset.
Utilisation d'une grille polaire adaptative pour la construction d'un modèle articulatoire de la langue, in: Rencontres Jeunes Chercheurs en Parole - RJCP 2009, France Avignon, 2009, 72 p
http://hal.archives-ouvertes.fr/hal-00433299/en/.
[33]
C. Cerisara, C. Gardent.
Analyse syntaxique du français parlé, in: Journée ATALA, France Paris, 2009
http://hal.inria.fr/inria-00432754/en/.

Scientific Books (or Scientific Book chapters)

[34]
M. Aron, M.-O. Berger, E. Kerrien, Y. Laprie.
Acquisition multimodale de données articulatoires, in: L'imagerie médicale pour l'étude de la parole, A. Marchal, C. Cavé (editors), Hermes Science Publications, 2009, p. 175-196
http://hal.inria.fr/inria-00429585/en/.
[35]
M. Embarki, S. Ouni, M. Yeou, C. Guilleminot, S. Al Maqtari.
Acoustic and EMA study of pharyngealization : Coarticulatory effects as index of stylistic and regional distinction, in: Instrumental Studies in Arabic Phonetics, M. Hassan, B. Heselwood (editors), Benjamins, 2009, p. 1-56
http://hal.archives-ouvertes.fr/hal-00348775/en/.

References in notes

[36]
C. Abry, T. Lallouache.
Le MEM: un modèle d'anticipation paramétrable par locuteur: Données sur l'arrondissement en français, in: Bulletin de la communication parlée, 1995, vol. 3, no 4, p. 85–89.
[37]
D. Achlioptas.
Database-friendly random projections, in: PODS '01: Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, New York, NY, USA, ACM, 2001, p. 274–281.
[38]
C. Barras, E. Geoffrois, Z. Wu, M. Liberman.
Transcriber: development and use of a tool for assisting speech corpora production, in: Speech Communication, 2001, p. 5–22.
[39]
P. F. Brown, etal.
A statistical Approach to MAchine Translation, in: Computational Linguistics, 1990, vol. 16, p. 79-85.
[40]
A. Brun.
Détection de thème et adaptation des modèles de langage pour la reconnaissance automatique de la parole, Université Henri Poincaré - Nancy I, 2003, Ph. D. Thesis.
[41]
R. Clark, K. Richmond, S. King.
Festival 2 - Build your own general purpose unit selection speech synhtesiser, in: ISCA 5th Speech Synthesis Workshop, Pittsburgh, 2004, p. 201–206.
[42]
M. Cohen, D. Massaro.
Modeling coarticulation in synthetic visual speech, 1993.
[43]
V. Colotte, R. Beaufort.
Linguistic features weighting for a Text-To-Speech system without prosody model, in: proceedings of EUROSPEECH/INTERSPEECH 2005, 2005, p. 2549-2552
http://hal.ccsd.cnrs.fr/ccsd-00012561/en/.
[44]
V. Colotte, Y. Laprie.
Higher precision pitch marking for TD-PSOLA, in: XI European Signal Processing Conference EUSIPCO, Toulouse, France, September 2002, vol. 1, p. 419-422.
[45]
J. Di Martino, Y. Laprie.
An Efficient F0 Determination Algorithm Based on the Implicit Calculation of the Autocorrelation of the Temporal Excitation Signal, in: 6th European Conference on Speech Communication and Technology - EUROSPEECH'99, Budapest, Hungary, 1999, 4 p p
http://hal.archives-ouvertes.fr/inria-00098759/en/.
[46]
E. Farnetani.
Labial coarticulation, in: In Coarticulation: Theory, data and techniques, Cambridge, W. J. Hardcastle, N. Hewlett (editors), Cambridge university press, 1999, chap. 8.
[47]
M.-C. Haton.
The teaching wheel: an agent for site viewing and subsite building, in: Int. Conf. Human-Computer Interaction, Heraklion, Greece, 2003.
[48]
J.-P. Haton, C. Cerisara, D. Fohr, Y. Laprie, K. Smaïli.
Reconnaissance Automatique de la Parole Du signal à son interprétation, UniverSciences (Paris) - ISSN 1635-625X, DUNOD, 2006
http://hal.inria.fr/inria-00105908/en/, I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE, I.: Computing Methodologies/I.5: PATTERN RECOGNITION.
[49]
A. Kain, M. Macon.
Spectral voice conversion for text-to-speech synthesis, in: International Conference on Acoustics, Speech, and Signal Processing, May 1998, p. 285–288.
[50]
A. Kipffer-Piquard.
Prédiction de la réussite ou de l'échec spécifiques en lecture au cycle 2. Suivi d'une population "à risque" et d'une population contrôle de la moyenne section de maternelle à la deuxième année de scolarisation primaire., ARNT - Lille, 2006
http://hal.inria.fr/inria-00185312/en/, Ouvrage disponible à l'ANRT : http://www.anrtheses.com.fr/ Nom de l'auteur : Agnès Piquard-Kipffer. Reproduction de la thèse de Linguistique soutenue à l'Université de Paris 7 - Denis Diderot..
[51]
A. Kipffer-Piquard.
Prédiction dès la maternelle de la réussite et de l'échec spécifique à l'apprentissage de la lecture en fin de cycle 2, in: Les troubles du développement chez l'enfant, Amiens France, L'HARMATTAN, 2007
http://hal.inria.fr/inria-00184601/en/.
[52]
P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, E. Herbst.
Moses: Open Source Toolkit for Statistical Machine Translation, in: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), demonstration session, June 2007.
[53]
P. Koehn.
Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models, in: 6th Conference Of The Association For Machine Translation In The Americas, Washington, DC, USA, 2004, p. 115-224.
[54]
J. Kupiec.
Robust part-of-speech tagging using a hidden markov model, in: Computer Speech and Language, 1992, vol. 6, p. pp. 225–242.
[55]
Y. Laprie.
A concurrent curve strategy for formant tracking, in: Proc. Int. Conf. on Spoken Language Processing, ICSLP, Jegu, Korea, October 2004.
[56]
C. Lavecchia, D. Langlois, K. Smaïli.
Discovering Phrases in Machine Translation by Simulated Annealing, in: INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association, Brisbane Australie, 2008, p. 2354-2357
http://hal.inria.fr/inria-00331327/en/.
[57]
C. Lavecchia, K. Smaïli, D. Langlois.
Building a bilingual dictionary from movie subtitles based on inter-lingual triggers, in: Translating and the Computer, Londres Royaume-Uni, 2007
http://hal.inria.fr/inria-00184421/en/.
[58]
C. Lavecchia, K. Smaïli, D. Langlois, J.-P. Haton.
Using inter-lingual triggers for Machine translation, in: Eighth conference INTERSPEECH 2007, Antwerp/Belgium, 08 2007
http://hal.inria.fr/inria-00155791/en/.
[59]
S. Maeda.
Un modèle articulatoire de la langue avec des composantes linéaires, in: Actes 10èmes Journées d'Etude sur la Parole, Grenoble, Mai 1979, p. 152-162.
[60]
F. J. Och, H. Ney.
Improved statistical alignment models, in: ACL '00: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, Morristown, NJ, USA, Association for Computational Linguistics, 2000, p. 440–447.
[61]
L. Sprenger-Charolles, P. Colé, D. Béchennec, A. Kipffer-Piquard.
French normative data on reading and related skills from EVALEC, a new computerized battery of tests (end Grade 1, Grade 2, Grade 3, and Grade 4), in: Revue Européenne de Psychologie Appliquée, 2005, p. 157-186
http://hal.inria.fr/inria-00184979/en/.
[62]
L. Sprenger-Charolles, P. Colé, A. Kipffer-Piquard, F. Pinton, C. Billard.
Reliability and prevalence of an atypical development of phonological skills in french-speaking dyslexics, in: Reliability and prevalence of an atypical development of phonological skills in french-speaking dyslexics.
[63]
Y. Stylianou, O. Cappé, E. Moulines.
Continuous probabilistic transform for voice conversion, in: IEEE Transactions on Speech and Audio Processing, March 1998, vol. 6, no 2, p. 131–142.
[64]
A. Toutios, S. Ouni, Y. Laprie.
Protocol for a Model-based Evaluation of a Dynamic Acoustic-to-Articulatory Inversion Method using Electromagnetic Articulography, in: The eighth International Seminar on Speech Production - ISSP'08, Strasbourg France, 2008
http://hal.archives-ouvertes.fr/inria-00336380/en/.

previous
next