Team METISS

Members
Overall Objectives
Scientific Foundations
Application Domains
Software
New Results
Contracts and Grants with Industry
Other Grants and Activities
Dissemination
Bibliography
Inria / Raweb 2004
Project: METISS

Bibliography

Major publications by the team in recent years

[1]
F. Bimbot.
Traitement Automatique du Langage Parlé, collection Information - Commande - Communication (IC2), Hermès, 2002, chap. Reconnaissance Automatique du Locuteur, p. 79-114.
[2]
F. Bimbot, R. Blouet, J.-F. Bonastre, et al.
The ELISA systems for the NIST'99 evaluation in speaker detection and tracking, in: Digital Signal Processing, janvier/avril/juillet 2000, vol. 10, no 1-3, p. 143-­153.
[3]
R. Blouet.
Approche probabiliste par arbres de décision pour la vérification automatique du locuteur sur architectures embarquées, Ph. D. Thesis, Université de Rennes 1, IRISA, Rennes, December 2002.
[4]
G. Gravier, F. Yvon, B. Jacob, F. Bimbot.
Integrating contextual phonological rules in a large vocabulary decoder, in: The Integration of Phonetic Knowledge in Speech Technology, W. van Dommelen, B. Barry (editors), à paraître, Kluwer Academics, 2002.
[5]
R. Gribonval.
Fast Matching Pursuit with a multiscale dictionary of Gaussian Chirps, in: IEEE Trans. Signal Proc., May 2001, vol. 49, no 5, p. 994-1001.
[6]
R. Gribonval.
Approximations non-linéaires pour l'analyse de signaux sonores, Ph. D. Thesis, Université Paris IX Dauphine, September 1999.
[7]
M. Seck, R. Blouet, F. Bimbot.
The IRISA/ELISA speaker detection and tracking systems for the NIST'99 evaluation campaign, in: Digital Signal Processing, janvier/avril/juillet 2000, vol. 10, no 1­3, p. 154-­171.
[8]
M. Seck.
Détection de ruptures et suivi de classe de sons pour l'indexation sonore, Ph. D. Thesis, Université de Rennes 1, IRISA, Rennes, January 2001.

Publications of the year

Doctoral dissertations and Habilitation theses

[9]
M. Ben.
Approches robustes pour la vérification automatique du locuteur par normalisation et adaptation hiérarchique, thèse de doctorat, Université de Rennes 1, IRISA, Rennes, November 2004.

Articles in refereed journals and book chapters

[10]
L. Benaroya, F. Bimbot, R. Gribonval.
Audio source separation with a single sensor, in: IEEE Trans. On Speech and Audio Processing, to appear, 2005.
[11]
F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-Garcia, D. A. Reynolds.
A tutorial on text-independent speaker verification, in: EURASIP Journal on Applied Signal Processing, April 2004, vol. 2004, no 4, p. 430–451.
[12]
F. Bimbot, G. Gravier.
Evaluation des systèmes de reconnaissance de la parole, in: Evaluation des systèmes de traitement de l'information, Traité des Sciences et Techniques de l'Information, Hermes Science Publications, 2004, chap. 8, p. 189–213.
[13]
L. Borup, R. Gribonval, M. Nielsen.
Bi-framelet systems with few vanishing moments characterize Besov spaces, in: Appl. Comp. Harmonic Anal. (special issue on frames in harmonic analysis), 2004, vol. 17, no 1–2.
[14]
L. Borup, R. Gribonval, M. Nielsen.
Tight wavelet frames in Lebesgue and Sobolev spaces, in: J. Function Spaces and Appl., 2004, vol. 2, no 3, p. 227–252.
[15]
M. Dutat, I. Magrin-Chagnolleau, F. Bimbot.
Acoustic Modeling of Spoken Languages using Time-Frequency Principal Component Analysis and Hidden Markov Models : Application to Language Identification, in: IEEE Trans. Signal and Audio Processing, to appear, 2005.
[16]
G. Gravier, F. Yvon, B. Jacob, F. Bimbot.
Integrating contextual phonological rules in a large vocabulary decoder, in: Integration of Phonetic Knowledge In Speech Technology, W. van Dommelen, B. Barry (editors), Kluwer Academic, 2004.
[17]
R. Gribonval, M. Nielsen.
Nonlinear approximation with dictionaries. I. Direct estimates, in: J. of Fourier Anal. and Appl., 2004, vol. 10, no 1.
[18]
R. Gribonval, M. Nielsen.
On a problem of Gröchenig about nonlinear approximation with localized frames, in: J. of Fourier Anal. and Appl., 2004, vol. 10, no 4.
[19]
R. Gribonval, M. Nielsen.
On approximation with spline generated framelets, in: Constructive Approx., January 2004, vol. 20, no 2, p. 207–232.
[20]
E. Kijak, G. Gravier, L. Oisel, P. Gros.
Audiovisual integration for tennis broadcast structuring, in: Multimedia Tools and Application, 2004.

Publications in Conferences and Workshops

[21]
M. Ben, M. Betser, F. Bimbot, G. Gravier.
Speaker Diarization using bottom-up clustering based on a Parameter-derived Distance between adapted GMMs, in: Intl. Conf. on Speech and Language Processing, 2004.
[22]
M. Ben, G. Gravier, F. Bimbot.
Enhancing the robustness of Bayesian adaptation for text-independent speaker verification, in: Odyssey'04 Speaker and Language Recognition Workshop, 2004.
[23]
M. Betser, G. Gravier.
Multiple events tracking in sound tracks, in: Intl. Conf. on Multimedia and Exhibition, 2004.
[24]
M. Betser, G. Gravier.
Suivi d'événements sonores multiples dans les documents audiovisuels, in: Compression et Représentation des Signaux Audiovisuels (CORESA), 2004.
[25]
J.-F. Bonastre, F. Bimbot, L.-J. Boë, J. Campbell, D. Reynolds, I. Magrin-Chagnolleau.
Authentification des personnes par leur voix : un nécessaire devoir de précaution, in: Journées d'Etude sur la Parole (JEP04), Fès, Maroc, 2004.
[26]
F. Coldefy, M. Betser, G. Gravier, P. Bouthémy.
Tennis video abstraction from audio and visual cues, in: IEEE Intl. Workshop on Multimedia Signal Processing, 2004no.
[27]
G. Gravier, L. Benaroya, A. Ozerov, R. Gribonval, F. Bimbot.
Séparation de sources à partir d'un seul capteur pour la reconnaissance robuste de la parole, in: Journées d'Etude sur la Parole (JEP04), Fès, Maroc, 2004.
[28]
G. Gravier, J. Bonastre, S. Galliano, E. Geoffrois, K. M. Tait, K. Choukri.
The ESTER evaluation campaign of Rich Transcription of French Broadcast News, in: Language Evaluation and Resources Conference, 2004.
[29]
G. Gravier, J.-F. Bonastre, S. Galliano, E. Geoffrois, K. M. Tait, K. Choukri.
ESTER, une campagne d'évaluation des systèmes d'indexation d'émissions radiophoniques, in: Journées d'Etude sur la Parole (JEP04), 2004.
[30]
R. Gribonval, M. Nielsen.
On the strong uniqueness of highly sparse expansions from redundant dictionaries, in: Proc. Int Conf. Independent Component Analysis (ICA'04), LNCS series, Springer-Verlag, September 2004.
[31]
P. Gros, E. Kijak, G. Gravier.
Automatic video structuring based on HMMs and audiovisual integration, in: 2nd International Symposium on Image/Video Communications over fixed and mobile networks, 2004no.
[32]
J. Walker, F. Bimbot, L. Benaroya.
Experimental Evaluation of Audio Source Separation with One Sensor, in: Mathematics in Signal Processing IV, Cirencester (UK), December, 2004.

Internal Reports

[33]
R. Gribonval, R. Figueras, P. Vandergheynst.
A simple test to check the optimality of a sparse signal approximation, Technical report, IRISA, November 2004, no 1661.
[34]
R. Gribonval, P. Vandergheynst.
On the exponential convergence of Matching Pursuits in quasi-incoherent dictionaries, Technical report, IRISA, April 2004, no 1619.

References in notes

[35]
Action Jeunes Chercheurs du GDR ISIS (CNRS).
Ressources pour la séparation de signaux audiophoniques, 2002-2003,
http://www.ircam.fr/anasyn/ISIS/.
[36]
M. Ben, G. Gravier, A. Ozerov, F. Bimbot.
IRISA 2003 speaker recognition system - 1sp speaker detection, limited data, in: Proc. NIST Workshop on Speaker Verification, 2003.
[37]
L. Benaroya, F. Bimbot.
Wiener based source separation with HMM/GMM using a single sensor, in: Proc. 4th Int. Symp. on Independent Component Anal. and Blind Signal Separation (ICA2003), Nara, Japan, April 2003, p. 957–961.
[38]
L. Benaroya, F. Bimbot, G. Gravier, R. Gribonval.
Audio source separation with one sensor for robust speech recognition, in: ISCA Tutorial and Research Workshop on Non-Linear Speech Processing, 2003.
[39]
L. Benaroya, L. McDonagh, F. Bimbot, R. Gribonval.
Non negative sparse representation for Wiener based source separation with a single sensor, in: Proc. IEEE Intl. Conf. Acoust. Speech Signal Process (ICASSP'03), Hong-Kong, April 2003, p. 613–616.
[40]
M. Betser, G. Gravier, R. Gribonval.
Extraction of information from video sound tracks - Can we detect simultaneous events?, in: Conference on Content-Based Multimedia Indexing, 2003, p. 71–78.
[41]
R. Boite, H. Bourlard, T. Dutoit, J. Hancq, H. Leich.
Traitement de la Parole, Presses Polytechniques et Universitaires Romandes, 2000.
[42]
J.-F. Bonastre, F. Bimbot, L.-J. Boë, J. C. bell, D. Reynolds, I. Magrin-Chagnolleau.
Person Authentication by Voice : A Need For Caution, in: Proc. Eurospeech'03, Genève, 2003.
[43]
H. Bourlard, S. Dupont, C. Ris.
Multi-stream speech recognition, Research Report, IDIAP, Dec. 1996, no RR 96-07.
[44]
S. Dupont, J. Luettin.
Audio-Visual Speech Modeling for Continuous Speech Recognition, in: IEEE Trans. on Multimedia, September 2000, vol. 2, no 3, p. 141–151.
[45]
J.-L. Gauvain, C.-H. Lee.
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, in: IEEE Trans. on Speech and Audio Processing, April 1994, vol. 2, no 2.
[46]
A. Gilbert, S. Muthukrishnan, M. Strauss.
Approximation of Functions over Redundant Dictionaries Using coherence, in: The 14th ACM-SIAM Symposium on Discrete Algorithms (SODA'03), January 2003.
[47]
A. Gilbert, S. Muthukrishnan, M. Strauss, J. Tropp.
Improved sparse approximation over quasi-incoherent dictionaries, in: Int. Conf. on Image Proc. (ICIP'03), Barcelona, Spain, sep 2003.
[48]
G. Gravier, F. Yvon, B. Jacob, F. Bimbot.
Sirocco, un système ouvert de reconnaissance de la parole, in: Journées d'étude sur la parole, Nancy, June 2002, p. 273-276.
[49]
R. Gribonval, E. Bacry.
Harmonic Decomposition of Audio Signals with Matching Pursuit, in: IEEE Trans. Signal Proc., jan 2003, vol. 51, no 1.
[50]
R. Gribonval.
Fast Matching Pursuit with a multiscale dictionary of Gaussian Chirps, in: IEEE Trans. Signal Proc., May 2001, vol. 49, no 5, p. 994-1001.
[51]
R. Gribonval.
Sparse decomposition of stereo signals with Matching Pursuit and application to blind separation of more than two sources from a stereo mixture, in: Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP'02), Orlando, Florida, May 2002.
[52]
R. Gribonval.
Approximations non-linéaires pour l'analyse de signaux sonores, Ph. D. Thesis, Université Paris IX Dauphine, September 1999.
[53]
R. Gribonval, M. Nielsen.
Highly sparse representations from dictionaries are unique and independent of the sparseness measure, submitted to Appl. Comp. Harmonic Anal., Dept of Math. Sciences, Aalborg University, October 2003, no R-2003-16.
[54]
R. Gribonval, M. Nielsen.
Sparse decompositions in unions of bases, in: IEEE Trans. Inform. Theory, December 2003, vol. 49, no 12, p. 3320–3325.
[55]
K. Gröchenig.
Localization of frames, Banach frames, and the invertibility of the frame operator, in: J. Fourier Anal. Appl., to appear, 2003.
[56]
F. Jelinek.
Statistical Methods for Speech Recognition, MIT Press, Cambridge, Massachussetts, 1998.
[57]
E. Kijak, G. Gravier, P. Gros, L. Oisel, F. Bimbot.
HMM based structuring of tennis videos using visual and audio cues, in: Proc. Intl. Conf. on Multimedia and Exhibition, 2003.
[58]
S. Mallat.
A Wavelet Tour of Signal Processing, 2, Academic Press, San Diego, 1999.
[59]
National Institute of Standards and Technology.
The 2003 NIST Speaker Recognition Evaluation, 2003,
http://www.nist.gov/speech/tests/spk/2003/.
[60]
S. Ortmanns, H. Ney.
A word graph algorithm for large vocabulary continuous speech recognition, in: Computer Speech and Language, 1997, vol. 11, p. 43-72.
[61]
G. Potamianos, C. Neti, G. Gravier, A. Garg, A. W. Senior.
Recent advances in the automatic recognition of audio-visual speech, in: IEEE Proceedings, September 2003, vol. 91, no 9, p. 1306–1326.
[62]
A. Reynolds, T. Quatieri, R. Dunn.
Speaker Verification Using Adapted Gaussian Mixture Models, in: Digital Signal Processing Vol 10,num 1-3, 2000.
[63]
J. Tropp.
Greed is good : Algorithmic results for sparse approximation, Technical report, Texas Institute for Computational Engineering and Sciences, 2003.
[64]
L. Villemoes.
Nonlinear Approximation with Walsh Atoms, in: Proceedings of ``Surface Fitting and Multiresolution Methods'', Chamonix 1996, A. Le M'ehaut'e, C. Rabut, L. Schumaker (editors), Vanderbilt University Press, 1997, p. 329–336.

previous
next