Team METISS

Members
Overall Objectives
Scientific Foundations
Application Domains
Software
New Results
Contracts and Grants with Industry
Other Grants and Activities
Dissemination
Bibliography
Inria / Raweb 2003
Project: METISS

Bibliography

Major publications by the team in recent years

[1]
F. Bimbot .
Traitement Automatique du Langage Parlé, collection Information - Commande - Communication (IC2), Hermès, 2002, chap. Reconnaissance Automatique du Locuteur, p. 79-114.
[2]
F. Bimbot , R. Blouet, J.-F. Bonastre, et al.
The ELISA systems for the NIST'99 evaluation in speaker detection and tracking, in: Digital Signal Processing, janvier/avril/juillet 2000, vol. 10, no 1-3, p. 143-­153.
[3]
R. Blouet.
Approche probabiliste par arbres de décision pour la vérification automatique du locuteur sur architectures embarquées, Ph. D. Thesis, Université de Rennes 1, IRISA, Rennes, December 2002.
[4]
G. Gravier , F. Yvon, B. Jacob, F. Bimbot .
Integrating contextual phonological rules in a large vocabulary decoder, in: The Integration of Phonetic Knowledge in Speech Technology, W. van Dommelen, B. Barry (editors), à paraître, Kluwer Academics, 2002.
[5]
R. Gribonval .
Fast Matching Pursuit with a multiscale dictionary of Gaussian Chirps, in: IEEE Trans. Signal Proc., May 2001, vol. 49, no 5, p. 994-1001.
[6]
R. Gribonval .
Approximations non-linéaires pour l'analyse de signaux sonores, Ph. D. Thesis, Université Paris IX Dauphine, September 1999.
[7]
M. Seck, R. Blouet, F. Bimbot .
The IRISA/ELISA speaker detection and tracking systems for the NIST'99 evaluation campaign, in: Digital Signal Processing, janvier/avril/juillet 2000, vol. 10, no 1­3, p. 154-­171.
[8]
M. Seck.
Détection de ruptures et suivi de classe de sons pour l'indexation sonore, Ph. D. Thesis, Université de Rennes 1, IRISA, Rennes, January 2001.

Year Publications

Doctoral dissertations and Habilitation theses

[9]
L. Benaroya .
Séparation de plusieurs sources sonores avec un capteur, thèse de doctorat, Université de Rennes 1, IRISA, Rennes, June 2003.

Articles in refereed journals and book chapters

[10]
R. Gribonval , E. Bacry.
Harmonic Decomposition of Audio Signals with Matching Pursuit, in: IEEE Trans. Signal Proc., jan 2003, vol. 51, no 1, p. 101–111.
[11]
R. Gribonval , M. Nielsen.
Nonlinear approximation with dictionaries. I. Direct estimates., in: J. Fourier Anal. and Appl., to appear, 2003.
[12]
R. Gribonval , M. Nielsen.
On approximation with spline generated framelets, in: Constr. Approx., published online on July 7th, printed version to appear, 2003.
[13]
R. Gribonval , M. Nielsen.
Sparse decompositions in unions of bases, in: IEEE Trans. Inform. Theory, December 2003, vol. 49, no 12, p. 3320–3325.
[14]
G. Potamianos, C. Neti, G. Gravier , A. Garg, A. W. Senior.
Recent advances in the automatic recognition of audio-visual speech, in: IEEE Proceedings, September 2003, vol. 91, no 9, p. 1306–1326.

Publications in Conferences and Workshops

[15]
M. Ben , F. Bimbot .
D-MAP: a distance normalized MAP estimation of speaker models for automatic speaker verification, in: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, 2003.
[16]
M. Ben , G. Gravier , A. Ozerov , F. Bimbot .
IRISA 2003 speaker recognition system - 1sp speaker detection, limited data, in: Proc. NIST Workshop on Speaker Verification, 2003.
[17]
L. Benaroya , F. Bimbot .
Wiener based source separation with HMM/GMM using a single sensor, in: Proc. 4th Int. Symp. on Independent Component Anal. and Blind Signal Separation (ICA2003), Nara, Japan, April 2003, p. 957–961.
[18]
L. Benaroya , F. Bimbot , G. Gravier , R. Gribonval .
Audio source separation with one sensor for robust speech recognition, in: ISCA Tutorial and Research Workshop on Non-Linear Speech Processing, 2003.
[19]
L. Benaroya , L. McDonagh , F. Bimbot , R. Gribonval .
Non negative sparse representation for Wiener based source separation with a single sensor, in: Proc. IEEE Intl. Conf. Acoust. Speech Signal Process (ICASSP'03), Hong-Kong, April 2003, p. 613–616.
[20]
M. Betser , G. Gravier , R. Gribonval .
Extraction of information from video sound tracks - Can we detect simultaneous events?, in: Conference on Content-Based Multimedia Indexing, 2003, p. 71–78.
[21]
J.-F. Bonastre, F. Bimbot , L.-J. Boë, J. C. bell, D. Reynolds, I. Magrin-Chagnolleau.
Person Authentication by Voice : A Need For Caution, in: Proc. Eurospeech'03, Genève, 2003.
[22]
R. Gribonval , L. Benaroya , E. Vincent, C. Févotte.
Proposals for Performance Measurement in Source Separation, in: Proc. 4th Int. Symp. on Independent Component Anal. and Blind Signal Separation (ICA2003), Nara, Japan, April 2003, p. 763–768.
[23]
R. Gribonval .
Piecewise Linear Separation, in: Wavelets: Applications in Signal and Image Processing X, Proc. SPIE '03, San Diego, CA, M. Unser, A. Aldroubi, A. Laine (editors), August 2003, vol. 5207.
[24]
R. Gribonval , M. Nielsen.
Approximation with highly redundant dictionaries, in: Wavelets: Applications in Signal and Image Processing X, Proc. SPIE '03, San Diego, CA, M. Unser, A. Aldroubi, A. . Laine (editors), August 2003, vol. 5207.
[25]
R. Gribonval , M. Nielsen.
Sparse Decompositions in ``incoherent'' dictionaries, in: Proc. IEEE Intl. Conf. Image Proc. (ICIP'03), Barcelona, Spain, September 2003.
[26]
C. Jutten, R. Gribonval .
L'analyse en composantes indépendantes: un outil puissant pour le traitement de l'information, in: Proc. GRETSI 2003, ENST, Paris, France, September 2003, vol. I, 11 p.
[27]
E. Kijak , G. Gravier , P. Gros, L. Oisel, F. Bimbot .
HMM based structuring of tennis videos using visual and audio cues, in: Proc. Intl. Conf. on Multimedia and Exhibition, 2003no.
[28]
E. Kijak , G. Gravier , L. Oisel, P. Gros.
Audiovisual Integration for Tennis Broadcast Structuring, in: Conference on Content-Based Multimedia Indexing, 2003, p. 421–428.
[29]
E. Kijak , G. Gravier , L. Oisel, P. Gros.
Structuration multimodale d'une vidéo de tennis par modèles de Markov cachés, in: GRETSI, 2003no.
[30]
L. McDonagh , F. Bimbot , R. Gribonval .
A granular approach for the analysis of monophonic audio signals, in: Proc. IEEE Intl. Conf. Acoust. Speech Signal Process (ICASSP'03), Hong-Kong, April 2003, p. 469–472.
[31]
E. Vincent, C. Févotte, R. Gribonval , ETAL .
Comment évaluer les algorithmes de séparation de sources audio?, in: Proc. GRETSI 2003, ENST, Paris, France, September 2003, vol. I, 27 p.
[32]
E. Vincent, C. Févotte, R. Gribonval , X. Rodet, E. Le Carpentier, L. Benaroya , A. Röbel, F. Bimbot .
A Tentative Typology of Audio Source Separation Tasks, in: Proc. 4th Int. Symp. on Independent Component Anal. and Blind Signal Separation (ICA2003), Nara, Japan, April 2003, p. 715–720.

Internal Reports

[33]
L. Borup, R. Gribonval , M. Nielsen.
Bi-framelet systems with few vanishing moments characterize Besov spaces, submitted to Appl. Comp. Harmonic Anal. in October 2003, Technical report, Dept of Math. Sciences, Aalborg University, November 2003, no R-2003-18.
[34]
L. Borup, R. Gribonval , M. Nielsen.
Tight wavelet frames in Lebesgue and Sobolev spaces, submitted to J. Function Spaces, Technical report, Aalborg Univ., Dept of Math., March 2003, no R-2003-05.
[35]
R. Gribonval , M. Nielsen.
Highly sparse representations from dictionaries are unique and independent of the sparseness measure, submitted to Appl. Comp. Harmonic Anal., Technical report, Dept of Math. Sciences, Aalborg University, October 2003, no R-2003-16.
[36]
R. Gribonval , M. Nielsen.
On a problem of Gröchenig about nonlinear approximation with localized frames, submitted to J. Fourier Anal. and Appl. in October 2003, Technical report, Dept of Math. Sciences, Aalborg University, November 2003, no R-2003-19.

References in notes

[37]
Action Jeunes Chercheurs du GDR ISIS (CNRS).
Ressources pour la séparation de signaux audiophoniques, 2002-2003 http://www.ircam.fr/anasyn/ISIS/.
[38]
R. Boite, H. Bourlard, T. Dutoit, J. Hancq, H. Leich.
Traitement de la Parole, Presses Polytechniques et Universitaires Romandes, 2000.
[39]
H. Bourlard, S. Dupont, C. Ris.
Multi-stream speech recognition, Research Report, IDIAP, Dec. 1996, no RR 96-07.
[40]
C. K. Chui, W. He, J. Stöckler.
Compactly supported tight and sibling frames with maximum vanishing moments, in: Appl. Comput. Harmon. Anal., 2002, vol. 13, no 3, p. 224–262.
[41]
I. Daubechies, B. Han, A. Ron, Z. Shen.
Framelets: MRA-based constructions of wavelet frames, in: Applied and Computational Harmonic Analysis, 2003, vol. 14, no 1, p. 1–46.
[42]
S. Dupont, J. Luettin.
Audio-Visual Speech Modeling for Continuous Speech Recognition, in: IEEE Trans. on Multimedia, September 2000, vol. 2, no 3, p. 141–151.
[43]
J. Foote, M. Cooper.
Media Segmentation using Self-Similarity Decomposition, in: Proc. SPIE Storage and Retrieval for Multimedia Databases , Vol. 5021, January 2003, p. 167-75.
[44]
J.-L. Gauvain, C.-H. Lee.
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains, in: IEEE Trans. on Speech and Audio Processing, April 1994, vol. 2, no 2.
[45]
A. Gilbert, S. Muthukrishnan, M. Strauss.
Approximation of Functions over Redundant Dictionaries Using coherence, in: The 14th ACM-SIAM Symposium on Discrete Algorithms (SODA'03), January 2003.
[46]
A. Gilbert, S. Muthukrishnan, M. Strauss, J. Tropp.
Improved sparse approximation over quasi-incoherent dictionaries, in: Int. Conf. on Image Proc. (ICIP'03), Barcelona, Spain, sep 2003.
[47]
G. Gravier , F. Yvon, B. Jacob, F. Bimbot .
Sirocco, un système ouvert de reconnaissance de la parole, in: Journées d'étude sur la parole, Nancy, June 2002, p. 273-276.
[48]
R. Gribonval , E. Bacry.
Harmonic Decomposition of Audio Signals with Matching Pursuit, in: IEEE Trans. Signal Proc., jan 2003, vol. 51, no 1.
[49]
R. Gribonval .
Fast Matching Pursuit with a multiscale dictionary of Gaussian Chirps, in: IEEE Trans. Signal Proc., May 2001, vol. 49, no 5, p. 994-1001.
[50]
R. Gribonval .
Sparse decomposition of stereo signals with Matching Pursuit and application to blind separation of more than two sources from a stereo mixture, in: Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP'02), Orlando, Florida, May 2002.
[51]
R. Gribonval .
Approximations non-linéaires pour l'analyse de signaux sonores, Ph. D. Thesis, Université Paris IX Dauphine, September 1999.
[52]
K. Gröchenig.
Localization of frames, Banach frames, and the invertibility of the frame operator, in: J. Fourier Anal. Appl., to appear, 2003.
[53]
H. Hermansky, N. Morgan.
RASTA processing of speech, in: IEEE Trans. on Speech and Audio, 1994, vol. 2, no 4, p. 578–589.
[54]
F. Jelinek.
Statistical Methods for Speech Recognition, MIT Press, Cambridge, Massachussetts, 1998.
[55]
L. F. Lamel, J.-L. Gauvain, M. Eskénazi.
BREF, a large vocabulary spoken corpus for French, in: Proc. European Conf. on Speech Processing (EUROSPEECH'91), 1991, p. 505–508.
[56]
S. Mallat.
A Wavelet Tour of Signal Processing, 2, Academic Press, San Diego, 1999.
[57]
National Institute of Standards and Technology.
The 2003 NIST Speaker Recognition Evaluation, 2003 http://www.nist.gov/speech/tests/spk/2003/.
[58]
S. Ortmanns, H. Ney.
A word graph algorithm for large vocabulary continuous speech recognition, in: Computer Speech and Language, 1997, vol. 11, p. 43-72.
[59]
G. Peeters, X. Rodet.
SINOLA: A New Analysis/Synthesis Method using Spectrum Peak Shape Distortion, Phase and Reassigned Spectrum, in: Proc. Int. Computer Music Conf. (ICMC'99), Beijing, ICMC, 1999.
[60]
A. Reynolds, T. Quatieri, R. Dunn.
Speaker Verification Using Adapted Gaussian Mixture Models, in: Digital Signal Processing Vol 10,num 1-3, 2000.
[61]
J. Tropp.
Greed is good : Algorithmic results for sparse approximation, Technical report, Texas Institute for Computational Engineering and Sciences, 2003.
[62]
L. Villemoes.
Nonlinear Approximation with Walsh Atoms, in: Proceedings of ``Surface Fitting and Multiresolution Methods'', Chamonix 1996, A. Le M'ehaut'e, C. Rabut, L. Schumaker (editors), Vanderbilt University Press, 1997, p. 329–336.
[63]
A. Zils, F. Pachet.
Musical Mosaicing, in: Proceedings of DAFX '01, University of Limerick, December 2001.

previous
next