Team METISS


Bibliography

Major publications by the team in recent years

[1]
S. Arberet.
Estimation robuste et apprentissage aveugle de modèles pour la séparation de sources sonores, Université de Rennes I, December 2008, Ph.D. thesis.
[2]
L. Borup, R. Gribonval, M. Nielsen.
Beyond coherence: recovering structured time-frequency representations, in: Appl. Comput. Harmon. Anal., 2008, vol. 24, no 1, p. 120–128.
[3]
S. Galliano, E. Geoffrois, D. Mostefa, K. Choukri, J.-F. Bonastre, G. Gravier.
The ESTER Phase II Evaluation Campaign for the Rich Transcription of French Broadcast News, in: European Conference on Speech Communication and Technology, 2005.
[4]
R. Gribonval, R. M. Figueras i Ventura, P. Vandergheynst.
A simple test to check the optimality of sparse signal approximations, in: EURASIP Signal Processing, special issue on Sparse Approximations in Signal and Image Processing, March 2006, vol. 86, no 3, p. 496–510.
[5]
R. Gribonval.
Sur quelques problèmes mathématiques de modélisation parcimonieuse, Université de Rennes I, October 2007, Habilitation à Diriger des Recherches, specialty “Mathématiques”.
[6]
R. Gribonval, M. Nielsen.
On approximation with spline generated framelets, in: Constructive Approx., January 2004, vol. 20, no 2, p. 207–232.
[7]
R. Gribonval, M. Nielsen.
Beyond sparsity: recovering structured representations by $\ell^1$-minimization and greedy algorithms, in: Advances in Computational Mathematics, January 2008, vol. 28, no 1, p. 23–41.
[8]
R. Gribonval, H. Rauhut, K. Schnass, P. Vandergheynst.
Atoms of all channels, unite! Average case analysis of multi-channel sparse recovery using greedy algorithms, in: J. Fourier Anal. Appl., December 2008, vol. 14, no 5, p. 655–687.
[9]
R. Gribonval, K. Schnass.
Dictionary identifiability from few training samples, in: Proc. European Conf. on Signal Processing - EUSIPCO, August 2008.
[10]
R. Gribonval, K. Schnass.
Some recovery conditions for basis learning by l1-minimization, in: 3rd IEEE International Symposium on Communications, Control and Signal Processing - ISCCSP 2008, March 2008, p. 768–773.
[11]
R. Gribonval, P. Vandergheynst.
On the exponential convergence of Matching Pursuits in quasi-incoherent dictionaries, in: IEEE Trans. Information Theory, January 2006, vol. 52, no 1, p. 255–261
http://dx.doi.org/10.1109/TIT.2005.860474.
[12]
S. Huet, G. Gravier, P. Sébillot.
Un modèle multi-sources pour la segmentation en sujets de journaux radiophoniques, in: Proc. Traitement Automatique des Langues Naturelles, 2008, p. 49–58.
[13]
E. Kijak, G. Gravier, L. Oisel, P. Gros.
Audiovisual integration for tennis broadcast structuring, in: Multimedia Tools and Applications, 2006, vol. 30, no 3, p. 289–312.
[14]
B. Mailhé, S. Lesage, R. Gribonval, P. Vandergheynst, F. Bimbot.
Shift–invariant dictionary learning for sparse representations: extending K–SVD, in: Proc. European Conf. on Signal Processing - EUSIPCO, August 2008.
[15]
A. Ozerov, P. Philippe, F. Bimbot, R. Gribonval.
Adaptation of Bayesian models for single channel source separation and its application to voice / music separation in popular songs, in: IEEE Trans. Audio, Speech and Language Processing, July 2007, vol. 15, no 5, p. 1564–1578.
[16]
A. Rosenberg, F. Bimbot, S. Parthasarathy.
Overview of Speaker Recognition (chapter 36), Springer, 2008, p. 725–741.
[17]
E. Vincent, R. Gribonval, C. Févotte.
Performance measurement in Blind Audio Source Separation, in: IEEE Trans. Audio, Speech and Language Processing, 2006, vol. 14, no 4, p. 1462–1469
http://dx.doi.org/10.1109/TSA.2005.858005.
[18]
E. Vincent, M. Plumbley.
Low bitrate object coding of musical audio using Bayesian harmonic models, in: IEEE Trans. on Audio, Speech and Language Processing, 2007, vol. 15, no 4, p. 1273–1282.

Publications of the year

Doctoral Dissertations and Habilitation Theses

[19]
G. Gravier.
Intégration de connaissances par modèles probabilistes pour l'analyse de documents multimédias, Université de Rennes 1, 2009, Habilitation à Diriger des Recherches.
[20]
B. Mailhé.
Modèles et algorithmes pour la représentation parcimonieuse de signaux de grandes dimensions, Université de Rennes I, December 2009, Ph.D. thesis.

Articles in International Peer-Reviewed Journals

[21]
S. Arberet, R. Gribonval, F. Bimbot.
A Robust Method to Count and Locate Audio Sources in a Multichannel Underdetermined Mixture, in: IEEE Trans. on Signal Processing, 2010, vol. 58, 14 pages.
[22]
R. Badeau, N. Bertin, E. Vincent.
On the stability of multiplicative update algorithms. Application to non-negative matrix factorization, in: IEEE Trans. on Neural Networks, 2010, To appear.
[23]
N. Bertin, R. Badeau, E. Vincent.
Enforcing harmonicity and smoothness in Bayesian non-negative matrix factorization applied to polyphonic music transcription, in: IEEE Trans. on Audio, Speech and Language Processing, 2010, To appear.
[24]
M. E. Davies, R. Gribonval.
Restricted Isometry Constants where $\ell^p$ sparse recovery can fail for $0 < p \leq 1$, in: IEEE Trans. Inform. Theory, May 2009, vol. 55, no 5, p. 2203–2214.
[25]
N. Duong, E. Vincent, R. Gribonval.
Under-determined audio source separation with a new modeling of the convolutive mixing process, in: IEEE Trans. on Audio, Speech and Language Processing, 2010, Submitted.
[26]
V. Emiya, E. Vincent, N. Harlander, V. Hohmann.
Subjective assessment of audio source separation and objective measures using subband-based distortion extraction, in: IEEE Trans. on Audio, Speech and Language Processing, 2010, Submitted.
[27]
G. Gonon, F. Bimbot, R. Gribonval.
Probabilistic scoring using decision trees for fast and scalable speaker recognition, in: Speech Communication, November 2009, vol. 51, no 11, p. 1065–1081.
[28]
D. K. Hammond, P. Vandergheynst, R. Gribonval.
Wavelets on Graphs via Spectral Graph Theory, in: Applied and Computational Harmonic Analysis, 2010, submitted.
[29]
S. Huet, G. Gravier, P. Sébillot.
Morpho-syntactic post-processing with N-best lists for improved French automatic speech recognition, in: Computer Speech and Language, October 2009, 22 pages, doi:10.1016/j.csl.2009.10.001.
[30]
M. Kowalski, E. Vincent, R. Gribonval.
Beyond the narrowband approximation: Wideband convex methods for under-determined reverberant audio source separation, in: IEEE Trans. on Audio, Speech and Language Processing, 2010, Submitted.
[31]
R. Tavenard, L. Amsaleg, G. Gravier.
Model-based similarity estimation of multidimensional temporal sequences, in: Annals of Telecommunications, 2009, vol. 64, no 5–6, p. 381–390.
[32]
E. Vincent, N. Bertin, R. Badeau.
Adaptive harmonic spectral decomposition for multiple pitch estimation, in: IEEE Trans. on Audio, Speech and Language Processing, 2010, To appear.

International Peer-Reviewed Conference/Proceedings

[33]
N. Bertin, R. Badeau, E. Vincent.
Fast Bayesian NMF algorithms enforcing harmonicity and temporal continuity in polyphonic music transcription, in: Proc. 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2009.
[34]
N. Bertin, E. Vincent, R. Badeau.
Fast Bayesian constrained NMF for polyphonic pitch transcription, in: Proc. 5th Music Information Retrieval Evaluation eXchange (MIREX), International Society for Music Information Retrieval, 2009.
[35]
M. E. Davies, R. Gribonval.
On Lp minimisation, instance optimality, and restricted isometry constants for sparse approximation, in: Proc. SAMPTA'09 (Sampling Theory and Applications), Marseille, France, May 2009.
[36]
M. E. Davies, R. Gribonval.
The Restricted Isometry Property and $\ell^p$ sparse recovery failure, in: Proc. SPARS'09 (Signal Processing with Adaptive Sparse Structured Representations), Saint-Malo, France, April 2009.
[37]
N. Duong, E. Vincent, R. Gribonval.
Spatial covariance models for under-determined reverberant audio source separation, in: Proc. 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2009.
[38]
N. Duong, E. Vincent, R. Gribonval.
Under-determined convolutive blind source separation using spatial covariance models, in: Proc. 2010 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2010, Submitted.
[39]
V. Emiya, E. Vincent, R. Gribonval.
An investigation of discrete-state discriminant approaches to single-sensor source separation, in: Proc. 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2009.
[40]
S. Galliano, G. Gravier, L. Chaubard.
The ESTER 2 evaluation campaign for the rich transcription of French radio broadcasts, in: Conf. of the Intl. Speech Communication Association (Interspeech), Brighton, UK, September 2009, p. 2583–2586.
[41]
C. Guinaudeau, G. Gravier, P. Sébillot.
Can automatic speech transcripts be used for large scale TV stream description and structuring?, in: First International Workshop on Content-Based Audio/Video Analysis for Novel TV Services, San Diego, CA, USA, December 2009, In conjunction with the International IEEE Symposium on Multimedia.
[42]
N. Ito, N. Ono, E. Vincent, S. Sagayama.
Designing the Wiener post-filter for diffuse noise suppression using imaginary parts of inter-channel cross-spectra, in: Proc. 2010 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2010, Submitted.
[43]
G. Lecorvé, G. Gravier, P. Sébillot.
Constraint selection for topic-based MDI adaptation of language models, in: Proceedings of the International Conference on Speech and Language Technology (Interspeech'09), Brighton, UK, September 2009, p. 368–371.
[44]
B. Mailhé, R. Gribonval, P. Vandergheynst, F. Bimbot.
A low–complexity Orthogonal Matching Pursuit for Sparse Signal Approximation with Shift–Invariant Dictionaries, in: Proc. IEEE ICASSP, April 2009.
[45]
B. Mailhé, M. Lemay, R. Gribonval, P. Vandergheynst, J.-M. Vesin, F. Bimbot.
Dictionary learning for the sparse modelling of atrial fibrillation in ECG signals, in: Proc. IEEE ICASSP, April 2009.
[46]
A. Muscariello, G. Gravier, F. Bimbot.
Audio keyword extraction by unsupervised word discovery, in: Conf. of the Intl. Speech Communication Association (Interspeech), Brighton, UK, September 2009, p. 2843–2846.
[47]
A. Muscariello, G. Gravier, F. Bimbot.
Variability tolerant motif discovery, in: Intl. Multimedia Model Conference, B. Huet et al. (editors), 2009.
[48]
F. M. Naini, R. Gribonval, L. Jacques, P. Vandergheynst.
Compressive sampling of pulse trains: Spread the spectrum!, in: Proc. ICASSP, 2009.
[49]
A. Nesbit, E. Vincent, M. Plumbley.
Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation, in: Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2009.
[50]
A. Nesbit, E. Vincent, M. Plumbley.
Extension of sparse, adaptive signal decompositions to semi-blind audio source separation, in: Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA), 2009, p. 605–612.
[51]
M. Puigt, E. Vincent, Y. Deville.
Validity of the independence assumption for the separation of instantaneous and convolutive mixtures of speech and music sources, in: Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA), 2009, p. 613–620.
[52]
J. Le Roux, H. Kameoka, E. Vincent, N. Ono, K. Kashino, S. Sagayama.
Complex NMF under spectrogram consistency constraints, in: Proc. of the Acoustical Society of Japan (ASJ) Autumn Meeting, 2009.
[53]
J. Le Roux, H. Kameoka, E. Vincent, N. Ono, K. Kashino, S. Sagayama.
Complex NMF with spectrogram consistency constraints, in: Proc. 2010 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2010, Submitted.
[54]
R. Scholz, E. Vincent, F. Bimbot.
Robust modeling of musical chord sequences using probabilistic N-grams, in: Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2009.
[55]
W. X. Teng, G. Gravier, F. Bimbot, F. Soufflet.
Speaker Adaptation by Variable Reference Model Subspace and Application to Large Vocabulary Speech Recognition, in: IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, April 2009, p. 4381–4384.
[56]
E. Vincent, S. Araki, P. Bofill.
The 2008 Signal Separation Evaluation Campaign: A community-based approach to large-scale evaluation, in: Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA), 2009, p. 734–741.
[57]
E. Vincent, S. Arberet, R. Gribonval.
Underdetermined instantaneous audio source separation via local Gaussian modeling, in: Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA), 2009, p. 775–782.
[58]
E. Vincent.
Audio source separation using hierarchical phase-invariant models, in: Proc. 2009 ISCA Tutorial and Research Workshop on Non-linear Speech Processing (NOLISP), 2009.

National Peer-Reviewed Conference/Proceedings

[59]
S. Baghdadi, G. Gravier, C.-H. Demarty, P. Gros.
Apprentissage de structure dans les réseaux bayésiens pour la détection d'événements vidéo, in: Traitement et Analyse de l'Information : Méthodes et Applications, 2009.
[60]
V. Emiya, E. Vincent, R. Gribonval.
Estimateurs oracles pour la séparation de sources monocapteur par approches spectrales à états discrets, in: Proc. 22e colloque GRETSI sur le Traitement du Signal et des Images, 2009.
[61]
B. Mailhé, R. Gribonval, F. Bimbot, P. Vandergheynst.
LocOMP: algorithme localement orthogonal pour l'approximation parcimonieuse rapide de signaux longs sur des dictionnaires locaux, in: Proc. GRETSI, September 2009.

Scientific Books (or Scientific Book chapters)

[62]
F. Bimbot.
Automatic Speaker Recognition (chapter 9), ISTE / John Wiley, 2009, p. 321–353.
[63]
A. Nesbit, M. Jafari, E. Vincent, M. Plumbley.
Audio source separation using sparse representations, IGI Global, 2010, Accepted subject to minor revisions.
[64]
E. Vincent, Y. Deville.
Audio applications, in: Handbook of Blind Source Separation, Independent Component Analysis and Applications, Academic Press, 2009.
[65]
E. Vincent, M. Jafari, S. Abdallah, M. Plumbley, M. Davies.
Probabilistic modeling paradigms for audio source separation, IGI Global, 2010, Accepted subject to minor revisions.

Internal Reports

[66]
R. Gribonval, K. Schnass.
Dictionary Identification - Sparse Matrix-Factorisation via $\ell^1$-Minimisation, arXiv, April 2009, no 0904.4774, Technical report.

Other Publications

[67]
S. Foucart, R. Gribonval.
Real vs. Complex Null Space Properties for Sparse Vector Recovery, October 2009, submitted to Comptes Rendus de l'Académie des Sciences.

References in notes

[68]
R. Baraniuk.
Compressive sensing, in: IEEE Signal Processing Magazine, July 2007, vol. 24, no 4, p. 118–121.
[69]
R. Boite, H. Bourlard, T. Dutoit, J. Hancq, H. Leich.
Traitement de la Parole, Presses Polytechniques et Universitaires Romandes, 2000.
[70]
M. Davy, S. J. Godsill, J. Idier.
Bayesian Analysis of Polyphonic Western Tonal Music, in: Journal of the Acoustical Society of America, 2006, vol. 119, no 4, p. 2498–2517.
[71]
G. Gravier, F. Yvon, B. Jacob, F. Bimbot.
Sirocco, un système ouvert de reconnaissance de la parole, in: Journées d'étude sur la parole, Nancy, June 2002, p. 273–276.
[72]
C. Herley.
ARGOS: Automatically Extracting repeating objects from multimedia streams, in: IEEE Trans. on Multimedia, February 2006, vol. 8, no 1, p. 115–129.
[73]
F. Jelinek.
Statistical Methods for Speech Recognition, MIT Press, Cambridge, Massachusetts, 1998.
[74]
S. Mallat.
A Wavelet Tour of Signal Processing, 2nd edition, Academic Press, San Diego, 1999.
[75]
K. Murphy.
An introduction to graphical models, 2001
http://www.cs.ubc.ca/~murphyk/Papers/intro_gm.pdf.
[76]
M. Utiyama, H. Isahara.
A Statistical Model for Domain-Independent Text Segmentation, in: Proceedings of the 39th Annual Meeting of Association for Computational Linguistics, ACL'01, Toulouse, France, July 2001.
[77]
N. Whiteley, A. T. Cemgil, S. J. Godsill.
Sequential Inference of Rhythmic Structure in Musical Audio, in: Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2007, p. 1321–1324.
