Bibliography
Major publications by the team in recent years
- [1]
- G. Antoniu, L. Bougé, P. Hatcher, M. MacBeth, K. McGuigan, R. Namyst.
The Hyperion system: Compiling multithreaded Java bytecode for distributed execution, in: Parallel Computing, October 2001, vol. 27, p. 1279–1297.
- [2]
- O. Aumage, L. Bougé, A. Denis, L. Eyraud, J.-F. Méhaut, G. Mercier, R. Namyst, L. Prylli.
A Portable and Efficient Communication Library for High-Performance Cluster Computing (extended version), in: Cluster Computing, January 2002, vol. 5, no 1, p. 43-54.
- [3]
- O. Aumage, E. Brunet, N. Furmento, R. Namyst.
NewMadeleine: a Fast Communication Scheduling Engine for High Performance Networks, in: CAC 2007: Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2007, Long Beach, California, USA, March 2007
http://hal.inria.fr/inria-00127356, Also available as LaBRI Report 1421-07 and INRIA RR-6085.
- [4]
- O. Aumage, G. Mercier.
MPICH/MadIII: a Cluster of Clusters Enabled MPI Implementation, in: Proc. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2003), Tokyo, IEEE, May 2003, p. 26–35.
- [5]
- O. Aumage, G. Mercier, R. Namyst.
MPICH/Madeleine: a True Multi-Protocol MPI for High-Performance Networks, in: Proc. 15th International Parallel and Distributed Processing Symposium (IPDPS 2001), San Francisco, IEEE, April 2001, 51 p, Extended proceedings in electronic form only.
- [6]
- D. Buntinas, G. Mercier, W. Gropp.
Implementation and Shared-Memory Evaluation of MPICH2 over the Nemesis Communication Subsystem, in: Recent Advances in Parallel Virtual Machine and Message Passing Interface: Proc. 13th European PVM/MPI Users Group Meeting, Bonn, Germany, September 2006.
- [7]
- V. Danjean, R. Namyst, R. Russell.
Linux Kernel Activations to Support Multithreading, in: Proc. 18th IASTED International Conference on Applied Informatics (AI 2000), Innsbruck, Austria, IASTED, February 2000, p. 718-723.
- [8]
- B. Goglin.
Improving Message Passing over Ethernet with I/OAT Copy Offload in Open-MX, in: Proceedings of the IEEE International Conference on Cluster Computing, Tsukuba, Japan, IEEE Computer Society Press, September 2008, p. 223–231
http://hal.inria.fr/inria-00288757.
- [9]
- S. Moreaud, B. Goglin.
Impact of NUMA Effects on High-Speed Networking with Multi-Opteron Machines, in: The 19th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2007), Cambridge, Massachusetts, November 2007
http://hal.inria.fr/inria-00175747.
- [10]
- R. Namyst.
Contribution à la conception de supports exécutifs multithreads performants, Université Claude Bernard de Lyon, for work carried out at the École normale supérieure de Lyon, December 2001, Habilitation à diriger des recherches.
- [11]
- S. Thibault, F. Broquedis, B. Goglin, R. Namyst, P.-A. Wacrenier.
An Efficient OpenMP Runtime System for Hierarchical Architectures, in: International Workshop on OpenMP (IWOMP), Beijing, China, June 2007, p. 148–159
http://hal.inria.fr/inria-00154502.
- [12]
- S. Thibault, R. Namyst, P.-A. Wacrenier.
Building Portable Thread Schedulers for Hierarchical Multiprocessors: the BubbleSched Framework, in: EuroPar, Rennes, France, ACM, August 2007
http://hal.inria.fr/inria-00154506.
- [13]
- F. Trahay, E. Brunet, A. Denis, R. Namyst.
A multithreaded communication engine for multicore architectures, in: CAC 2008: Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2008, Miami, FL, IEEE Computer Society Press, April 2008
http://hal.inria.fr/inria-00224999.
- [14]
- F. Trahay, A. Denis, O. Aumage, R. Namyst.
Improving Reactivity and Communication Overlap in MPI using a Generic I/O Manager, in: EuroPVM/MPI, F. Cappello, T. Herault, J. Dongarra (editors), Lecture Notes in Computer Science, Springer, 2007, vol. Recent Advances in Parallel Virtual Machine and Message Passing Interface, no 4757, p. 170-177
http://hal.inria.fr/inria-00177167.
Publications of the year
Doctoral Dissertations and Habilitation Theses
- [15]
- F. Trahay.
De l'interaction des communications et de l'ordonnancement de threads au sein des grappes de machines multi-cœurs, Université Bordeaux 1, 351 cours de la Libération — 33405 TALENCE cedex, November 2009, Ph. D. Thesis.
Articles in International Peer-Reviewed Journal
- [16]
- C. Augonnet, S. Thibault, R. Namyst.
StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines, in: IEEE Transactions on Parallel and Distributed Systems, 2010, Submitted.
- [17]
- C. Augonnet, S. Thibault, R. Namyst, P.-A. Wacrenier.
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, in: Concurrency and Computation: Practice and Experience - Euro-Par 2009 Best Papers issue, 2010, Submitted.
- [18]
- L.-C. Canon, O. Dubuisson, J. Gustedt, E. Jeannot.
Defining and Controlling the Heterogeneity of a Cluster: the Wrekavoc Tool, in: The Journal of Systems & Software, 2010.
- [19]
- M. Faverge, P. Ramet.
A NUMA Aware Scheduler for a Parallel Sparse Direct Solver, in: Parallel Computing, 2010, Submitted.
- [20]
- B. Goglin.
NIC-assisted Cache-Efficient Receive Stack for Message Passing over Ethernet, in: Concurrency and Computation: Practice and Experience - Euro-Par 2009 Best Papers issue, 2010, Submitted.
International Peer-Reviewed Conference/Proceedings
- [21]
- C. Augonnet, S. Thibault, R. Namyst.
Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures, in: Proceedings of the International Euro-Par Workshops 2009, HPPC'09, Delft, The Netherlands, Lecture Notes in Computer Science, Springer, August 2009
http://hal.inria.fr/inria-00421333.
- [22]
- C. Augonnet, S. Thibault, R. Namyst, M. Nijhuis.
Exploiting the Cell/BE architecture with the StarPU unified runtime system, in: SAMOS Workshop - International Workshop on Systems, Architectures, Modeling, and Simulation, Samos, Greece, Lecture Notes in Computer Science, July 2009
http://hal.inria.fr/inria-00378705.
- [23]
- C. Augonnet, S. Thibault, R. Namyst, P.-A. Wacrenier.
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, in: Proceedings of the 15th International Euro-Par Conference, Delft, The Netherlands, Lecture Notes in Computer Science, Springer, August 2009, vol. 5704, p. 863–874
http://hal.inria.fr/inria-00384363.
- [24]
- F. Broquedis, O. Aumage, B. Goglin, S. Thibault, P.-A. Wacrenier, R. Namyst.
Structuring the execution of OpenMP applications for multicore architectures, in: Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'10), Atlanta, GA, IEEE Computer Society Press, April 2010
http://hal.inria.fr/inria-00441472.
- [25]
- F. Broquedis, J. Clet-Ortega, S. Moreaud, N. Furmento, B. Goglin, G. Mercier, S. Thibault, R. Namyst.
hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications, in: Proceedings of the 18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2010), Pisa, Italy, IEEE Computer Society Press, February 2010
http://hal.inria.fr/inria-00429889.
- [26]
- F. Broquedis, N. Furmento, B. Goglin, R. Namyst, P.-A. Wacrenier.
Dynamic Task and Data Placement over NUMA Architectures: an OpenMP Runtime Perspective, in: Evolving OpenMP in an Age of Extreme Parallelism, 5th International Workshop on OpenMP, IWOMP 2009, Dresden, Germany, Lecture Notes in Computer Science, Springer, June 2009, vol. 5568, p. 79–92
http://hal.inria.fr/inria-00367570.
- [27]
- D. Buntinas, B. Goglin, D. Goodell, G. Mercier, S. Moreaud.
Cache-Efficient, Intranode Large-Message MPI Communication with MPICH2-Nemesis, in: Proceedings of the 38th International Conference on Parallel Processing (ICPP-2009), Vienna, Austria, IEEE Computer Society Press, September 2009
http://hal.inria.fr/inria-00390064.
- [28]
- L.-C. Canon, E. Jeannot, J. Weissman.
A Dynamic Approach for Characterizing Collusion in Desktop Grids, in: Proceedings of 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS'10), Atlanta, GA, IEEE Computer Society Press, April 2010
http://hal.inria.fr/inria-00441256/.
- [29]
- B. Goglin, N. Furmento.
Enabling High-Performance Memory-Migration in Linux for Multithreaded Applications, in: MTAAP'09: Workshop on Multithreaded Architectures and Applications, held in conjunction with IPDPS 2009, Rome, Italy, IEEE Computer Society Press, May 2009
http://hal.inria.fr/inria-00358172.
- [30]
- B. Goglin, N. Furmento.
Finding a Tradeoff between Host Interrupt Load and MPI Latency over Ethernet, in: Proceedings of the IEEE International Conference on Cluster Computing, New Orleans, LA, IEEE Computer Society Press, September 2009
http://hal.inria.fr/inria-00397328.
- [31]
- B. Goglin, N. Furmento.
Memory Migration on Next-Touch, in: Proceedings of the Linux Symposium, Montreal, Canada, July 2009, p. 101–110
http://hal.inria.fr/inria-00378580.
- [32]
- B. Goglin.
Decoupling Memory Pinning from the Application with Overlapped on-Demand Pinning and MMU Notifiers, in: CAC 2009: The 9th Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2009, Rome, Italy, IEEE Computer Society Press, May 2009
http://hal.inria.fr/inria-00356236.
- [33]
- B. Goglin.
High Throughput Intra-Node MPI Communication with Open-MX, in: Proceedings of the 17th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2009), Weimar, Germany, IEEE Computer Society Press, February 2009
http://hal.inria.fr/inria-00331209.
- [34]
- B. Goglin.
NIC-assisted Cache-Efficient Receive Stack for Message Passing over Ethernet, in: Proceedings of the 15th International Euro-Par Conference, Delft, The Netherlands, Lecture Notes in Computer Science, Springer, August 2009, vol. 5704, p. 1065–1077
http://hal.inria.fr/inria-00379168.
- [35]
- G. Mercier, J. Clet-Ortega.
Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments, in: EuroPVM/MPI, Espoo, Finland, Lecture Notes in Computer Science, Springer, September 2009, vol. 5759, p. 104-115
http://hal.inria.fr/inria-00392581.
- [36]
- G. Mercier, F. Trahay, D. Buntinas, E. Brunet.
NewMadeleine: An Efficient Support for High-Performance Networks in MPICH2, in: Proceedings of 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS'09), Rome, Italy, IEEE Computer Society Press, May 2009
http://hal.archives-ouvertes.fr/hal-00360275.
- [37]
- S. Moreaud, B. Goglin, D. Goodell, R. Namyst.
Optimizing MPI Communication within large Multicore nodes with Kernel assistance, in: CAC 2010: The 10th Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2010, Atlanta, GA, IEEE Computer Society Press, April 2010, Submitted.
- [38]
- M. Nijhuis, H. Bos, H. Bal, C. Augonnet.
Mapping and synchronizing streaming applications on Cell processors, in: International Conference on High Performance Embedded Architectures & Compilers, Paphos, Cyprus, January 2009.
- [39]
- F. Trahay, E. Brunet, A. Denis.
Analysis of the impact of multi-threading on communication performance, in: CAC 2009: The 9th Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2009, Rome, Italy, IEEE Computer Society Press, May 2009
http://hal.inria.fr/inria-00381670.
- [40]
- F. Trahay, A. Denis.
A scalable and generic task scheduling system for communication libraries, in: Proceedings of the IEEE International Conference on Cluster Computing, New Orleans, LA, IEEE Computer Society Press, September 2009
http://hal.inria.fr/inria-00408521.
National Peer-Reviewed Conference/Proceedings
- [41]
- C. Augonnet.
StarPU: un support exécutif unifié pour les architectures multicœurs hétérogènes, in: 19ème Rencontres Francophones du Parallélisme, Toulouse, France, September 2009
http://hal.inria.fr/inria-00411581.
- [42]
- F. Broquedis.
Ordonnancement de threads OpenMP et placement de données coordonnés sur architectures hiérarchiques, in: 19ème Rencontres Francophones du Parallélisme, Toulouse, France, September 2009
http://hal.inria.fr/inria-00422213.
- [43]
- J. Clet-Ortega.
Une stratégie efficace pour le placement de processus en environnement multicœur, in: 19ème Rencontres Francophones du Parallélisme, Toulouse, France, September 2009
http://hal.archives-ouvertes.fr/inria-00410756.
- [44]
- M. Faverge.
Vers un solveur de systèmes linéaires creux adapté aux machines NUMA, in: 19ème Rencontres Francophones du Parallélisme, Toulouse, France, September 2009
http://hal.inria.fr/inria-00416496.
- [45]
- S. Moreaud.
Adaptation des communications MPI intra-nœud aux architectures multicœurs modernes, in: 19ème Rencontres Francophones du Parallélisme, Toulouse, France, September 2009
http://hal.inria.fr/inria-00430021.
- [46]
- F. Trahay.
Bibliothèque de communication multi-threadée pour architectures multi-cœurs, in: 19ème Rencontres Francophones du Parallélisme, Toulouse, France, September 2009
http://hal.inria.fr/inria-00410355.
Workshops without Proceedings
- [47]
- M. Faverge.
A NUMA Aware Scheduler for a Parallel Sparse Direct Solver, in: Journées Informatique Massivement Multiprocesseur et Multicoeur, Rocquencourt, France, February 2009
http://www.labri.fr/~ramet/restricted/i3m.pdf.gz.
References in notes
- [48]
- T. Anderson, B. Bershad, E. Lazowska, H. Levy.
Scheduler Activations: Effective Kernel Support for the User-Level Management of Parallelism, in: ACM Transactions on Computer Systems, February 1992, vol. 10, no 1, p. 53-79.
- [49]
- P. Balaji, H.-W. Jin, K. Vaidyanathan, D. K. Panda.
Supporting iWARP Compatibility and Features for Regular Network Adapters, in: Proceedings of the Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations, and Technologies (RAIT); held in conjunction with the IEEE International Conference on Cluster Computing, Boston, MA, September 2005.
- [50]
- G. Ciaccio, G. Chiola.
GAMMA and MPI/GAMMA on GigabitEthernet, in: Proceedings of 7th EuroPVM-MPI conference, Balatonfüred, Hungary, Lecture Notes in Computer Science, Springer Verlag, September 2000, vol. 1908.
- [51]
- G. R. Gao, T. Sterling, R. Stevens, M. Hereld, W. Zhu.
Hierarchical multithreading: programming model and system software, in: 20th International Parallel and Distributed Processing Symposium (IPDPS), April 2006.