Overall Objectives
Research Program
Application Domains
New Software and Platforms
New Results
Partnerships and Cooperations
XML PDF e-pub
PDF e-Pub


Major publications by the team in recent years
S. Abiteboul, B. André, D. Kaplan.
Managing your digital life, in: Commun. ACM, 2015, vol. 58, no 5, pp. 32–35.
S. Abiteboul, R. Hull, V. Vianu.
Foundations of Databases, Addison-Wesley, 1995.
S. Abiteboul, I. Manolescu, P. Rigaux, M. Rousset, P. Senellart.
Web Data Management, Cambridge University Press, 2011.
A. Amarilli, P. Bourhis, P. Senellart.
Provenance Circuits for Trees and Treelike Instances, in: Automata, Languages, and Programming - 42nd International Colloquium, ICALP 2015, Kyoto, Japan, July 6-10, 2015, Proceedings, Part II, 2015, pp. 56–68.
M. Benedikt, P. Senellart.
Databases, in: Computer Science, The Hardware, Software and Heart of It, Springer, 2011, pp. 169–229.
N. Francis, L. Segoufin, C. Sirangelo.
Datalog Rewritings of Regular Path Queries using Views, in: Logical Methods in Computer Science, 2015, vol. 11, no 4.
F. Jacquemard, L. Segoufin, J. Dimino.
FO2(<, +1, ~) on data trees, data tree automata and branching vector addition systems, in: Logical Methods in Computer Science, 2016, vol. 12, no 2.
W. Kazana, L. Segoufin.
Enumeration of monadic second-order queries on trees, in: ACM Trans. Comput. Log., 2013, vol. 14, no 4, pp. 25:1–25:12.
S. Lei, S. Maniu, L. Mo, R. Cheng, P. Senellart.
Online Influence Maximization, in: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, NSW, Australia, August 10-13, 2015, 2015, pp. 645–654.
D. Montoya, S. Abiteboul, P. Senellart.
Hup-me: inferring and reconciling a timeline of user activity from rich smartphone data, in: Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems, Bellevue, WA, USA, November 3-6, 2015, 2015, pp. 62:1–62:4.
Publications of the year

Articles in International Peer-Reviewed Journals

D. Figueira, L. Segoufin.
Bottom-up automata on data trees and vertical XPath, in: Logical Methods in Computer Science, 2017, pp. 1-40, [ DOI : 10.08748 ]
S. Maniu, R. Cheng, P. Senellart.
An Indexing Framework for Queries on Probabilistic Graphs, in: ACM Trans. Datab. Syst, 2017.
P. Senellart.
Provenance and Probabilities in Relational Databases: From Theory to Practice, in: SIGMOD record, December 2017, pp. 1-11.

Invited Conferences

S. Abiteboul.
Issues in Ethical Data Management - Extended Abstract, in: PPDP 2017 - 19th International Symposium on Principles and Practice of Declarative Programming, Namur, Belgium, October 2017.
S. Abiteboul, D. Montoya.
Personal Knowledge Base Systems, in: PAP 2017, Personal analytics and privacy, Skopje, Macedonia, September 2017.

International Conferences with Proceedings

A. Amarilli, Y. Amsterdamer, T. Milo, P. Senellart.
Top-k Querying of Unknown Values under Order Constraints, in: ICDT 2017 - International Conference on Database Theory, Venice, Italy, March 2017. [ DOI : 10.4230/LIPIcs.ICDT.2017.5 ]
A. Amarilli, P. Bourhis, M. Monet, P. Senellart.
Combined Tractability of Query Evaluation via Tree Automata and Cycluits, in: ICDT 2017 - International Conference on Database Theory, Venice, Italy, March 2017. [ DOI : 10.4230/LIPIcs.ICDT.2017.6 ]
A. Amarilli, M. Lamine Ba, D. Deutch, P. Senellart.
Possible and Certain Answers for Queries over Order-Incomplete Data, in: 24th International Symposium on Temporal Representation and Reasoning (TIME 2017), Mons, Belgium, S. Schewe, T. Schneider, J. Wijsen (editors), Schloss Dagstuhl, October 2017, vol. 90, pp. 4:1-4:19, [ DOI : 10.4230/LIPIcs.TIME.2017.4 ]
A. Amarilli, M. Monet, P. Senellart.
Conjunctive Queries on Probabilistic Graphs: Combined Complexity, in: Principles of Database Systems (PODS), Chicago, United States, May 2017, [ DOI : 10.1145/3034786.3056121 ]
M. Crochemore, A. Heliou, G. Kucherov, L. Mouchard, S. P. Pissis, Y. Ramusat.
Minimal absent words in a sliding window & applications to on-line pattern matching, in: FCT 2017, Bordeaux, France, Lecture Notes in Computer Science, Springer, September 2017, forthcoming.
O. Savković, E. Kharlamov, W. Nutt, P. Senellart.
Towards Approximating Incomplete Queries over Partially Complete Databases (Extended Abstract), in: AMW, Montevideo, Uruguay, AMW 2017 - 11th Alberto Mendelzon International Workshop on Foundations of Data Management Montevideo, Uruguay June 5 – 9, 2017, June 2017.
L. Segoufin, A. Vigny.
Constant Delay Enumeration for FO Queries over Databases with Local Bounded Expansion, in: ICDT, Venise, Italy, March 2017.
J. Stoyanovich, B. Howe, S. Abiteboul, G. Miklau, A. Sahuguet, G. Weikum.
Fides: Towards a Platform for Responsible Data Science, in: SSDBM'17 - 29th International Conference on Scientific and Statistical Database Management, Chicago, United States, June 2017. [ DOI : 10.1145/3085504.3085530 ]

National Conferences with Proceedings

K. Rafes, S. Cohen-Boulakia, S. Abiteboul.
Une autocomplétion générique de SPARQL dans un contexte multi-services, in: BDA 2017 - 33ème conférence sur la «Gestion de Données — Principes, Technologies et Applications», Nancy, France, November 2017.

Books or Proceedings Editing

A. Meliou, P. Senellart (editors)
Proceedings of the 20th International Workshop on the Web and Databases, WebDB 2017, May 2017.

Scientific Popularization

S. Abiteboul, G. Dowek.
Le temps des algorithmes, Editions Le Pommier, 2017, 192 p.
S. Abiteboul, V. Peugeot.
Terra Data : Qu'allons-nous faire des données numériques ?, Editions Le Pommier, 2017, 320 p.
P. Senellart.
Archivage du Web, in: Les Big Data à découvert, CNRS Éditions, March 2017.

Other Publications

A. Amarilli, Y. Amsterdamer, T. Milo, P. Senellart.
Top-k Querying of Unknown Values under Order Constraints (Extended Version), January 2017, - 32 pages, 1 figure, 1 algorithm, 51 references. Extended version of paper at ICDT'17.
A. Amarilli, M. L. Ba, D. Deutch, P. Senellart.
Possible and Certain Answers for Queries over Order-Incomplete Data, October 2017, - 55 pages, 5 figures, 1 table, 44 references. Accepted at TIME'17. This paper is the full version with appendices of the article in the TIME proceedings. The main text of this full version is the same as the TIME proceedings version, except some superficial changes (to fit the proceedings version to 15 pages, and to obey LIPIcs-specific formatting requirements). [ DOI : 10.4230/LIPIcs.TIME.2017.4 ]
P. Senellart, A. Amarilli, M. Monet.
Connecting Width and Structure in Knowledge Compilation, October 2017, - 32 pages, no figures, 39 references. Submitted.
References in notes
S. Abiteboul, P. Bourhis, V. Vianu.
Comparing workflow specification languages: A matter of views, in: ACM Trans. Database Syst., 2012, vol. 37, no 2, pp. 10:1–10:59.
S. Abiteboul, P. Buneman, D. Suciu.
Data on the Web: From Relations to Semistructured Data and XML, Morgan Kaufmann, 1999.
S. Abiteboul, D. Deutch, V. Vianu.
Deduction with Contradictions in Datalog, in: Proc. 17th International Conference on Database Theory (ICDT), Athens, Greece, March 24-28, 2014., N. Schweikardt, V. Christophides, V. Leroy (editors),, 2014, pp. 143–154.
S. Abiteboul, L. Herr, J. V. den Bussche.
Temporal Versus First-Order Logic to Query Temporal Databases, in: Proceedings of the Fifteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 3-5, 1996, Montreal, Canada, R. Hull (editor), ACM Press, 1996, pp. 49–57.
S. Abiteboul, B. Kimelfeld, Y. Sagiv, P. Senellart.
On the expressiveness of probabilistic XML models, in: VLDB J., 2009, vol. 18, no 5, pp. 1041–1064.
S. Abiteboul, L. Segoufin, V. Vianu.
Representing and querying XML with incomplete information, in: ACM Trans. Database Syst., 2006, vol. 31, no 1, pp. 208–254.
A. Amarilli, P. Bourhis, P. Senellart.
Tractable Lineages on Treelike Instances: Limits and Extensions, in: Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS 2016, San Francisco, CA, USA, June 26 - July 01, 2016, T. Milo, W. Tan (editors), ACM, 2016, pp. 355–370.
Y. Amsterdamer, D. Deutch, V. Tannen.
Provenance for aggregate queries, in: Proceedings of the 30th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2011, June 12-16, 2011, Athens, Greece, M. Lenzerini, T. Schwentick (editors), ACM, 2011, pp. 153–164.
Y. Amsterdamer, Y. Grossman, T. Milo, P. Senellart.
CrowdMiner: Mining association rules from the crowd, in: PVLDB, 2013, vol. 6, no 12, pp. 1250–1253.
P. B. Baeza.
Querying graph databases, in: Proceedings of the 32nd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2013, New York, NY, USA - June 22 - 27, 2013, R. Hull, W. Fan (editors), ACM, 2013, pp. 175–188.
D. Barbará, H. Garcia-Molina, D. Porter.
The Management of Probabilistic Data, in: IEEE Trans. Knowl. Data Eng., 1992, vol. 4, no 5, pp. 487–502.
D. Basu, Q. Lin, W. Chen, H. T. Vo, Z. Yuan, P. Senellart, S. Bressan.
Regularized Cost-Model Oblivious Database Tuning with Reinforcement Learning, in: T. Large-Scale Data- and Knowledge-Centered Systems, 2016, vol. 28, pp. 96–132.
M. Benedikt, G. Gottlob, P. Senellart.
Determining relevance of accesses at runtime, in: Proceedings of the 30th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2011, June 12-16, 2011, Athens, Greece, M. Lenzerini, T. Schwentick (editors), ACM, 2011, pp. 211–222.
M. Bienvenu, D. Deutch, D. Martinenghi, P. Senellart, F. M. Suchanek.
Dealing with the Deep Web and all its Quirks, in: Proceedings of the Second International Workshop on Searching and Integrating New Web Data Sources, Istanbul, Turkey, August 31, 2012, M. Brambilla, S. Ceri, T. Furche, G. Gottlob (editors), CEUR Workshop Proceedings,, 2012, vol. 884, pp. 21–24.
M. Bojańczyk, L. Segoufin, S. Toruńczyk.
Verification of database-driven systems via amalgamation, in: Proceedings of the 32nd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2013, New York, NY, USA - June 22 - 27, 2013, R. Hull, W. Fan (editors), ACM, 2013, pp. 63–74.
P. Buneman, S. Khanna, W.-C. Tan.
Why and Where: A Characterization of Data Provenance, in: Database Theory - ICDT 2001, 8th International Conference, London, UK, January 4-6, 2001, Proceedings., J. V. den Bussche, V. Vianu (editors), Lecture Notes in Computer Science, Springer, 2001, vol. 1973, pp. 316–330.
B. Courcelle.
The Monadic Second-Order Logic of Graphs. I. Recognizable Sets of Finite Graphs, in: Inf. Comput., 1990, vol. 85, no 1, pp. 12–75.
N. N. Dalvi, D. Suciu.
The dichotomy of probabilistic inference for unions of conjunctive queries, in: J. ACM, 2012, vol. 59, no 6, pp. 30:1–30:87.
A. Deshpande, Z. G. Ives, V. Raman.
Adaptive Query Processing, in: Foundations and Trends in Databases, 2007, vol. 1, no 1, pp. 1–140.
P. Donmez, J. G. Carbonell.
Proactive learning: cost-sensitive active learning with multiple imperfect oracles, in: Proceedings of the 17th ACM Conference on Information and Knowledge Management, CIKM 2008, Napa Valley, California, USA, October 26-30, 2008, J. G. Shanahan, S. Amer-Yahia, I. Manolescu, Y. Zhang, D. A. Evans, A. Kolcz, K. Choi, A. Chowdhury (editors), ACM, 2008, pp. 619–628.
M. Faheem, P. Senellart.
Adaptive Web Crawling Through Structure-Based Link Classification, in: Digital Libraries: Providing Quality Information - 17th International Conference on Asia-Pacific Digital Libraries, ICADL 2015, Seoul, Korea, December 9-12, 2015, Proceedings, R. B. Allen, J. Hunter, M. L. Zeng (editors), Lecture Notes in Computer Science, Springer, 2015, vol. 9469, pp. 39–51.
A. Galland, S. Abiteboul, A. Marian, P. Senellart.
Corroborating information from disagreeing views, in: Proceedings of the Third International Conference on Web Search and Web Data Mining, WSDM 2010, New York, NY, USA, February 4-6, 2010, B. D. Davison, T. Suel, N. Craswell, B. Liu (editors), ACM, 2010, pp. 131–140.
F. Geerts, A. Poggi.
On database query languages for K-relations, in: J. Applied Logic, 2010, vol. 8, no 2, pp. 173–185.
L. Getoor.
Introduction to statistical relational learning, MIT Press, 2007.
G. Gouriten, S. Maniu, P. Senellart.
Scalable, generic, and adaptive systems for focused crawling, in: 25th ACM Conference on Hypertext and Social Media, HT '14, Santiago, Chile, September 1-4, 2014, L. Ferres, G. Rossi, V. A. F. Almeida, E. Herder (editors), ACM, 2014, pp. 35–45.
T. J. Green, G. Karvounarakis, V. Tannen.
Provenance semirings, in: Proceedings of the Twenty-Sixth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 11-13, 2007, Beijing, China, L. Libkin (editor), ACM, 2007, pp. 31–40.
T. J. Green, V. Tannen.
Models for Incomplete and Probabilistic Information, in: IEEE Data Eng. Bull., 2006, vol. 29, no 1, pp. 17–24.
A. Y. Halevy.
Answering queries using views: A survey, in: VLDB J., 2001, vol. 10, no 4, pp. 270–294.
M. A. Hearst, S. T. Dumais, E. Osuna, J. Platt, B. Scholkopf.
Support vector machines, in: IEEE Intelligent Systems, 1998, vol. 13, no 4, pp. 18–28.
T. Imielinski, W. L. Jr..
Incomplete Information in Relational Databases, in: J. ACM, 1984, vol. 31, no 4, pp. 761–791.
W. Kazana, L. Segoufin.
Enumeration of first-order queries on classes of structures with bounded expansion, in: Proceedings of the 32nd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2013, New York, NY, USA - June 22 - 27, 2013, R. Hull, W. Fan (editors), ACM, 2013, pp. 297–308.
B. Kimelfeld, P. Senellart.
Probabilistic XML: Models and Complexity, in: Advances in Probabilistic Databases for Uncertain Information Management, Z. Ma, L. Yan (editors), Studies in Fuzziness and Soft Computing, Springer, 2013, vol. 304, pp. 39–66.
A. C. Klug.
Equivalence of Relational Algebra and Relational Calculus Query Languages Having Aggregate Functions, in: J. ACM, 1982, vol. 29, no 3, pp. 699–717.
D. Kossmann.
The State of the art in distributed query processing, in: ACM Comput. Surv., 2000, vol. 32, no 4, pp. 422–469.
J. D. Lafferty, A. McCallum, F. C. N. Pereira.
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, in: Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28 - July 1, 2001, C. E. Brodley, A. P. Danyluk (editors), Morgan Kaufmann, 2001, pp. 282–289.
M. Mohri.
Semiring Frameworks and Algorithms for Shortest-Distance Problems, in: Journal of Automata, Languages and Combinatorics, 2002, vol. 7, no 3, pp. 321–350.
F. Neven.
Automata Theory for XML Researchers, in: SIGMOD Record, 2002, vol. 31, no 3, pp. 39–46.
L. Segoufin.
A glimpse on constant delay enumeration (Invited Talk), in: 31st International Symposium on Theoretical Aspects of Computer Science (STACS 2014), STACS 2014, March 5-8, 2014, Lyon, France, E. W. Mayr, N. Portier (editors), LIPIcs, Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2014, vol. 25, pp. 13–27.
P. Senellart, A. Mittal, D. Muschick, R. Gilleron, M. Tommasi.
Automatic wrapper induction from hidden-web sources with domain knowledge, in: 10th ACM International Workshop on Web Information and Data Management (WIDM 2008), Napa Valley, California, USA, October 30, 2008, C. Y. Chan, N. Polyzotis (editors), ACM, 2008, pp. 9–16.
B. Settles, M. Craven, L. Friedland.
Active learning with real annotation costs, in: NIPS 2008 Workshop on Cost-Sensitive Learning, 2008.
B. Settles.
Active Learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, 2012.
F. M. Suchanek, S. Abiteboul, P. Senellart.
PARIS: Probabilistic Alignment of Relations, Instances, and Schema, in: PVLDB, 2011, vol. 5, no 3, pp. 157–168.
D. Suciu, D. Olteanu, C. , C. Koch.
Probabilistic Databases, Synthesis Lectures on Data Management, Morgan & Claypool Publishers, 2011.
R. S. Sutton, A. G. Barto.
Reinforcement learning - an introduction, Adaptive computation and machine learning, MIT Press, 1998.
M. Y. Vardi.
The Complexity of Relational Query Languages (Extended Abstract), in: Proceedings of the 14th Annual ACM Symposium on Theory of Computing, May 5-7, 1982, San Francisco, California, USA, H. R. Lewis, B. B. Simons, W. A. Burkhard, L. H. Landweber (editors), ACM, 1982, pp. 137–146.
K. Zhou, M. Lalmas, T. Sakai, R. Cummins, J. M. Jose.
On the reliability and intuitiveness of aggregated search metrics, in: 22nd ACM International Conference on Information and Knowledge Management, CIKM'13, San Francisco, CA, USA, October 27 - November 1, 2013, Q. He, A. Iyengar, W. Nejdl, J. Pei, R. Rastogi (editors), ACM, 2013, pp. 689–698.
M. T. Özsu, P. Valduriez.
Principles of Distributed Database Systems, Third Edition, Springer, 2011.