Data, Intelligence & Graphs – Télécom Paris

Home

The Data, Intelligence and Graphs (DIG) team is a group of researchers at Télécom Paris working on the fundamental issues raised in databases, knowledge management, graph mining and artificial intelligence. Research interests cover theoretical foundations of data intelligence and graph systems, practical solutions and applications, as well as cognitive aspects.

Check the newest version of the YAGO knowledge base!

The DIG team has strong industrial collaborations:

The DIG team is a proud signer of the TCS4F pledge for sustainable research in theoretical computer science. A large majority of DIG members are signers of the No free view? No review! pledge in favor of open access:

Research

Knowledge Bases

A knowledge base is a computer-processable collection of knowledge about the world. We construct and mine such knowledge bases.

YAGO: YAGO is a large ontology constructed from WordNet, Wikipedia, and other sources. We develop YAGO together with the Database department of the Max Planck Institute for Informatics in Germany.
AMIE: AMIE is a project to learn patterns and rules in ontologies. We conduct this project together with the Database department of the Max Planck Institute for Informatics in Germany.
NoRDF is our new project to model and extract complex information from natural language text. We are currently hiring PhDs, postdocs, and engineers!

Graph Mining

Graphs are a near-universal way to represent data. We are concerned with mining graphs for patterns and properties. Our particular focus is on the scalability of such approaches.

scikit-network: scikit-network is a Python package for the analysis of large graphs (clustering, embedding, classification, ranking).

Social Web

The Web has evolved more and more into a social Web: content is produced and shared by users. In the DIG team, we follow and anticipate developments in this area.

Community detection: We are investigating means to detect and distinguish social communities on the Web.
Social Relations: We investigate the optimal investment in social relations from a theoretical point of view.

Language and Relevance

Computer science is not just about computers. In this area of research, we investigate how humans reason, and what this implies for machines.

Simplicity Theory: Simplicity theory seeks to explain the relevance of situations or events to human minds. See http://www.simplicitytheory.science
Relevance in natural language: The point is to retro-engineer methods to achieve meaningful and relevant speech from our understanding of human performance. Read this paper. Read more on this.
Communication as social signalling: We apply game theory and social simulation to explore conditions in which providing valuable (i.e. relevant) information is a profitable strategy. Read this paper. Read more on this.

Machine Learning for Data Streams

We investigate how to do machine learning in real time, contributing to new open source tools:

River: a Python library for online Machine Learning
MOA: Massive Online Analytics, a framework for mining data streams (in Java)
Apache SAMOA: Scalable Advanced Massive Online Analytics, an open source framework for data stream mining on the Hadoop Ecosystem

People

Talel Abdessalem	Mehwish Alam	Albert Bifet	Thomas Bonald

Jean-Louis Dessalles	Nils Holzenberger	Louis Jachiet	Mauro Sozio	Fabian Suchanek

Faculty

Talel Abdessalem, Professor
Mehwish Alam, Associate Professor
Albert Bifet, Professor
Thomas Bonald, Professor, leader of the team
Jean-Louis Dessalles, Emeritus
Nils Holzenberger, Associate Professor
Louis Jachiet, Associate Professor
Marc Jeanmougin, Research Engineer
Mauro Sozio, Professor
Fabian Suchanek, Professor

Post-docs

Sven Dziadek
Peter Fratrik
Fajrian Yunus

PhD candidates

Amy Affoudhi. Advisors: Fabian M. Suchanek and Nils Holzenberger
Zakari Ait Ouazzou. Advisors: Talel Abdessalem and Albert Bifet
François Amat. Advisor: Fabian Suchanek.
Tom Calamai. Advisors: Fabian M. Suchanek and Oana Balalau
Cyril Chhun. Advisors: Fabian M. Suchanek and Chloé Clavel
Simon Coumes. Advisor: Fabian M. Suchanek
Gabriel Damay. Advisor Mauro Sozio
Simon Delarue. Advisors: Thomas Bonald and Tiphaine Viard
Rajaa El Hamdani. Advisor: Thomas Bonald & Fragkiskos Malliaros
Chadi Helwe. Advisors: Fabian M. Suchanek and Chloé Clavel
Lanfang Kong. Advisor Mauro Sozio
Yiwen Peng. Advisors: Thomas Bonald and Mehwish Alam
Roman Plaud. Advisors: Thomas Bonald, Mathieu Labeau and Antoine Saillenfest
Samuel Reyd. Advisors: Ada Diaconescu and Jean-Louis Dessalles
Zacchary Sadeddine. Advisor: Fabian Suchanek

Interns

Bérénice Jaulmes. Advisors: Mehwish Alam, Fabian Suchanek
Nicoline Nymand-Andersen. Advisors: Thomas Bonald, Marc Jeanmougin

Former members

Antoine Amarilli (currently at Inria)
Sri Appakutti. Advisors: Nils Holzenberger, Fabian Suchanek
Pierre-Henri Paris
Octave Gaspard. Advisor: Antoine Amarilli
Vlad-Stefan Vergelea. Advisor: Antoine Amarilli
Mariam Bary. Advisor: Albert Bifet
Georges Hebrail, Invited professor (2020-2023)
Julien Lie-Panis. Advisors: Jean-Louis Dessalles and Jean-Baptiste André. (2019-2023)
Minh Huong Le Nguyen. Advisor: Albert Bifet
Nedeljko Radulovic. Advisors: Fabian M. Suchanek and Albert Bifet.
Angelo Ortiz. Advisors: Thomas Bonald, Mathieu Labeau and Antoine Saillenfest
Andrian Putina. Advisor: Mauro Sozio.
Etienne Houzé. Advisors: Ada Diaconescu and Jean-Louis Dessalles (2019-2022).
Armand Boschin. Advisor: Thomas Bonald.
Lihu Chen. Advisors: Fabian Suchanek and Gael Varoquaux
Dihia Boulegane. Advisors: Albert Bifet and Giyyarpuram Madhusudan.
Samed Atouati. Advisor: Mauro Sozio.
Léo Laugier. Advisor: Thomas Bonald.
Tiphaine Viard, Associate Professor
Jakub Różycki. Advisor: Antoine Amarilli
Martín Muñoz, Pontificia Universidad Católica de Chile
Quentin Lutz. Advisor: Thomas Bonald.
Laurent Decreusefond, Professor
Pierre Senellart, Invited Professor
Cédric Kulbach. Visiting PhD. (2021)
Éloi Tanguy. Advisors: Thomas Bonald and Tiphaine Viard
Julie Dessaint. Advisors: Fabien Suchanek and Thomas Bonald
Flavia Salutari. Advisor: Mauro Sozio (2018-2021)
Hanady Gebran. Advisor: jean-louis Dessalles (2021)
Emmanuel Lagrée. Advisor: jean-louis Dessalles (2021)
Etienne Li. Advisor: jean-louis Dessalles (2021)
Demir Renaux (2021). Advisors: Antoine Amarilli and Louis Jachiet and Luc Segoufin
Rémi Dupré (2019). Advisors: Antoine Amarilli and Pierre Senellart.
Lucas Gréaux (2019) Advisor: Mauro Sozio.
Anes Mekki (2019). Advisor: Antoine Amarilli.
Bertrand Charpentier (2018)
Arnaud Guerquin (2018) Advisor: Mauro Sozio
Heitor Murilo Gomes (2017-2019)
Mostafa Haghir Chehreghani (2016-2019)
Céline Comte (2016-2019)
Suraj Jog (2016). Advisor: Pierre Senellart.
Qing Liu (2016)
Jesse Read (2016)
Katerina Tzompanaki (2016)
Marc Benhamou (2015)
Anis Bouriga (2015)
Maximilien Danisch (2015-2017)
Assane Dieng (2015)
Miyoung Han (2015-2018)
Takashi Hashimoto (2015)
Alexandre Hollocou (2015-2018)
Sitesh Indra (2015)
Quentin Lobbé (2015-2018)
Mikaël Monet (2015-2018)
Jacob Montiel (2015-2019)
Thomas Rebele (2015-2018)
Émilie Saintilan (2015)
Danai Symeonidou (2015)
Adrien Tibere-Inglesse (2015)
Oana Bălălău (2014-2017)
Yoann Bourse (2014)
Sahar Brinis (2014)
Luis Galárraga (2014-2016)
David Montoya (2014-2016)
Pierre-Alexandre Murena (2014-2019)
Mohamed-Amine Baazizi (2013)
Raphaël Bonaque (2013)
Roxana Gabriela Horincar (2013-2015)
Diogo Martins (2013-2014)
Sébastien Montenez (2013-2015)
Abdellah Fourtassi (2012)
Mayur Garg (2012)
Christos Giatsidis (2012-2013)
Lise Hobeika (2012)
Fragkiskos Malliaros (2012-2013)
Sylvain Mellak (2012)
Anna-Isavella Rerra (2012)
François Rousseau (2012-2013)
Yael Amsterdamer (2011)
Mouhamadou Lamine Ba (2011-2015)
Muhammad Faheem (2011-2014)
Aditya Goel (2011)
Georges Gouriten (2011-2013)
Modou Gueye (2011-2014)
Ioana Ileana (2011-2014)
Chengxuan Liao (2011)
Nicoleta Oita (2011-2012)
Antoine Saillenfest (2011-2016)
Imen Ben Dhia (2010-2013)
Vishal Gupta (2010)
Patrick Prianon (2010)
Michalis Vazirgiannis (2010-2013)
Adeel Anjum (2009)
Ashutosh Dwivedi (2009)
Evgeny Kharlamov (2009-2010)
Silviu Maniu (2009-2012)
Damien Munch (2009-2013)
Marilena Oita (2009-2012)
Asma Souihli (2009-2012)
Bogdan Cautis (2008-2013)
Nora Derouiche (2008-2012)
Adrian Dimulescu (2008-2010)
Bilel Gueni (2007-2010)
Ruiming Tang (2014)
Maroua Bahri. Advisors: Albert Bifet and Silviu Maniu.
Nader Beltaief. Advisor: Laurent Decreusefond.
Siwar Garouachi
Jean-Benoît Griesner ( )
Sanaz Hasanzadeh Fard. Advisors: Fabian M. Suchanek and Chloé Clavel
Jonathan Lajus. Advisor: Fabian M. Suchan
Nathan De Lara. Advisor: Thomas Bonald.
Denys Lazarenko. Advisor: Thomas Bonald.
Ngurah Agus Sanjaya ER. Advisors: Talel Abdessalem and Stéphane Bressan.
Thomas Pellissier Tanon. Advisor: Fabian M. Suchanek and Antoine Amarilli.
Edouard Pineau. Advisor: Thomas Bonald.
Julien Romero. Advisors: Fabian M. Suchanek and Nicoleta Preda
Atef Shaar
Zhihan Zhang. Advisor: Laurent Decreusefond

News

An open position of Assistant / Associate Professor is available in the team!

Tuesday, November 12, 2024, 11:45, 4A125

Fabian Suchanek YAGO In this talk I will present the newest version of YAGO, the knowledge base that we are building with several members of the DIG team. I will show why we build it, how we build it, and how it can be used. This will also be an occasion for me to get …

Continue reading “Tuesday, November 12, 2024, 11:45, 4A125”

Tuesday, October 29, 2024, 11:45, 4A125

Simon Coumes Qiana: A First-Order Formalism to Quantify over Contexts and Formulas Qiana is a logic framework for reasoning on formulas that are true only in specific contexts. In Qiana, it is possible to quantify over both formulas and contexts to express, e.g., that “everyone knows everything Alice says”. Qiana also permits paraconsistent logics within …

Continue reading “Tuesday, October 29, 2024, 11:45, 4A125”

Tuesday, October 15, 2024, 11:45, 4A301

Yael Amsterdamer & Daniel Deutch Query-Guided Data Cleaning (Yael Amsterdamer) We take an active approach to the cleaning of uncertain databases, by proposing a set of tools to guide the cleaning process. We start with a database whose tuple correctness is uncertain, and with some means of resolving this uncertainty, e.g., crowdsourcing, experts, a trained …

Continue reading “Tuesday, October 15, 2024, 11:45, 4A301”