Home

The research interests of the DIG team span knowledge graphs, LLMs, foundational models, graph mining and data streams, united by a focus on structured knowledge and reasoning. The team develops methods for representing, integrating, and reasoning over complex, dynamic data to enable interpretable and trustworthy AI. Applications range from general-purpose AI to domain-specific areas such as legal AI and AI for health.
More specifically, the DIG team’s research activity covers the following topics:
  • Knowledge bases
  • Logic and algorithms
  • Language and relevance
  • Graph mining
  • Machine learning
  • Data streams
  • LLMs
  • Legal AI
  • AI for health

The DIG team has strong industrial collaborations.

The DIG team is a proud signer of the TCS4F pledge for sustainable research in theoretical computer science.  A large majority of DIG members are signers of the No free view? No review! pledge in favor of open access:

Theoretical Computer Scientists for Future No free view? No review!

Research

Knowledge Bases

A knowledge base is a computer-processable collection of knowledge about the world. We construct and mine such knowledge bases.

Graph Mining

Graphs are a near-universal way to represent data. We are concerned with mining graphs for patterns and properties. Our particular focus is on the scalability of such approaches.

  • Logo of scikit-networkscikit-network: scikit-network is a Python package for the analysis of large graphs (clustering, embedding, classification, ranking).

Data Streams

We investigate how to do machine learning in real time, contributing to new open source tools:

  • River: a Python library for online Machine Learning
  • MOA: Massive Online Analytics, a framework for mining data streams (in Java)
  • Apache SAMOA: Scalable Advanced Massive Online Analytics, an open source framework for data stream mining on the Hadoop Ecosystem

Language and Relevance

Computer science is not just about computers. In this area of research, we investigate how humans reason, and what this implies for machines.

  • Simplicity theory seeks to explain the relevance of situations or events to human minds.
  • Relevance in natural language: The point is to retro-engineer methods to achieve meaningful and relevant speech from our understanding of human performance.
  • We apply game theory and social simulation to explore conditions in which providing valuable (i.e. relevant) information is a profitable strategy. Read this paper.

Team

Talel Abdessalem Mehwish Alam Albert Bifet Thomas Bonald Jean-Louis Dessalles
Nils Holzenberger Louis Jachiet Van-Tam Nguyen Nikola Simidjievski Fabian Suchanek

Faculty

Research engineer

Post-docs

  • Peter Fratrik
  • Fajrian Yunus
  • Alaa Mazouz

PhD candidates

  • François Amat. Advisor: Fabian Suchanek
  • Tom Calamai. Advisors: Fabian M. Suchanek and Oana Balalau
  • Simon Coumes. Advisor: Fabian M. Suchanek
  • Pierre Epron. Advisors: Mehwish Alam and Adrien Coulet
  • Lorenzo Guerra. Advisors: Guillaume Duc, Pavlo Mozharovsky, Van-Tam Nguyen
  • Samy Haffoudhi. Advisors: Fabian Suchanek and Nils Holzenberger
  • Rajaa El HamdaniAdvisosr: Thomas Bonald & Fragkiskos Malliaros
  • Bérénice Jaulmes. Advisors: Mehwish Alam and Fabian Suchanek
  • Zhu Liao. Advisors: Enzo Tartaglione and Van-Tam Nguyen
  • Rémi Nahon. Advisors: Enzo Tartaglione and Van-Tam Nguyen
  • Hung Nguyen. Advisor: Mehwish Alam
  • Le Trung Nguyen. Advisors: Enzo Tartaglione and Van-Tam Nguyen
  • Van Chien Nguyen. Advisors: Samuel Tardieu  and Van-Tam Nguyen
  • Zakari Ait Ouazzou. Advisors: Talel Abdessalem and Albert Bifet
  • Yiwen Peng. Advisors: Thomas Bonald and Mehwish Alam
  • Roman Plaud. Advisors: Thomas Bonald, Mathieu Labeau and Antoine Saillenfest
  • Ael Quelennec. Advisors: Enzo Tartaglione, Pavlo Mozharovsky and Van-Tam Nguyen
  • Samuel Reyd. Advisors: Ada Diaconescu and Jean-Louis Dessalles
  • Zacchary Sadeddine. Advisor: Fabian Suchanek
  • Ali Tarhini. Advisors: Paul Chollet  and Van-Tam Nguyen
  • Long-Tuan Vo. Advisors: Mehwish Alam, Pavlo Mozharovsky and Van-Tam Nguyen
  • Yinghao Wang. Advisors: Enzo Tartaglione and Van-Tam Nguyen
  • Viktoriya Zhukova. Advisor: Thomas Bonald

PhD track students

  • Zeinab Ghamlouch. Advisor: Mehwish Alam
  • Avrile Floro. Advisor: Nils Holzenberger
  • Marc Farah. Advisor: Nikola Simidjievski
  • Daniela Cojocaru. Advisor: Nikola Simidjievski
  • Quoc-Dat Tran. Advisor: Nikola Simidjievski
  • Hai Thien Long Vu. Advisor: Thomas Bonald
  • Thanh Hai Tran. Advisor: Van-Tam Nguyen
  • Thanh Nam Tran. Advisor: Van-Tam Nguyen

News

We are hiring two PhD students and one Postdoc to work on language models and knowledge graphs!

Best paper award at ISWC 2025

Yiwen Peng, Thomas Bonald and Fabian Suchanek received the best paper award at ISWC 2025 for their paper on FLORA: Unsupervised Knowledge Graph Alignment by Fuzzy Logic.

Tuesday, October 28, 2025, 11:45, 4A125

Cristian Santini (University of Macerata) Entity Linking and Relation Extraction for Historical Italian Texts: Challenges and Potential Solutions Entity Linking and Relation Extraction enable the automatic identification of named entities mentioned in texts, along with their relationships, by connecting them to external knowledge graphs such as Wikidata. While these techniques work well on modern documents, …

Tuesday, October 21, 2025, 11:45, 4A301

Yiwen Peng FLORA: Unsupervised Knowledge Graph Alignment by Fuzzy Logic Knowledge graph alignment is the task of matching equivalent entities (that is, instances and classes) and relations across two knowledge graphs. Most existing methods focus on pure entity-level alignment, computing the similarity of entities in some embedding space. They lack interpretable reasoning and need training …