Our Publications

2025

The Lucie-7B LLM and the Lucie Training Dataset: Open resources for multilingual language generation

arXiv preprint arXiv:2503.12294

#Olivier Gouvert, #Julie Hunter, #Jérôme Louradour, Christophe Cerisara, Evan Dufraisse, Yaya Sy, Laura Rivière, #Jean-Pierre Lorré

#OpenLLM, #Language

2024

LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect

AAAI 2025 Workshop Good-Data

#Speech

2024

Bias in the Mirror: Are LLMs opinions robust to their own adversarial attacks?

ACL 2025

#Virgile Rennard, Christos Xypolopoulos, Michalis Vazirgiannis

#Language, #SUMM-RE, #LLM4ALL

2024

SUMM-RE: A corpus of French meeting-style conversations

31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN)

#Julie Hunter, Hiroyoshi Yamasaki, Océane Granier, #Jérôme Louradour, Roxane Bertrand, #Kate Thompson, Laurent Prévot

#Language, #SUMM-RE

2024

Nebula: A discourse aware Minecraft builder

#Kate Thompson, Nicholas Asher, Akshay Chaturvedi

#COCOBOTS, #DISCUTER, #Language

2024

Llamipa: An incremental discourse parser

Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP)

#Kate Thompson, #Julie Hunter, Nicholas Asher, Akshay Chaturvedi

#COCOBOTS, #DISCUTER, #Language

2024

Discourse Structure for the Minecraft Corpus

2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

#Kate Thompson, #Julie Hunter, Nicholas Asher

#COCOBOTS, #DISCUTER, #Language

2024

Claire: Large language models for spontaneous French dialogue

31ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN)

#Julie Hunter, #Jérôme Louradour, Virgile Rennard, Ismaïl Harrando, Guokan Shang, #Jean-Pierre Lorré

#Language, #LLM4ALL

2023

The Claire French Dialogue Dataset

arXiv preprint arXiv:2311.16840

#Julie Hunter, #Jérôme Louradour, Virgile Rennard, Ismaïl Harrando, Guokan Shang, #Jean-Pierre Lorré

#Language, #LLM4ALL

2023

SubstanReview: the First Annotated Dataset for Analyzing Substantiation in Peer Reviews

Findings of Empirical Methods in Natural Language Processing (EMNLP)

Yanzhu Guo, Guokan Shang, #Virgile Rennard, Michalis Vazirgiannis, Chloé Clavel
