publications

Here is list of my main scientific publications. a more exhaustive list can be found on my Google Scholar profile.

2024

  1. Multimodal RAG
    Towards Retrieval Augmented Generation over Large Video Libraries
    Yannis Tevissen, Khalil Guetari, and Frédéric Petitpont
    In Proceedings of HSI 2024, 2024
  2. AI Fairness
    Disability Representations: Finding Biases in Automatic Image Generation
    Yannis Tevissen
    In Workshop AVA: Accessibility, Vision, and Autonomy Meet, 2024
  3. Video understanding
    Multimodal Chaptering for Long-Form TV Newscast Video
    Khalil Guetari, Yannis Tevissen, and Frédéric Petitpont
    2024
  4. VLM
    Inserting Faces inside Captions: Image Captioning with Attention Guided Merging
    Yannis Tevissen, Khalil Guetari, Marine Tassel, Erwan Kerleroux, and Frédéric Petitpont
    arXiv preprint arXiv:2405.02305, 2024
  5. Edge AI
    Privacy Preserving Personal Assistant with On-Device Diarization and Spoken Dialogue System for Home and Beyond
    Gérard Chollet, Hugues Sansen, Yannis Tevissen, Jérôme Boudy, Mossaab Hariz, Christophe Lohr, and Fathy Yassa
    Proceedings of IHIET 2024, 2024

2023

  1. Speaker Diarization
    Diarisation multimodale: vers des modèles robustes et justes en contexte réel
    Yannis Tevissen
    Institut Polytechnique de Paris, 2023
  2. Speaker Diarization
    Détection d’activité vocale Multi-flux pour la Diarisation du locuteur
    Yannis Tevissen, Jérôme Boudy, Gérard Chollet, and Frédéric Petitpont
    Proceedings of GRETSI 2023, 2023
  3. Speaker Diarization
    Home monitoring for frailty detection through sound and speaker diarization analysis
    Yannis Tevissen, Dan Istrate, Vincent Zalc, Jérôme Boudy, Gérard Chollet, Frédéric Petitpont, and Sami Boutamine
    In JETSAN 2023, 2023
  4. Fairness
    Towards measuring and scoring speaker diarization fairness
    Yannis Tevissen, Jérôme Boudy, Gérard Chollet, and Frédéric Petitpont
    arXiv preprint arXiv:2302.09991, 2023

2022

  1. Speech Processing
    Multi-stream voice activity detection for robust speaker diarization
    Yannis Tevissen, Jérôme Boudy, and Gérard Chollet
    In GDR ISIS 2022: Information, Signal, Image et ViSion: Traitement du signal pour la voix, 2022
  2. Speech Processing
    The Newsbridge-Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description
    Yannis Tevissen, Jérôme Boudy, and Frédéric Petitpont
    VoxCeleb Speaker Recognition Challenge 2022 Tack 4, 2022