publications

Here is list of my main scientific publications. a more exhaustive list can be found on my Google Scholar profile.

2025

  1. Video understanding
    Frame Sampling Strategies Matter: A Benchmark for small vision language models
    Marija Brkic, Anas Filali Razzouki, Yannis Tevissen, Khalil Guetari, and Mounim A El Yacoubi
    arXiv preprint arXiv:2509.14769, 2025
  2. Patent
    Computer-based platforms and methods for efficient AI-based digital video shot indexing
    Frédéric Petitpont, Philippe Petitpont, Yannis Tevissen, and Khalil Guetari
    Apr 2025
    US Patent 12,288,377

2024

  1. Patent
    Systems and methods for AI generation of image captions enriched with multiple AI modalities
    Frédéric Petitpont, Yannis Tevissen, and Khalil Guetari
    Nov 2024
    US Patent 12,148,233
  2. Multimodal RAG
    Towards Retrieval Augmented Generation over Large Video Libraries
    Yannis Tevissen, Khalil Guetari, and Frédéric Petitpont
    In Proceedings of HSI 2024, Nov 2024
  3. AI Fairness
    Disability Representations: Finding Biases in Automatic Image Generation
    Yannis Tevissen
    In Workshop AVA: Accessibility, Vision, and Autonomy Meet, Nov 2024
  4. Video understanding
    Multimodal Chaptering for Long-Form TV Newscast Video
    Khalil Guetari, Yannis Tevissen, and Frédéric Petitpont
    Nov 2024
  5. VLM
    Inserting Faces inside Captions: Image Captioning with Attention Guided Merging
    Yannis Tevissen, Khalil Guetari, Marine Tassel, Erwan Kerleroux, and Frédéric Petitpont
    arXiv preprint arXiv:2405.02305, Nov 2024
  6. Edge AI
    Privacy Preserving Personal Assistant with On-Device Diarization and Spoken Dialogue System for Home and Beyond
    Gérard Chollet, Hugues Sansen, Yannis Tevissen, Jérôme Boudy, Mossaab Hariz, Christophe Lohr, and Fathy Yassa
    Proceedings of IHIET 2024, Nov 2024

2023

  1. Speaker Diarization
    Diarisation multimodale: vers des modèles robustes et justes en contexte réel
    Yannis Tevissen
    Institut Polytechnique de Paris, Nov 2023
  2. Speaker Diarization
    Détection d’activité vocale Multi-flux pour la Diarisation du locuteur
    Yannis Tevissen, Jérôme Boudy, Gérard Chollet, and Frédéric Petitpont
    Proceedings of GRETSI 2023, Nov 2023
  3. Speaker Diarization
    Home monitoring for frailty detection through sound and speaker diarization analysis
    Yannis Tevissen, Dan Istrate, Vincent Zalc, Jérôme Boudy, Gérard Chollet, Frédéric Petitpont, and Sami Boutamine
    In JETSAN 2023, Nov 2023
  4. Fairness
    Towards measuring and scoring speaker diarization fairness
    Yannis Tevissen, Jérôme Boudy, Gérard Chollet, and Frédéric Petitpont
    arXiv preprint arXiv:2302.09991, Nov 2023

2022

  1. Speech Processing
    Multi-stream voice activity detection for robust speaker diarization
    Yannis Tevissen, Jérôme Boudy, and Gérard Chollet
    In GDR ISIS 2022: Information, Signal, Image et ViSion: Traitement du signal pour la voix, Nov 2022
  2. Speech Processing
    The Newsbridge-Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description
    Yannis Tevissen, Jérôme Boudy, and Frédéric Petitpont
    VoxCeleb Speaker Recognition Challenge 2022 Tack 4, Nov 2022