Yannis Tevissen
Head of Science for Moments Lab. AI scientist.
I currently work as Head of Science for Moments Lab, where I lead research initiatives on video understanding. More information is available on the Moments Lab Research website.
Previously, I completed a PhD in artificial intelligence at the Institut Polytechnique de Paris, advised by Jérôme Boudy and Gérard Chollet.
My current research interests include video understanding, multimodal agents, and AI fairness.
Feel free to reach out to me to discuss any of these topics or to collaborate on research projects.
Recent news
Dec 02, 2024 | I was invited to speak at the OECD on a panel about AI and disability. I shared my concerns and recommendations for more inclusive AI development. |
---|---|
Jul 10, 2024 | I was awarded the Best Presentation Paper at the HSI 2024 conference for my talk about our recent paper: Towards Retrieval Augmented Generation over Large Video Libraries. |
Jun 26, 2024 | I was interviewed by TVBEurope about the future of video understanding together with Olivier Penin from TF1. Check out the interview. |
Apr 01, 2024 | We launched an AI Research Program with Moments Lab, focusing on video understanding. |
Mar 28, 2024 | I was invited to give a TEDx talk. I talked about disability biases in society and how they propagate in AI systems. You can watch it here! |
Science publications
- [Multimodal RAG] Towards Retrieval Augmented Generation over Large Video Libraries. In Proceedings of HSI 2024, 2024
- [AI Fairness] Disability Representations: Finding Biases in Automatic Image Generation. In Workshop AVA: Accessibility, Vision, and Autonomy Meet, 2024
- [Video understanding] Multimodal Chaptering for Long-Form TV Newscast Video. 2024
- [VLM] Inserting Faces inside Captions: Image Captioning with Attention Guided Merging. arXiv preprint arXiv:2405.02305, 2024
- [Edge AI] Privacy Preserving Personal Assistant with On-Device Diarization and Spoken Dialogue System for Home and Beyond. Proceedings of IHIET 2024, 2024
- [Speaker Diarization] Diarisation multimodale: vers des modèles robustes et justes en contexte réel. Institut Polytechnique de Paris, 2023
- [Speaker Diarization] Détection d’activité vocale Multi-flux pour la Diarisation du locuteur. Proceedings of GRETSI 2023, 2023
- [Speaker Diarization] Home monitoring for frailty detection through sound and speaker diarization analysis. In JETSAN 2023, 2023
- [Fairness] Towards measuring and scoring speaker diarization fairness. arXiv preprint arXiv:2302.09991, 2023
- [Speech Processing] Multi-stream voice activity detection for robust speaker diarization. In GDR ISIS 2022: Information, Signal, Image et ViSion: Traitement du signal pour la voix, 2022
- [Speech Processing] The Newsbridge-Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description. VoxCeleb Speaker Recognition Challenge 2022 Track 4, 2022