GTM-UVigo System for Multimodal Person Discovery in Broadcast TV Task at MediaEval 2016

TitleGTM-UVigo System for Multimodal Person Discovery in Broadcast TV Task at MediaEval 2016
Publication TypeConference Proceedings
Year of Publication2016
AuthorsLópez Otero, P, Docío Fernández, L, García Mateo, C
Conference NameMediaEval 2016
AbstractIn this paper, we present the system developed by GTM-UVigo team for the Multimedia Person Discovery in Broadcast TV task at MediaEval 2016. The proposed approach consists in a novel strategy for person discovery which is not based on speaker and face diarisation as in previous works. In this system, the task is approached as a person recognition problem: there is an enrolment stage, where the voice and face of each discovered person are detected and, for each shot, the most suitable voice and face are assigned using the i-vector paradigm. These two biometric modalities are combined by decision fusion.
ProjectMultimedia and Multilingual Human-Centered Content Discovery
Citation Key601