Influence of speaker de-identification in depression detection

TitleInfluence of speaker de-identification in depression detection
Publication TypeJournal Article
Year of Publication2017
AuthorsLópez Otero, P, Magariños, C, Docío Fernández, L, Rodríguez Banga, E, Erro, D, García Mateo, C
JournalIET Signal Processing
Date Published07/2017
ISSN1751-9683
Abstract Depression is a common mental disorder that is usually addressed by outpatient treatments that favour patients’ inclusion in the society. This raises the need for tools to remotely monitor the emotional state of the patients, which can be carried out via telephone or the Internet using speech processing approaches. However, these strategies lead to privacy concerns caused by the transmission of the patients’ speech and its subsequent storage in servers. The use of speech de- identification to protect the privacy of these patients seems straightforward, but the influence of this procedure in the manifestation of the disease in the patients’ speech has not been addressed yet. Hence, this study evaluates the performance of an automatic depression level estimation system when dealing with original and de-identified speech, in order to analyse the influence of the de-identification procedure in the detection of depression. Two de-identification approaches based on voice transformation via frequency warping and amplitude scaling are assessed, which can be applied to any speaker without additional training. Experiments carried out in the framework of the audio/visual emotion challenge 2014 show that the proposed de-identification approaches achieve promising de-identification results at the expense of a slight degradation of depression detection.  
DOI10.1049/iet-spr.2016.0731
ProjectMultimedia and Multilingual Human-Centered Content Discovery
Citation Key612