Speech De-identification with Deep Neural Networks
Abstract
Cloud-based speech services are powerful practical tools, but exposing speakers' voices to the Internet raises important privacy and legal concerns. We propose a deep neural network solution that removes personal characteristics from human speech by converting it to the voice of a Text-to-Speech (TTS) system before sending the utterance to the cloud. The network learns to transcode sequences of vocoder parameters, together with their delta and delta-delta features, from human speech to those of the TTS engine. We evaluated several TTS systems, vocoders, and audio alignment techniques. We measured the performance of our method by (i) comparing the results of speech recognition on the de-identified utterances with the original texts, (ii) computing the Mel-Cepstral Distortion between the aligned TTS and transcoded sequences, and (iii) querying human participants in A-not-B, 2AFC, and 6AFC tasks. Our approach achieves the level of performance required by diverse applications.
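As a point of reference for metric (ii), a minimal sketch of the commonly used per-frame Mel-Cepstral Distortion follows; the coefficient range and normalization here are assumptions, since the abstract does not state the exact variant used. With c_d the d-th mel-cepstral coefficient of an aligned TTS frame and \hat{c}_d the corresponding coefficient of the transcoded frame, the distortion over D coefficients is

\mathrm{MCD} = \frac{10}{\ln 10} \sqrt{ 2 \sum_{d=1}^{D} \left( c_d - \hat{c}_d \right)^2 },

typically averaged over all aligned frames.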