Manipulating tracheoesophageal speech

Open Access
Authors
Publication date 2010
Journal Interspeech
Event Interspeech 2010: Spoken Language Processing for All Ages, Health Conditions, Native Languages and Environments
Volume | Issue number 11
Pages (from-to) 274-277
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract
Speech therapy aiming at improving voice quality and speech intelligibility is often hampered by the lack of knowledge of the underlying deficits. One way to help speech therapists treating patients would be to supply synthetic benchmarks for pathological
speech. These can be used to train therapists and evaluate and interpret automatic speech recognizers used for diagnosing pathological speech. Moreover, synthetic pathological speech can also be used to make expected therapy aims audible before
treatment. In a listening experiment testing perceived intelligibility, three types of manipulations of tracheoesophageal speech were evaluated by experienced speech therapists. It was found that modeling the intensity contour of the voice source signal improved speech quality over plain analysis-synthesis. Replacing the voicing source with fully synthetic source periods decreased the perceived intelligibility markedly. Making the
source fully periodic with a regular pitch had no effect on perceived intelligibility. Low quality speech benefitted more from manipulations, or deteriorated less, than high quality speech.
Document type Article
Note Proceedings title: Proceedings of the 11th Annual Conference of the International Speech Communication Association: 26-30 September, 2010, Makuhari, Chiba, Japan [cd-rom] Publisher: ISCA Place of publication: Makuhari
Language English
Published at http://www.fon.hum.uva.nl/rob/Publications/RvS_TEsourceIS2010.pdf
Downloads
RvS_TEsourceIS2010.pdf (Accepted author manuscript)
Permalink to this page
Back