Manipulating tracheoesophageal speech
| Authors | |
|---|---|
| Publication date | 2010 |
| Journal | Interspeech |
| Event | Interspeech 2010: Spoken Language Processing for All Ages, Health Conditions, Native Languages and Environments |
| Volume | Issue number | 11 |
| Pages (from-to) | 274-277 |
| Organisations |
|
| Abstract |
Speech therapy aiming at improving voice quality and speech intelligibility is often hampered by the lack of knowledge of the underlying deficits. One way to help speech therapists treating patients would be to supply synthetic benchmarks for pathological
speech. These can be used to train therapists and evaluate and interpret automatic speech recognizers used for diagnosing pathological speech. Moreover, synthetic pathological speech can also be used to make expected therapy aims audible before treatment. In a listening experiment testing perceived intelligibility, three types of manipulations of tracheoesophageal speech were evaluated by experienced speech therapists. It was found that modeling the intensity contour of the voice source signal improved speech quality over plain analysis-synthesis. Replacing the voicing source with fully synthetic source periods decreased the perceived intelligibility markedly. Making the source fully periodic with a regular pitch had no effect on perceived intelligibility. Low quality speech benefitted more from manipulations, or deteriorated less, than high quality speech. |
| Document type | Article |
| Note | Proceedings title: Proceedings of the 11th Annual Conference of the International Speech Communication Association: 26-30 September, 2010, Makuhari, Chiba, Japan [cd-rom] Publisher: ISCA Place of publication: Makuhari |
| Language | English |
| Published at | http://www.fon.hum.uva.nl/rob/Publications/RvS_TEsourceIS2010.pdf |
| Downloads |
RvS_TEsourceIS2010.pdf
(Accepted author manuscript)
|
| Permalink to this page | |