Garbage Modeling for On-device Speech Recognition

C. Van Gysel; L. Velikovich; I. McGraw; F. Beaufays

doi:https://doi.org/10.21437/Interspeech.2015-480

Garbage Modeling for On-device Speech Recognition

Authors	C. Van Gysel L. Velikovich I. McGraw F. Beaufays
Publication date	2015
Journal	Interspeech
Event	16th Annual Conference of the International Speech Communication Association (Interspeech 2015)
Volume \| Issue number	16
Pages (from-to)	2127-2131
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	User interactions with mobile devices increasingly depend on voice as a primary input modality. Due to the disadvantages of sending audio across potentially spotty network connections for speech recognition, in recent years there has been growing attention to performing recognition on-device. The limited computational resources, however, typically require additional model constraints. In this work, we explore the task of on-device utterance verification, wherein the recognizer must transcribe an utterance if it is in a target set or reject it as being out of domain. We present a data-driven methodology for mining tens of thousands of target phrases from an existing corpus. We then compare two common garbage-modeling approaches to utterance verification: a sub-word rejection model and a white-listed n-gram model. We examine a deficiency of the sub-word modeling approach and introduce a novel modification that makes use of common prefixes between targeted phrases and non-targeted phrases. We show good performance in the trade-off between recall and word error rate using both the prefix and white-listed n-gram approaches. Finally, we evaluate the prefix-based approach in a hybrid setting where rejected instances are sent to a server-side recognizer.
Document type	Article
Note	Proceedings title: Interspeech 2015: 16th Annual Conference of the International Speech Communication Association: Dresden, Germany, September 6-10, 2015 Publisher: ISCA
Language	English
Published at	https://doi.org/10.21437/Interspeech.2015-480 (Final published version)
Downloads	i15_2127 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Garbage Modeling for On-device Speech Recognition