Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora Using WebMAUS

J. Strunk; F. Schiel; F. Seifart

Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora Using WebMAUS

Authors	J. Strunk F. Schiel F. Seifart
Publication date	2014
Host editors	N. Calzolari K. Choukri T. Declerck H. Loftsson B. Maegaard J. Mariani A. Moreno J. Odijk S. Piperidis
Book title	Proceedings of the Ninth International Conference on Language Resources and Evaluation
Book subtitle	May 26-31, 2014, Reykjavik, Iceland : proceedings
ISBN	9782951740884
Event	9th International Conference on Language Resources and Evaluation, LREC 2014
Pages (from-to)	3940-3947
Publisher	Paris: European Language Resources Association (ELRA)
Organisations	Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract	Language documentation projects supported by recent funding intiatives have created a large number of multimedia corpora of typologically diverse languages. Most of these corpora provide a manual alignment of transcription and audio data at the level of larger units, such as sentences or intonation units. Their usefulness both for corpus-linguistic and psycholinguistic research and for the development of tools and teaching materials could, however, be increased by achieving a more fine-grained alignment of transcription and audio at the word or even phoneme level. Since most language documentation corpora contain data on small languages, there usually do not exist any speech recognizers or acoustic models specifically trained on these languages. We therefore investigate the feasibility of untrained forced alignment for such corpora. We report on an evaluation of the tool (Web)MAUS (Kisler et al., 2012) on several language documentation corpora and discuss practical issues in the application of forced alignment. Our evaluation shows that (Web)MAUS with its existing acoustic models combined with simple grapheme-to-phoneme conversion can be successfully used for word-level forced alignment of a diverse set of languages without additional training, especially if a manual prealignment of larger annotation units is already avaible.
Document type	Conference contribution
Language	English
Published at	http://www.lrec-conf.org/proceedings/lrec2014/pdf/1176_Paper.pdf (Final published version)
Other links	http://www.lrec-conf.org/proceedings/lrec2014/index.html
Downloads	1176_Paper (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora Using WebMAUS