Untrained Forced Alignment of Transcriptions and Audio for Language Documentation Corpora Using WebMAUS

Open Access
Authors
Publication date 2014
Host editors
  • N. Calzolari
  • K. Choukri
  • T. Declerck
  • H. Loftsson
  • B. Maegaard
  • J. Mariani
  • A. Moreno
  • J. Odijk
  • S. Piperidis
Book title Proceedings of the Ninth International Conference on Language Resources and Evaluation
Book subtitle May 26-31, 2014, Reykjavik, Iceland : proceedings
ISBN
  • 9782951740884
Event 9th International Conference on Language Resources and Evaluation, LREC 2014
Pages (from-to) 3940-3947
Publisher Paris: European Language Resources Association (ELRA)
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract
Language documentation projects supported by recent funding intiatives have created a large number of multimedia corpora of typologically diverse languages. Most of these corpora provide a manual alignment of transcription and audio data at the level of larger units, such as sentences or intonation units. Their usefulness both for corpus-linguistic and psycholinguistic research and for the
development of tools and teaching materials could, however, be increased by achieving a more fine-grained alignment of transcription and audio at the word or even phoneme level. Since most language documentation corpora contain data on small languages, there usually do not exist any speech recognizers or acoustic models specifically trained on these languages. We therefore investigate the
feasibility of untrained forced alignment for such corpora. We report on an evaluation of the tool (Web)MAUS (Kisler et al., 2012) on several language documentation corpora and discuss practical issues in the application of forced alignment. Our evaluation shows that (Web)MAUS with its existing acoustic models combined with simple grapheme-to-phoneme conversion can be successfully used for word-level forced alignment of a diverse set of languages without additional training, especially if a manual prealignment of larger annotation units is already avaible.
Document type Conference contribution
Language English
Published at http://www.lrec-conf.org/proceedings/lrec2014/pdf/1176_Paper.pdf
Other links http://www.lrec-conf.org/proceedings/lrec2014/index.html
Downloads
1176_Paper (Final published version)
Permalink to this page
Back