Similarity visualization for the grouping of forensic speech recordings

Open Access
Authors
Publication date 2008
Host editors
  • S.N. Srihari
  • K. Franke
Book title Computational Forensics
Book subtitle Second International Workshop, IWCF 2008, Washington, DC, USA, August 7-8, 2008 : proceedings
ISBN
  • 9783540853022
ISBN (electronic)
  • 9783540853039
Series Lecture Notes in Computer Science
Event Second International Workshop on Computational Forensics (IWCF 2008), Washington, DC, USA
Pages (from-to) 169-180
Publisher Berlin: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In a forensic phone wiretapping investigation, a major problem is to get the full picture of the speakers involved. Typically, the wiretapped speech recordings are grouped using a clustering tool. The main disadvantage of such an approach is that in a bootstrapped scenario grouping errors accumulate. In this paper, we propose a visual approach to find similar speech recordings that probably stem from the same speaker. We first model the speech recordings and define suitable similarity measures between recordings. Then, through an approximate 2-D visualization of the inter-speech, similarities the investigator can identify clear groups of recordings and recordings that are harder to differentiate. We did extensive experiments on phone data of 50 speakers with 2 recordings per speaker. We tested quality of the 2-D visualization in relation to original high dimensional similarities. It turned out that for the original high dimensional similarity measure the nearest recording is almost always the one from the same speaker. In the 2-D visualization, we achieved that on average for all speech recordings a recording of the same speaker is among the 10 nearest recordings.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-540-85303-9_16
Downloads
295465.pdf (Submitted manuscript)
Permalink to this page
Back