Automated dating of the world’s language families based on lexical similarity

Authors
  • E.W. Holman
  • C.H. Brown
  • S. Wichmann
  • A. Müller
  • V. Velupillai
  • H. Hammarström
  • S. Sauppe
  • H. Jung
  • D. Bakker
  • P. Brown
  • O. Belyaev
  • M. Urban
  • R. Mailhammer
  • J.-M. List
  • D. Egorov
Publication date 2011
Journal Current Anthropology
Volume | Issue number 52 | 6
Pages (from-to) 841-875
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract
This paper describes a computerized alternative to glottochronology for estimating elapsed time since parent languages diverged into daughter languages. The method, developed by the Automated Similarity Judgment Program (ASJP) consortium, is different from glottochronology in four major respects: (1) it is automated and thus is more objective, (2) it applies a uniform analytical approach to a single database of worldwide languages, (3) it is based on lexical similarity as determined from Levenshtein (edit) distances rather than on cognate percentages, and (4) it provides a formula for date calculation that mathematically recognizes the lexical heterogeneity of individual languages, including parent languages just before their breakup into daughter languages. Automated judgments of lexical similarity for groups of related languages are calibrated with historical, epigraphic, and archaeological divergence dates for 52 language groups. The discrepancies between estimated and calibration dates are found to be on average 29% as large as the estimated dates themselves, a figure that does not differ significantly among language families. As a resource for further research that may require dates of known level of accuracy, we offer a list of ASJP time depths for nearly all the world’s recognized language families and for many subfamilies.

Document type Article
Note With suppl.: ASJP dates for Ethnologue groups
Language English
Published at https://doi.org/10.1086/662127
Permalink to this page
Back