Principles of exploratory data analysis in problem solving: what can we learn from a well-known case?

Authors
Publication date 2009
Journal Quality Engineering
Volume | Issue number 21 | 4
Pages (from-to) 366-375
Organisations
  • Faculty of Economics and Business (FEB) - Amsterdam School of Economics Research Institute (ASE-RI)
Abstract
Exploratory data analysis (EDA) is sometimes suggested as a hypothesis identification approach. It is often used as such in problem solving and consists of the analysis of observational data, often collected without well-defined hypotheses, with the purpose of finding clues that could inspire ideas and hypotheses. This article seeks to uncover some of the main principles of EDA in problem solving. The article discusses and explains EDA's main steps: (1) Display the data; (2) identify salient features; (3) interpret salient features. The empiricist notion of EDA, which pervades many textbook accounts of EDA, is criticized and contrasted to an account that emphasizes the role of mental models in hypothesis generation. The framework has some implications for the limitations of EDA. It also sheds light on the role of the statistician compared to the role of the context expert. The article argues that in teaching EDA the emphasis for statistical data analysis should be balanced with teaching students to theorize and be inquisitive. Throughout the article, ideas are illustrated by the well-known case of John Snow's studies of the transmission mechanism of cholera.
Document type Article
Note http://dx.doi.org/10.1080/08982110903188276
Language English
Published at https://doi.org/10.1080/08982110903188276
Published at http://pdfserve.informaworld.com/688923_751317769_914962135.pdf
Permalink to this page
Back