Frequency Ratio: a method for dealing with missing values within nearest neighbour search
| Authors |
|
|---|---|
| Publication date | 2015 |
| Journal | Journal of Systems Integration |
| Volume | Issue number | 6 | 3 |
| Pages (from-to) | 3-14 |
| Organisations |
|
| Abstract |
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within nearest neighbour search. We test the FR method on known medical datasets from the UCI machine learning repository. We compare the accuracy of the FR method with five commonly used methods (three "imputation" and two "bypassing" methods) for dealing with values that are "missing completely at random" (MCAR) for the purpose of classification. We discovered that in most cases, the FR method outperforms the other methods. We conclude that the FR method is a strong addition to the commonly used methods for dealing with missing values within the nearest neighbour method.
|
| Document type | Article |
| Language | English |
| Published at | https://doi.org/10.20470/jsi.v6i3.233 |
| Downloads |
502264
(Final published version)
|
| Permalink to this page | |