Down-sampled and Under-sampled Data sets in Feature Selective Validation (FSV)

Abstract

Feature Selective Validation (FSV) is a heuristic method for quantifying the (dis)similarity of two data sets. The computational burden of obtaining the FSV values may be unnecessarily high if data sets with large numbers of points are used. While this may not be an important issue per se, it matters for future developments in FSV such as real-time processing or multi-dimensional FSV. Coupled with the issue of data set size is the issue of data sets having ‘missing’ values. These may arise because of a practical difficulty, or because noise or other confounding factors make some data points unreliable. Both issues lead to the question “what is the effect on FSV quantification of reducing or removing data points from a comparison, i.e. down- or under-sampling data?” This paper uses three strategies to reduce or remove points from known data sets. It demonstrates, through a representative sample of 16 pairs of data sets, that FSV is robust to such changes provided a minimum data set size of approximately 200 points is maintained. It is also robust to up to approximately 10% ‘missing’ data, provided this does not result in a continuous region of missing data.
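The abstract does not say which three strategies the paper uses, so the following is only an illustrative sketch of the two kinds of data reduction it discusses, not the authors' procedure. The function names (downsample, drop_random, longest_gap) are hypothetical; the 200-point floor and the 10% missing fraction are the approximate limits quoted in the abstract.

```python
import random


def downsample(values, min_points=200):
    """Keep every k-th point, using the largest k that still leaves
    at least min_points points (assumed uniform decimation)."""
    n = len(values)
    if n <= min_points:
        return list(values)
    step = max(1, n // min_points)
    return list(values[::step])


def drop_random(values, fraction=0.10, seed=0):
    """Mark a random fraction of the points as missing (None)."""
    rng = random.Random(seed)
    n_drop = int(len(values) * fraction)
    missing = set(rng.sample(range(len(values)), n_drop))
    return [None if i in missing else v for i, v in enumerate(values)]


def longest_gap(values):
    """Length of the longest run of consecutive missing points."""
    longest = current = 0
    for v in values:
        current = current + 1 if v is None else 0
        longest = max(longest, current)
    return longest
```

A reduced data set could then be checked against the abstract's two conditions: that enough points remain after down-sampling, and that the missing points do not form one long continuous region (longest_gap gives a simple measure of that).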

Description

The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.

Keywords

computational electromagnetics; sampling methods; FSV; feature selective validation; data sensitivity; dataset size; datasets dissimilarity; downsampling; undersampling; Educational institutions; Interpolation; Robustness; Sensitivity; Standards; Transient analysis; Visualization

Citation

G. Zhang et al. (2014) Downsampled and Undersampled Datasets in Feature Selective Validation (FSV). IEEE Transactions on Electromagnetic Compatibility, 56 (4), pp. 817-824

Research Institute