Dealing with Missing Data in the Smart Buildings using Innovative Imputation Techniques

Abstract

Data quality plays a crucial role in the context of smart buildings. Meanwhile, missing data is relatively common in acquired datasets from sensors within the smart buildings. Poor data could result in a big bias in forecasting, control and operational services. Despite the common techniques to handle missing data, it is essential to systematically select the most appropriate approach for such missing values. This paper aims to focus on the lift systems as one of the essential parts in the smart buildings by exploring the most appropriate data imputation methods to handle missing data and to provide its service and allow a better understanding of patterns to issue the correct control actions based on forecasted models. The imputed data is not only investigated statistically but also modelled through machine learning algorithm to explore the impact of selecting inappropriate imputation techniques. Seven imputation techniques deployed on datasets with three level of missing values including 10%, 20% and 30% and the performance of methods examined through the normalized root mean square error (NRMSE) approach. In addition, the interaction between imputation techniques and a machine learning algorithm, namely random forest were examined. Findings from this paper can be employed in identifying an appropriate imputation technique not only within the lift datasets, but smart building context.

Description

The Publisher's final version can be found by following the DOI link.

Keywords

Imputation, KNN, smart buildings, lift, missing values

Citation

Pazhoohesh, M., Javadi, M.S., Gheisari, M., Aziz, S. and Villa, R. (2021) Dealing with Missing Data in the Smart Buildings using Innovative Imputation Techniques. IECON 2021–47th Annual Conference of the IEEE Industrial Electronics Society, October 2021.

Rights

Research Institute