LOF weighted KNN regression ensemble and its application to a die manufacturing company


Öngelen G., İNKAYA T.

Sadhana - Academy Proceedings in Engineering Sciences, cilt.48, sa.4, 2023 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 48 Sayı: 4
  • Basım Tarihi: 2023
  • Doi Numarası: 10.1007/s12046-023-02283-0
  • Dergi Adı: Sadhana - Academy Proceedings in Engineering Sciences
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Communication Abstracts, Compendex, INSPEC, Metadex, zbMATH, Civil Engineering Abstracts
  • Anahtar Kelimeler: Bootstrap aggregation, Ensemble learning, Local outlier factor, Manufacturing, Prediction, Weighted KNN
  • Bursa Uludağ Üniversitesi Adresli: Evet

Özet

K-nearest neighbor (KNN) algorithm is a widely used machine learning technique for prediction problems due its simplicity, flexibility and interpretability. When predicting the output variable of a data point, it basically averages the output values of its k closest neighbors. However, the impact of the neighboring points on the estimation may differ. Even though there are weighted versions of KNN, the effect of outliers and density differences within the neighborhoods are not considered. In order to fill this gap, we propose a novel weighting scheme for KNN regression based on local outlier factor (LOF). In particular, we combine the inverse of the Euclidean distance and LOF value so that the weights of the neighbors are determined using not only distance and connectivity but also outlier and density information around the neighborhood. Also, bootstrap aggregation is used to leverage the stability and accuracy of the LOF weighted KNN regression. Using real-life benchmark datasets, extensive experiments and statistical tests were performed for evaluating the performance of the proposed approach. The experimental results indicate the superior performance of the proposed approach in small neighborhood sizes. Moreover, the proposed approach was implemented in a make-to-order manufacturing company, and die production times were estimated successfully.