Machine Learning Models Stacking in the Malicious Links Detecting

Authors

  • Yevhenii Khukalenko National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Ukraine
  • Iryna Stopochkina National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute", Ukraine
  • Mykola Ilin

DOI:

https://doi.org/10.20535/tacs.2664-29132023.1.287752

Abstract

The performance of various classifiers on address and network feature groups is analyzed. A new classification model is proposed that stacks three base models: kNN, XGBoost, and Transformer. Logistic Regression was experimentally determined to be the best meta-model for stacking; it improved on the result of the best available model by 3%. On the data set used, the experiments confirmed the hypothesis that stacking a larger number of weaker models has an advantage over stacking a smaller number of stronger models: regardless of the choice of stacking meta-algorithm, a stack of three models outperformed a stack of two.
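
The stacking scheme described in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. The data is synthetic, and the three base learners are hypothetical stand-ins (tiny kNN classifiers restricted to different feature columns) used in place of the kNN, XGBoost, and Transformer models named above. The fold count, learning rate, and epoch count are likewise assumptions. What the sketch does show is the general technique: base models produce out-of-fold probabilities, and a logistic regression meta-model is trained on those probabilities.

```python
import math
import random

def make_data(n, rng):
    """Synthetic toy data (assumption): label 1 when a noisy feature sum is positive."""
    X, y = [], []
    for _ in range(n):
        x = [rng.gauss(0, 1), rng.gauss(0, 1)]
        X.append(x)
        y.append(1 if x[0] + x[1] + rng.gauss(0, 0.5) > 0 else 0)
    return X, y

def fit_knn(X, y, cols, k=5):
    """Tiny kNN restricted to feature columns `cols`; returns x -> P(label = 1)."""
    def proba(x):
        nearest = sorted(
            (sum((row[c] - x[c]) ** 2 for c in cols), label)
            for row, label in zip(X, y)
        )[:k]
        return sum(label for _, label in nearest) / k
    return proba

def out_of_fold(X, y, fitters, n_folds=5):
    """Meta-features: each base model scores a row only while that row is held out."""
    meta = [[0.0] * len(fitters) for _ in X]
    for fold in range(n_folds):
        train = [i for i in range(len(X)) if i % n_folds != fold]
        held = [i for i in range(len(X)) if i % n_folds == fold]
        Xt, yt = [X[i] for i in train], [y[i] for i in train]
        for j, fit in enumerate(fitters):
            model = fit(Xt, yt)
            for i in held:
                meta[i][j] = model(X[i])
    return meta

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logreg(Z, y, lr=0.5, epochs=500):
    """Logistic regression meta-model trained by batch gradient descent."""
    w, b = [0.0] * len(Z[0]), 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * len(w), 0.0
        for z, t in zip(Z, y):
            err = sigmoid(b + sum(wi * zi for wi, zi in zip(w, z))) - t
            gb += err
            for j, zj in enumerate(z):
                gw[j] += err * zj
        b -= lr * gb / len(Z)
        w = [wi - lr * g / len(Z) for wi, g in zip(w, gw)]
    return lambda z: sigmoid(b + sum(wi * zi for wi, zi in zip(w, z)))

rng = random.Random(0)
X_train, y_train = make_data(200, rng)
X_test, y_test = make_data(100, rng)

# Hypothetical stand-ins for the three base models named in the abstract.
fitters = [
    lambda X, y: fit_knn(X, y, cols=(0,)),
    lambda X, y: fit_knn(X, y, cols=(1,)),
    lambda X, y: fit_knn(X, y, cols=(0, 1)),
]

# Level 1: the meta-model learns from out-of-fold base predictions.
meta_train = out_of_fold(X_train, y_train, fitters)
meta_model = fit_logreg(meta_train, y_train)

# Level 0: base models are refit on all training data for inference.
base_models = [fit(X_train, y_train) for fit in fitters]

def stacked_proba(x):
    return meta_model([m(x) for m in base_models])

def accuracy(proba, X, y):
    return sum((proba(x) >= 0.5) == bool(t) for x, t in zip(X, y)) / len(y)

base_accuracies = [accuracy(m, X_test, y_test) for m in base_models]
stacked_accuracy = accuracy(stacked_proba, X_test, y_test)
print("base:", base_accuracies, "stacked:", stacked_accuracy)
```

Training the meta-model on out-of-fold predictions keeps it from seeing base-model outputs for rows those models were fitted on, which would otherwise overstate their reliability. Dropping one entry from `fitters` gives a two-model stack for comparison; the numbers printed on this toy data say nothing about the results reported in the paper.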

Published

2023-09-19

Section

Intelligent Data analysis methods in cybersecurity