Curating Datasets to Enhance Spyware Classification

Mousumi Ahmed Mimi, Hu Ng, Timothy Tzen Vun Yap

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Current methods for spyware classification lack effectiveness as well-structured datasets are typically absent, especially those with directionality properties in their set of features. In this particular research work, the efficacy of directionality properties for classification is explored, through engineered features from those on existing datasets. This study curates two datasets, Dataset A which includes features extracted from only single directional packet flows and Dataset B which includes those from bi-directional packet flows. Classification with these features is performed with selected classifiers, where SVM obtained the highest accuracy with 99.88% for Dataset A, while the highest accuracy went to RF, DT and XGBoost for Dataset B with 99.24%. Comparing these results with those from existing research work, the directional properties in these engineered features are able to provide improvements in terms of accuracy in classifying these spywares.

Original languageEnglish
Pages (from-to)1-15
Number of pages15
JournalJordanian Journal of Computers and Information Technology
Volume11
Issue number1
DOIs
Publication statusPublished - Mar 2025

Keywords

  • Datasets curation
  • Feature engineering
  • Packet analysis
  • Spyware classification

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Curating Datasets to Enhance Spyware Classification'. Together they form a unique fingerprint.

Cite this