TY - JOUR
T1 - Cost-aware Feature Selection for IoT Device Classification
AU - Chakraborty, Biswadeep
AU - Divakaran, Dinil Mon
AU - Nevat, Ido
AU - Peters, Gareth W.
AU - Gurusamy, Mohan
N1 - Funding Information:
Manuscript received July 24, 2020; revised November 25, 2020; accepted December 30, 2020. Date of publication January 13, 2021; date of current version July 7, 2021. This work was supported by the National Research Foundation, Prime Minister’s Office, Singapore, under its Corporate Laboratory@University Scheme, National University of Singapore, and Singapore Telecommunications Ltd. (Corresponding author: Dinil Mon Divakaran.) Biswadeep Chakraborty is with the Department of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332 USA (e-mail: [email protected]).
Publisher Copyright:
© 2014 IEEE.
PY - 2021/7/15
Y1 - 2021/7/15
N2 - The classification of Internet-of-Things (IoT) devices into different types is of paramount importance, from multiple perspectives, including security and privacy aspects. Recent works have explored machine learning techniques for fingerprinting (or classifying) IoT devices, with promising results. However, the existing works have assumed that the features used for building the machine learning models are readily available or can be easily extracted from the network traffic; in other words, they do not consider the costs associated with feature extraction. In this work, we take a more realistic approach, and argue that feature extraction has a cost, and the costs are different for different features. We also take a step forward from the current practice of considering the misclassification loss as a binary value, and make a case for different losses based on the misclassification performance. Thereby, and more importantly, we introduce the notion of risk for IoT device classification. We define and formulate the problem of cost-aware IoT device classification. This being a combinatorial optimization problem, we develop a novel algorithm to solve it in a fast and effective way using the cross-entropy (CE)-based stochastic optimization technique. Using traffic of real devices, we demonstrate the capability of the CE-based algorithm in selecting features with minimal risk of misclassification while keeping the cost for feature extraction within a specified limit.
AB - The classification of Internet-of-Things (IoT) devices into different types is of paramount importance, from multiple perspectives, including security and privacy aspects. Recent works have explored machine learning techniques for fingerprinting (or classifying) IoT devices, with promising results. However, the existing works have assumed that the features used for building the machine learning models are readily available or can be easily extracted from the network traffic; in other words, they do not consider the costs associated with feature extraction. In this work, we take a more realistic approach, and argue that feature extraction has a cost, and the costs are different for different features. We also take a step forward from the current practice of considering the misclassification loss as a binary value, and make a case for different losses based on the misclassification performance. Thereby, and more importantly, we introduce the notion of risk for IoT device classification. We define and formulate the problem of cost-aware IoT device classification. This being a combinatorial optimization problem, we develop a novel algorithm to solve it in a fast and effective way using the cross-entropy (CE)-based stochastic optimization technique. Using traffic of real devices, we demonstrate the capability of the CE-based algorithm in selecting features with minimal risk of misclassification while keeping the cost for feature extraction within a specified limit.
KW - Identification
KW - Internet-of-Things (IoT)
KW - machine learning
KW - network
KW - optimization
UR - http://www.scopus.com/inward/record.url?scp=85099591349&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2021.3051480
DO - 10.1109/JIOT.2021.3051480
M3 - Article
AN - SCOPUS:85099591349
SN - 2327-4662
VL - 8
SP - 11052
EP - 11064
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 14
ER -