TY - GEN
T1 - Enhanced Affinity Propagation Clustering on Heterogeneous Information Network
AU - Rajan, Debinal
AU - Jiang, Shouyong
AU - Yi, Dewei
AU - Pang, Wei
AU - Coghill, George MacLeod
PY - 2024/5/19
Y1 - 2024/5/19
N2 - The real world data sets with multi-typed objects and multi-typed relations can be structured as heterogeneous information networks (HIN). Clustering is one of the most significant process in HIN since it provides useful insights of hidden patterns of objects and their complex relation structure. However, grouping multi-relational target objects without losing their rich semantics and unknown number of clusters is a challenging task. Hence, we use the meta-path concepts to compute the similarity matrix between each pair of objects by exploring the different relations to preserve their semantics. Subsequently, we employ the Affinity Propagation (AP) clustering approach that can automatically generate clusters and corresponding exemplars (cluster center) for each object based on the similarity matrix. The basic motivation of using AP algorithm is its effectiveness, scalability and the speed on detecting community/clustering of networked data and yet it has not been applied in HIN. However, the performance of AP algorithm depends on two parameters: i) preference p and ii) damping factor λ which causes the algorithm to be non-converged and produce unsatisfactory clustering results. Although some existing methods have been developed to handle this issue, it still faces two challenges: i) slow convergence ii) high computation for finding optimal clustering. In this paper, we presented an enhanced AP (EAP) clustering approach to overcome this issue by updating their parameter values based on different strategies, to improve the AP performance on an HIN data set. The experimental results show that the proposed method can accelerate the algorithm’s convergence to evaluate optimal clustering compared to the other methods.
AB - The real world data sets with multi-typed objects and multi-typed relations can be structured as heterogeneous information networks (HIN). Clustering is one of the most significant process in HIN since it provides useful insights of hidden patterns of objects and their complex relation structure. However, grouping multi-relational target objects without losing their rich semantics and unknown number of clusters is a challenging task. Hence, we use the meta-path concepts to compute the similarity matrix between each pair of objects by exploring the different relations to preserve their semantics. Subsequently, we employ the Affinity Propagation (AP) clustering approach that can automatically generate clusters and corresponding exemplars (cluster center) for each object based on the similarity matrix. The basic motivation of using AP algorithm is its effectiveness, scalability and the speed on detecting community/clustering of networked data and yet it has not been applied in HIN. However, the performance of AP algorithm depends on two parameters: i) preference p and ii) damping factor λ which causes the algorithm to be non-converged and produce unsatisfactory clustering results. Although some existing methods have been developed to handle this issue, it still faces two challenges: i) slow convergence ii) high computation for finding optimal clustering. In this paper, we presented an enhanced AP (EAP) clustering approach to overcome this issue by updating their parameter values based on different strategies, to improve the AP performance on an HIN data set. The experimental results show that the proposed method can accelerate the algorithm’s convergence to evaluate optimal clustering compared to the other methods.
U2 - 10.1007/978-3-031-55568-8_28
DO - 10.1007/978-3-031-55568-8_28
M3 - Conference contribution
SN - 9783031555671
T3 - Advances in Intelligent Systems and Computing
SP - 334
EP - 346
BT - Advances in Computational Intelligence Systems
PB - Springer
T2 - 21st UK Workshop on Computational Intelligence 2022
Y2 - 7 September 2022 through 9 September 2022
ER -