TY - JOUR
T1 - Granular-ball computing guided anomaly detection for hybrid attribute data
AU - Su, Xinyu
AU - Wang, Xiwen
AU - Peng, Dezhong
AU - Chen, Hongmei
AU - Chen, Yingke
AU - Yuan, Zhong
PY - 2024/11/3
Y1 - 2024/11/3
N2 - Anomaly detection is one of the important research areas in data mining or data analytics. However, most of the existing anomaly detection methods only consider homogeneous data, such as nominal or numerical attribute data, and fail to effectively deal with hybrid attribute data. Moreover, these methods also suffer from inefficiency and noise sensitivity due to their single-granularity sample-based input paradigm. In this study, we propose an unsupervised anomaly detection method based on the granular-ball fuzzy set called HGBAD. First, we define a novel granular-ball fuzzy set to deal with the uncertainty information in hybrid attribute data. Based on the novel fuzzy set, multiple granular-ball fuzzy information granules are constructed. The anomaly degrees of granular-ball fuzzy information granules are fused to calculate the anomaly factors. The anomaly factors are used to measure the anomaly degrees of samples. Based on the anomaly factors, anomalies can be detected by an anomaly determination threshold. Experimental results demonstrate the superior performance of HGBAD in detecting anomalies across various data types. The code is publicly available at https://github.com/Mxeron/HGBAD.
AB - Anomaly detection is one of the important research areas in data mining or data analytics. However, most of the existing anomaly detection methods only consider homogeneous data, such as nominal or numerical attribute data, and fail to effectively deal with hybrid attribute data. Moreover, these methods also suffer from inefficiency and noise sensitivity due to their single-granularity sample-based input paradigm. In this study, we propose an unsupervised anomaly detection method based on the granular-ball fuzzy set called HGBAD. First, we define a novel granular-ball fuzzy set to deal with the uncertainty information in hybrid attribute data. Based on the novel fuzzy set, multiple granular-ball fuzzy information granules are constructed. The anomaly degrees of granular-ball fuzzy information granules are fused to calculate the anomaly factors. The anomaly factors are used to measure the anomaly degrees of samples. Based on the anomaly factors, anomalies can be detected by an anomaly determination threshold. Experimental results demonstrate the superior performance of HGBAD in detecting anomalies across various data types. The code is publicly available at https://github.com/Mxeron/HGBAD.
KW - Anomaly detection
KW - Granular computing
KW - Granular-ball computing
KW - Hybrid attribute data
KW - Outlier detection
UR - http://www.scopus.com/inward/record.url?scp=85208139659&partnerID=8YFLogxK
U2 - 10.1007/s13042-024-02425-8
DO - 10.1007/s13042-024-02425-8
M3 - Article
AN - SCOPUS:85208139659
SN - 1868-8071
JO - International Journal of Machine Learning and Cybernetics
JF - International Journal of Machine Learning and Cybernetics
ER -