Paper
7 March 2022 WBSWK: weighted bi-directional sampling based on wk-means method for imbalanced datasets
Peng Zhou
Author Affiliations +
Proceedings Volume 12167, Third International Conference on Electronics and Communication; Network and Computer Technology (ECNCT 2021); 1216735 (2022) https://doi.org/10.1117/12.2628780
Event: 2021 Third International Conference on Electronics and Communication, Network and Computer Technology, 2021, Harbin, China
Abstract
Traditional classification algorithms tend to cause minority classes to be misclassified when classifying imbalanced data sets. In this paper, we propose an over-sampling and under-sampling algorithm based on WK-means clustering. Our method firstly uses WK-means to cluster the whole datasets, then oversamples the datasets in some regions with a large number of minority class according to the imbalance ratio by different weights of each cluster, avoids the generation of noise and effectively overcomes imbalances between and within classes. Finally, undersampling the clusters with a large number of minority class to balance the sample number of the whole datasets. The experimental results obtained from 11 datasets show that the proposed method is superior to other methods under different classifiers and evaluation criteria.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Peng Zhou "WBSWK: weighted bi-directional sampling based on wk-means method for imbalanced datasets", Proc. SPIE 12167, Third International Conference on Electronics and Communication; Network and Computer Technology (ECNCT 2021), 1216735 (7 March 2022); https://doi.org/10.1117/12.2628780
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Fermium

Frequency modulation

Lawrencium

Glasses

Heart

Iris

Astatine

Back to Top