Future of Information and Communication Conference (FICC) 2024
4-5 April 2024
Publication Links
IJACSA
Special Issues
Future of Information and Communication Conference (FICC)
Computing Conference
Intelligent Systems Conference (IntelliSys)
Future Technologies Conference (FTC)
International Journal of Advanced Computer Science and Applications(IJACSA), Volume 13 Issue 11, 2022.
Abstract: Datasets with a balanced distribution of data are often difficult to find in real life. Although various methods have been developed and proven successful using shallow learning algorithms, handling unbalanced classes using a deep learning approach is still limited. Most of these studies only focus on image data using the Convolution Neural Network (CNN) architecture. In this study, we tried to apply several class handling techniques to three datasets of unbalanced text data. Both use a data-level approach with resampling techniques on word vectors and algorithm-level using Weighted Cross-Entropy Loss (WCEL) to handle cases of imbalanced text classification. With Bidirectional Long-Short Term Memory (BiLSTM) architecture. We tested each method using three datasets with different characteristics and levels of imbalance. Based on the experiments that have been carried out, each technique applied has a different performance on each dataset.
Sumarni Adi, Awaliyatul Hikmah, Bety Wulan Sari, Andi Sunyoto, Ainul Yaqin and Mardhiya Hayaty, “The Best Techniques to Deal with Unbalanced Sequential Text Data in Deep Learning” International Journal of Advanced Computer Science and Applications(IJACSA), 13(11), 2022. http://dx.doi.org/10.14569/IJACSA.2022.0131177
@article{Adi2022,
title = {The Best Techniques to Deal with Unbalanced Sequential Text Data in Deep Learning},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2022.0131177},
url = {http://dx.doi.org/10.14569/IJACSA.2022.0131177},
year = {2022},
publisher = {The Science and Information Organization},
volume = {13},
number = {11},
author = {Sumarni Adi and Awaliyatul Hikmah and Bety Wulan Sari and Andi Sunyoto and Ainul Yaqin and Mardhiya Hayaty}
}
Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.