International Journal of Advances in Science, Engineering and Technology(IJASEAT)
  Journal Paper


Paper Title :
Dimensionality Reduction for Classification of Filipino Text Documents Based on Improved Bayesian Vectorization Technique

Author : Hajah T. Sueno, Bobby D. Gerardo, Ruji P. Medina

Article Citation : Hajah T. Sueno, Bobby D. Gerardo, Ruji P. Medina (2020). "Dimensionality Reduction for Classification of Filipino Text Documents Based on Improved Bayesian Vectorization Technique", International Journal of Advances in Science, Engineering and Technology (IJASEAT), Volume-8, Issue-1, pp. 56-60.

Abstract : Reducing the dimensionality of feature vectors plays a vital role in text processing: it shrinks the feature vector used in mining tasks, with the aim of achieving higher classification accuracy. While dimensionality reduction for text classification is an active area of research for most languages, Filipino documents have received little or no attention from researchers. This paper therefore addresses dimensionality reduction in representing relevant data from Filipino texts using an improved Bayesian vectorization technique. To validate its effectiveness, the improved Bayesian vectorization was compared with the Term Frequency-Inverse Document Frequency (TF-IDF) method. The outcomes are reported using standard measures: precision, recall, F-score, and accuracy. The results show that the improved Bayesian vectorization performs significantly better, reaching 98% classification accuracy compared with 76% for the TF-IDF vectorization technique.

Keywords - Dimensionality Reduction, Bayesian Vectorization, Filipino Text Document, OPM Songs, Lyrics, Text Classification
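As a rough illustration of the TF-IDF baseline and the evaluation measures named in the abstract, the following Python sketch computes TF-IDF weights for tokenized documents and derives precision, recall, F-score, and accuracy from predicted labels. It is not the authors' implementation: the function names `tf_idf` and `binary_metrics`, the specific TF-IDF variant (length-normalized term frequency with idf = log(N / df)), and the toy tokens are all assumptions made for illustration. The improved Bayesian vectorization itself is not sketched, since the abstract does not describe its formulation.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    Other TF-IDF formulations exist and may differ from the paper's.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def binary_metrics(y_true, y_pred, positive):
    """Precision, recall, F-score (F1) and accuracy for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    correct = sum(t == p for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "accuracy": correct / len(pairs),
    }


# Hypothetical toy corpus of tokenized lyric fragments (not from the paper).
docs = [["mahal", "kita"], ["mahal", "na", "mahal"], ["bayan", "ko"]]
vectors = tf_idf(docs)

# Hypothetical true and predicted genre labels for four documents.
scores = binary_metrics(
    ["love", "love", "folk", "folk"],
    ["love", "folk", "folk", "folk"],
    positive="love",
)
```

Each document becomes a sparse vector with one entry per distinct term, so the vector length grows with the vocabulary; that growth is the dimensionality problem the abstract refers to.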

Type : Research paper

Published : Volume-8,Issue-1


DOIONLINE NO - IJASEAT-IRAJ-DOIONLINE-17047

Copyright: © Institute of Research and Journals

Published on 2020-05-23