TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE

The development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or...

Full beskrivning

Sparad:
Bibliografiska uppgifter
Huvudupphovsman: Lê, Thị Minh Nguyện
Materialtyp: Artikel
Språk:English
Publicerad: Trường Đại học Đà Lạt 2023
Länkar:https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114322
https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536
Taggar: Lägg till en tagg
Inga taggar, Lägg till första taggen!
Thư viện lưu trữ: Thư viện Trường Đại học Đà Lạt
Beskrivning
Sammanfattning:The development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many methods of text classification, but for this paper we study and apply the Support Vector Machine (SVM) method and compare its effect with the Naïve Bayes probability method. In addition, before implementing text classification, we performed preprocessing steps on the training set by extracting keywords with dimensional reduction techniques to reduce the time needed in the classification process.