TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE

Bibliographic details
Main author: Lê, Thị Minh Nguyện
Format: Article
Language: English
Published: Trường Đại học Đà Lạt 2023
Online access: https://scholar.dlu.edu.vn/thuvienso/handle/DLU123456789/114322
https://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536
Holding library: Thư viện Trường Đại học Đà Lạt
Description
Summary: The development of the Internet has increased the amount of information stored online every day. Finding the information we are interested in takes a lot of time, so techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many methods of text classification; in this paper we study and apply the Support Vector Machine (SVM) method and compare its performance with the Naïve Bayes probabilistic method. In addition, before performing text classification, we preprocessed the training set by extracting keywords with dimensionality reduction techniques to reduce the time needed for classification.
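Illustration: the workflow named in the summary (keyword extraction to shrink the feature space, then SVM compared against Naïve Bayes) can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the paper's implementation: the toy corpus, the document-frequency keyword score, the hyperparameters, and every function name (`select_keywords`, `train_nb`, `train_svm`, `classify`) are hypothetical, and the SVM is a plain linear model trained by sub-gradient descent on the hinge loss.

```python
import math
from collections import Counter

# Hypothetical toy corpus; it stands in for the paper's training set.
TRAIN = [
    ("the team won the football match", "sport"),
    ("the player scored a goal in the match", "sport"),
    ("the coach praised the team", "sport"),
    ("the bank raised interest rates", "finance"),
    ("the stock market fell as rates rose", "finance"),
    ("investors sold stock after the bank report", "finance"),
]

def tokenize(text):
    return text.lower().split()

def select_keywords(docs, k):
    """Assumed reduction step: keep the k terms whose document
    frequency differs most between the two classes."""
    first, second = sorted({label for _, label in docs})
    df = {first: Counter(), second: Counter()}
    for text, label in docs:
        df[label].update(set(tokenize(text)))
    terms = set(df[first]) | set(df[second])
    score = lambda t: abs(df[first][t] - df[second][t])
    return sorted(terms, key=lambda t: (-score(t), t))[:k]

def vectorize(text, vocab):
    counts = Counter(tokenize(text))
    return [counts[t] for t in vocab]

def train_nb(X, y):
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""
    model = {}
    for c in sorted(set(y)):
        rows = [x for x, label in zip(X, y) if label == c]
        totals = [sum(col) for col in zip(*rows)]
        denom = sum(totals) + len(totals)
        model[c] = (math.log(len(rows) / len(X)),
                    [math.log((n + 1) / denom) for n in totals])
    return model

def predict_nb(model, x):
    def log_posterior(c):
        prior, loglik = model[c]
        return prior + sum(n * l for n, l in zip(x, loglik))
    return max(model, key=log_posterior)

def train_svm(X, y, epochs=100, lr=0.1, lam=0.01):
    """Linear SVM: sub-gradient descent on the L2-regularised hinge loss.
    Targets in y must be +1 or -1."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for x, target in zip(X, y):
            margin = target * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            w = [wi * (1 - lr * lam) for wi in w]      # regularisation shrink
            if margin < 1:                             # hinge loss is active
                w = [wi + lr * target * xi for wi, xi in zip(w, x)]
                b += lr * target
    return w, b

def predict_svm(model, x):
    w, b = model
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1

vocab = select_keywords(TRAIN, k=5)
X = [vectorize(text, vocab) for text, _ in TRAIN]
labels = [label for _, label in TRAIN]
nb_model = train_nb(X, labels)
svm_model = train_svm(X, [1 if label == "sport" else -1 for label in labels])

def classify(text):
    """Return (Naive Bayes label, SVM label) for one document."""
    x = vectorize(text, vocab)
    svm_label = "sport" if predict_svm(svm_model, x) == 1 else "finance"
    return predict_nb(nb_model, x), svm_label

print(classify("the team scored in the football match"))
```

Reducing each document to a handful of keywords keeps the vectors short, which is where the time saving mentioned in the summary would come from; a realistic comparison would use a held-out test set and a much larger vocabulary.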