A SUITABLE MODEL FOR CLASSIFYING VIETNAMESE DOCUMENTS
189 viewsKeywords:
Text Classification; Tokenization; Conditonal Random Fields - CRFs.Abstract
In this paper, we proposed a text classifying model for Vietnamese document. Our model is a combination of two separated components: A tokenization algorithm based on Conditional Random Fields (CRFs)[1] and StarSpace[2] – a general text classification model. Experiments results indicate that our model performed well on classifying task (with accuracy above 90% on the testing dataset).
Downloads
Published
10-04-2020
How to Cite
[1]
Hùng, “A SUITABLE MODEL FOR CLASSIFYING VIETNAMESE DOCUMENTS”, JMST, no. 66, pp. 238–242, Apr. 2020.
Issue
Section
News & Views