PARTS-OF-SPEECH TAGGER FOR NEPALI TEXT USING SVM

Asmita Subedi
2017
BSc.CSIT
Semester 7

Parts-of-Speech Tagger for Nepali Text using SVM is an application that assigns a part-of-speech tag, such as noun, pronoun, verb, adverb, or another lexical category, to each word in Nepali text based on both its definition and its context. The tagger is built on a Support Vector Machine (SVM) trained on 80,000 lemmatized words from the Nepali National Monolingual Written Corpus. Average accuracies of 88% and 72% were obtained for the lemmatized-text and unprocessed raw-text tagging systems, respectively.
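
As a rough illustration of what tagging a word "based on both its definition and its context" can look like in practice, the sketch below builds a feature dictionary for each token from the word itself and its neighbours. The feature names, the window size, the `<PAD>` marker, and the example sentence are all assumptions made for this illustration; the project's actual feature set is not described here. Dictionaries of this kind could then be vectorized and passed to a linear SVM (for example scikit-learn's `DictVectorizer` and `LinearSVC`), though that step is left out to keep the sketch dependency-free.

```python
from typing import Dict, List


def token_features(tokens: List[str], i: int, window: int = 2) -> Dict[str, str]:
    """Build features for tokens[i] from the word itself and its neighbours.

    Hypothetical feature set, chosen only for illustration.
    """
    word = tokens[i]
    feats = {
        # Features describing the word itself.
        "word": word,
        # Slices work on Unicode code points, so for Devanagari they may
        # cut through a syllable; a real system might segment differently.
        "prefix2": word[:2],
        "suffix2": word[-2:],
        "suffix3": word[-3:],
    }
    # Features describing the context: surrounding words in a fixed window.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        j = i + offset
        in_range = 0 <= j < len(tokens)
        feats[f"word[{offset:+d}]"] = tokens[j] if in_range else "<PAD>"
    return feats


def sentence_features(tokens: List[str]) -> List[Dict[str, str]]:
    """Return one feature dictionary per token in the sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


# Illustrative tokenized sentence ("I go home."), not taken from the corpus.
sentence = ["म", "घर", "जान्छु", "।"]
features = sentence_features(sentence)
```

Each entry in `features` lines up with one token of `sentence`, so a classifier trained on pairs of feature dictionaries and gold tags would predict one tag per word.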

Support Vector Machine
Natural Language Processing
Supervised Machine Learning
Parts-of-Speech Tagger
Tag-set
Nepali National Monolingual Written Corpus
