PARTS-OF-SPEECH TAGGER FOR NEPALI TEXT USING SVM

Asmita Subedi

2017

BSc.CSIT

Semester 7

Downloads 6

Parts-of-Speech Tagger for Nepali Text using SVM is an application that assigns parts of speech like noun, pronoun, verb, adverb and other lexical tags to each word in Nepali text based on both its definition, as well as its context. The tagger is built using the Support Vector Machine learning framework that is trained with 80,000 lemmatized words from the Nepali National Monolingual Written Corpus. The average accuracy of 88% and 72% was obtained for lemmatized text and unprocessed raw text tagging system respectively.

Support Vector Machine

Natural Language Processing

Supervised Machine Learning

Parts-of Speech Tagger

Tag-set

Nepali National Monolingual Written Corpus

PARTS-OF-SPEECH TAGGER FOR NEPALI TEXT USING SVM

Similar Projects