Part of Speech (POS) tagging

Automatic part-of-speech (POS) tagging is the process of annotating each word in a corpus with a part-of-speech tag, based on the word's definition and the context in which it occurs.
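As a rough illustration of why context matters, the sketch below tags a short English sentence using a small lexicon and a single context rule. It is a toy example only: the lexicon, the tag names (DET, NOUN, VERB, PRON) and the `tag` function are all hypothetical, and they do not reflect the tag sets or the tagging method used for the languages on this page.

```python
# Toy illustration only: a hypothetical lexicon and one context rule.
# The tag names below are assumptions, not the tag sets documented elsewhere.
LEXICON = {
    "the": {"DET"},
    "i": {"PRON"},
    "open": {"VERB"},
    "can": {"NOUN", "VERB"},  # ambiguous: its tag depends on context
}


def tag(tokens):
    """Return a list of (token, tag) pairs for a list of tokens."""
    tags = []
    for token in tokens:
        # Unknown words default to NOUN in this sketch.
        candidates = LEXICON.get(token.lower(), {"NOUN"})
        if len(candidates) == 1:
            chosen = next(iter(candidates))
        elif tags and tags[-1] == "DET" and "NOUN" in candidates:
            # Context rule: an ambiguous word right after a determiner is a noun.
            chosen = "NOUN"
        elif "VERB" in candidates:
            chosen = "VERB"
        else:
            chosen = sorted(candidates)[0]
        tags.append(chosen)
    return list(zip(tokens, tags))


if __name__ == "__main__":
    for token, pos in tag("I can open the can".split()):
        print(f"{token}\t{pos}")
```

In this sketch the word "can" receives two different tags in the same sentence, because the preceding word differs. Practical taggers typically learn such decisions from annotated data rather than from hand-written rules.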

 

See Annotation Tag Sets for tag details.

 

Evaluation results for POS:

(As reported in Eiselen, R. & Puttkammer, M.J. 2014. Developing Text Resources for Ten South African Languages. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'14), pp. 3698-3703.)

Language            Accuracy
Afrikaans           95.71%
isiNdebele          82.57%
isiXhosa            84.18%
isiZulu             83.83%
Sesotho sa Leboa    96.00%
Sesotho             92.36%
Setswana            96.02%
Siswati             82.08%
Tshivenḓa           88.25%
Xitsonga            89.83%
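
This page does not state how the accuracy figures were calculated. A common convention, assumed here purely for illustration, is token-level accuracy: the percentage of tokens whose predicted tag matches the tag in a manually annotated gold standard. The function name `tagging_accuracy` and the sample tags are hypothetical.

```python
def tagging_accuracy(gold_tags, predicted_tags):
    """Return the percentage of positions where the predicted tag equals the gold tag.

    Assumes token-level accuracy; the cited evaluation may define it differently.
    """
    if len(gold_tags) != len(predicted_tags):
        raise ValueError("gold and predicted sequences must have the same length")
    if not gold_tags:
        raise ValueError("cannot compute accuracy on an empty sequence")
    correct = sum(g == p for g, p in zip(gold_tags, predicted_tags))
    return 100.0 * correct / len(gold_tags)


if __name__ == "__main__":
    gold = ["DET", "NOUN", "VERB", "NOUN"]       # hypothetical gold-standard tags
    predicted = ["DET", "NOUN", "VERB", "VERB"]  # hypothetical tagger output
    print(f"{tagging_accuracy(gold, predicted):.2f}%")
```

Under this assumption, a figure such as 95.71% would mean that roughly 96 of every 100 tokens in the test data received the same tag as in the gold standard.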