Create POS Tagged and Parsed Text in Senseval-2 format

These tools are intended for use with the Brill Part of Speech Tagger and the Collins Parser. They part-of-speech tag and parse text that is stored in the Senseval-2 format.

parseSenseval

This package allows a user to parse any text in the Senseval-2 format using the Collins Parser. The text should have already been POS tagged (using posSenseval).

posSenseval

This package allows a user to part of speech tag any text in the Senseval-2 format with the Brill POS Tagger.

Guaranteed Pre-Tagging

This optional patch to the Brill Tagger lets the user specify a pre-tag for a word and guarantees that the tagger will use it. If the patch is installed, the posSenseval package makes use of it.

Pre-Processing for Tagging

This package formats Senseval-2 data appropriately for the Brill Tagger and makes it easy to insert pre-tags.
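
As a rough illustration of this kind of pre-processing, the Python sketch below flattens one Senseval-2 context onto a single line and attaches a pre-tag to the target word. It is not the package's own code. It assumes the target word is wrapped in <head>...</head>, and the "//" separator, the function name pretag_context, and the sample sentence are all illustrative assumptions. Check the tagger and patch documentation for the exact pre-tag syntax.

```python
import re

# Assumption: the target word of an instance is wrapped in <head>...</head>.
HEAD = re.compile(r"<head>(.*?)</head>")

def pretag_context(context: str, tag: str, sep: str = "//") -> str:
    """Return the context as one line, with the head word marked by a pre-tag.

    `sep` is the assumed separator between a word and its pre-tag; change it
    to match whatever the tagger actually expects.
    """
    # Replace <head>word</head> with word<sep>tag.
    marked = HEAD.sub(lambda m: f"{m.group(1)}{sep}{tag}", context)
    # Collapse newlines and runs of spaces so the context sits on one line.
    return " ".join(marked.split())

if __name__ == "__main__":
    sample = "The <head>art</head> of\n  the deal"  # hypothetical context
    print(pretag_context(sample, "NN"))
```

The separator is left as a parameter so the sketch does not hard-code a convention it cannot confirm.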

By: Ted Pedersen - tpederse AT d umn edu