Proceedings of the 6th Language Resources and Evaluation Conference (LREC 2008). Marrakech, Morocco.
Based on the idea that local contexts predict the same basic category across a language, we develop a simple method for comparing tagsets across corpora. The principle differences between tagsets are evidenced by variation in categories in one corpus in the same contexts where another corpus exhibits only a single tag. Such mismatches highlight differences in the definitions of tags which are crucial when porting technology from one annotation scheme to another.
Electronically available file formats:
Bibtex entry:
@InProceedings{dickinson:jochim:08,
author = {Markus Dickinson and Charles Jochim},
title = {A Simple Method for Tagset Comparison},
booktitle = {Proceedings of the 6th Language Resources and
Evaluation Conference (LREC 2008)},
address = {Marrakech, Morocco},
pages = {},
url = {\url{http://jones.ling.indiana.edu/~mdickinson/papers/dickinson-jochim08.html}},
year = {2008}
}