![]() |
Bakeoff |
The first bakeoff, held in 2003 and presented at the 2nd SIGHAN Workshop at ACL 2003 in Sapporo, has become the pre-eminent measure for Chinese word segmentation evaluation and has been cited in numerous papers. The second bakeoff held in 2005 and presented at the 4th SIGHAN Workshop at IJCNLP-05 on Jeju Island, Korea demostrated further progress in this task. In a change from the first two evaluations, the third bakeoff will augment the classic Word Segmentation task with a new Named Entity Recognition task. Corpora from the following organizations will be available for use:
The final details of the segmentation and named entity tagging task will be made available through the registration site which will open March 15, 2006.
Participants are required to submit a short paper describing their system and analyzing their performance, and present a summary at the workshop. The reports will be published in the SIGHAN workshop proceedings.
The language of the workshop is English. Papers must be submitted and presented in English. Note that unlike the workshop proper, there will not be a peer review process on the bakeoff reports.
The web page for the competition is:
http://sighan.cs.uchicago.edu/bakeoff2006/
Questions on the bakeoff should be addressed to Gina-Anne Levow, levow@cs.uchicago.edu