Description and Schedule
1. Introduction
This is the official announcement for the Fourth International Chinese Language Processing Bakeoff, sponsored by the Special Interest Group for Chinese Language Processing (SIGHAN) of the Association for Computational Linguistics. The bakeoff will be jointly held with the First CIPS Chinese Language Processing Evaluation in the summer of 2007, and co-organized by SIGHAN, ChineseLDC, and the Verifying Centre of Chinese Language and Character Standards of the State Language Commission of P.R.C. ?The results will be presented at the 6th SIGHAN Workshop, to be held at IJCNLP 2008 in Hyderabad, India, January 11-12, 2008.
The first bakeoff, held in 2003 and presented at the 2nd SIGHAN Workshop at ACL 2003 in Sapporo,Japan has become the pre-eminent measure for Chinese word segmentation evaluation and has been cited in numerous papers. The second bakeoff held in 2005 and presented at the 4th SIGHAN Workshop at IJCNLP-05 on Jeju Island, Korea demonstrated further progress in this task. Following the success of the first two evaluations, the third bakeoff (Sydney, 2006) augmented the classic Word Segmentation task with a new Named Entity Recognition task.
In the fourth bakeoff, the following tasks will be evaluated:
Participants are required to submit a short paper describing the structure and algorithm of their system and analyzing their performance, and present a summary at the workshop. The reports will be published in the SIGHAN workshop proceedings.
The language of the workshop is English. Papers must be submitted and presented in English. Note that unlike the workshop proper, there will not be a peer review process on the bakeoff reports.Notice: Not all the participants should go to India to attend the workshop. Submitting a short paper is OK.
2. Schedule
2007-07-10 |
Official website opened at http://www.china-language.gov.cn |
2007-07-15 12:00 to 2007-08-24 12:00 (Beijing Time) |
Registration Open |
2007-08-25 12:00 (Beijing Time) |
Training data made available |
2007-09-25 12:00 (Beijing Time) |
Testing data made available |
2007-09-28 deadline 12:00 (Beijing Time) |
Test results due back to organizers |
2007-10-15 |
Results privately reported to participants |
2007-11-10 |
Final reports due from participants |
3. Contact Information
The fourth bakeoff is being coordinated by Dr. Guangjin Jin of the Institute of Applied Linguistics, Beijing, China.
Questions on the bakeoff should be addressed to: bakeoff_4@126.com or gorillax@126.com
http//:www.china-language.gov.cn
No.51 Chaonei Nanxiaojie, Beijing, China, Institute of Applied Linguistics.
Postal Code£º100010
Phone£º010-65592937,13810078546
2007-7-6