Call for Participation

Newsflash

 

Description and Schedule

1. Introduction
This is the official announcement for the Fourth International Chinese Language Processing Bakeoff, sponsored by the Special Interest Group for Chinese Language Processing (SIGHAN) of the Association for Computational Linguistics. The bakeoff will be jointly held with the First CIPS Chinese Language Processing Evaluation in the summer of 2007, and co-organized by SIGHAN, ChineseLDC, and the Verifying Centre of Chinese Language and Character Standards of the State Language Commission of P.R.C. ?The results will be presented at the 6th SIGHAN Workshop, to be held at IJCNLP 2008 in Hyderabad, India, January 11-12, 2008.

The first bakeoff, held in 2003 and presented at the 2nd SIGHAN Workshop at ACL 2003 in Sapporo,Japan has become the pre-eminent measure for Chinese word segmentation evaluation and has been cited in numerous papers. The second bakeoff held in 2005 and presented at the 4th SIGHAN Workshop at IJCNLP-05 on Jeju Island, Korea demonstrated further progress in this task. Following the success of the first two evaluations, the third bakeoff (Sydney, 2006) augmented the classic Word Segmentation task with a new Named Entity Recognition task.

In the fourth bakeoff, the following tasks will be evaluated:

The corpora from following organizations would be available:
Academia Sinica, Taipei
City University of Hong Kong, Hong Kong
Microsoft Research Asia, Beijing
Peking University, Beijing
Shanxi University, Taiyuan
State Language Commission of P.R.C.,Beijing£¨National Chinese Corpus£©
University of Colorado, United States(Chinese Tree Bank)

Participants are required to submit a short paper describing the structure and algorithm of their system and analyzing their performance, and present a summary at the workshop. The reports will be published in the SIGHAN workshop proceedings.

The language of the workshop is English. Papers must be submitted and presented in English. Note that unlike the workshop proper, there will not be a peer review process on the bakeoff reports.

Notice: Not all the participants should go to India to attend the workshop. Submitting a short paper is OK.

2. Schedule

2007-07-10

Official website opened at http://www.china-language.gov.cn

2007-07-15 12:00 to 2007-08-24 12:00 (Beijing Time)

Registration Open

2007-08-25 12:00 (Beijing Time)

Training data made available

2007-09-25 12:00 (Beijing Time)

Testing data made available

2007-09-28 deadline 12:00 (Beijing Time)

Test results due back to organizers

2007-10-15

Results privately reported to participants

2007-11-10

Final reports due from participants

 

3. Contact Information
The fourth bakeoff is being coordinated by Dr. Guangjin Jin of the Institute of Applied Linguistics, Beijing, China.
Questions on the bakeoff should be addressed to: bakeoff_4@126.com or gorillax@126.com
http//:www.china-language.gov.cn
No.51 Chaonei Nanxiaojie, Beijing, China, Institute of Applied Linguistics.
Postal Code£º100010
Phone£º010-65592937,13810078546

 

2007-7-6