Skip to main content.

Latest news

Workshop programme and proceedings (PDF) are available now! See details under “Workshop programme”.

Call for Papers

The 4th Web as Corpus Workshop: Can we beat Google? (at LREC 2008)
endorsed by the ACL Special Interest Group on the Web as Corpus (SIGWAC)

 Date:Sunday, 1 June 2008
 Location:Marrakech, Morocco
 Web page:
 Deadline:Friday, 29 February 2008


Commercial Web search engines offer fast search on huge amounts of text, combined with increasingly clever ranking and data analysis algorithms, but their content-centric services do not cater to the needs of the computational linguistics and NLP communities. The leading theme of this workshop, the fourth in a row of highly successful Web as Corpus meetings, is to find out how to combine the power and scalability of modern search engine technology with sophisticated linguistic annotation and query processing.

We invite papers on various topics concerning the use of Web resources for corpus research and NLP applications, including (but not limited to) the following:

Submission information

Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop. Submissions should follow the format of LREC proceedings and should not exceed eight (8) pages, including references. We strongly recommend the use of LREC LaTeX or Microsoft Word style files tailored for this year's conference. Details on the submission procedure will be posted on this Web site shortly.

Programme committee

Organising committee