The IMS Open Corpus Workbench (CWB)
The IMS Open Corpus Workbench (CWB) is a collection of open-source tools for managing and querying large text corpora (up to 2 billion words) with linguistic annotations. Its central component is the flexible and efficient query processor CQP.
The first official open-source release of the Corpus Workbench was Version 3.0. The second stable release will be Version 3.5, which we are currently working towards. The current development version is 3.4, which, as of 2021, is very close to being designated 3.5.
CWB can be downloaded from this website; the source code is also available via SourceForge. Besides downloads of the CWB release versions, associated software and sample corpora, you'll also find documentation, academic publications including standard references, and other information across the different sections of this site.
For more technical matters, take a look at the section of the site for developers. You may well be interested in our roadmap for CWB Version 4, including Project Ziggurat.
Please note that this website is maintained by S.E. and A.H., i.e. the same culprits who do the actual programming... We hope you'll forgive us if pages sometimes get a bit out of date!
Found a bug?
You can report it on our SourceForge bug tracker (SourceForge log-in needed, due to spam).
Need to contact us?
- The CWB email list is probably your best bet.
- Or maybe you could try our individual main web presences: S.E.'s is here; A.H.'s is here.
About this site
This Web site uses the GopherPHP layout and navigation framework. The upper row of the navigation bar at the top of the screen allows you to switch between different sections of the site, while the lower row lists pages in the current section. Click on the preferences icon in the navigation bar to customise fonts and colours. If you have enabled cookies, these settings will be remembered on your next visit to the CWB Web site.