Ziggurat and the future of CWB

Ziggurat is the name of our current project to develop a successor to the corpus-indexing technology that underpins the Corpus Workbench.

The project aims to produce:

Please join the CWBdev mailing list if you are interested in hearing about our progress on Ziggurat.

So far, we have developed a proposal (not wholly finalised) for the data model and file formats. You can read about this in the documents linked below. The next step will be a rough prototype for the Ziggurat API. Finally, we will move on to full implementation.

Corpus Workbench version 4: the future

Although we hope that Ziggurat will be broadly useful, its primary purpose is to power CWB v 4.

CWB 4 will be a complete re-design that uses Ziggurat to improve flexibility and scalability. Some design goals:

Work on CWB v 4 will begin after the 1.0 release of Ziggurat itself.

The Ziggurat data model

Comments are welcome! You might want to read Evert & Hardie (2015) first to get an overview.

New versions of this and other Ziggurat documents will be published soon. As always code will appear on the SourceForge repository as we write it.

How to install

Ziggurat is not end user software. Most people will never need to install it. Only developers need to consider doing so.

Read more

To learn more about Ziggurat, see: