Q: How do i index formats other than HTML and TREC?
A: The HTML indexer will work over text data as well (things that look like markup may be ignored). Simply index your text data as HTML and search as normal. You can use common filters such as Antiword and ps2ascii to convert Microsoft Word documents, PostScript and PDF files into text data suitable for searching with Zettair.
Q: How much data is Zettair capable of handling?
A: We're not really sure. We've indexed over 100GB of data without problems. As of 0.6.1, we've indexed the 426GB TREC terabyte track collection.
Q: Are indexes portable between machines?
A: Indexes are currently portable between different machines, but aren't portable between different architectures. Unfortunately, we currently don't store floating point document weights portably. You may also experience problems if your source documents are stored in different locations on different machines.
Q: I want Zettair to ... and it currently doesn't. Help?
A: You can email us at firstname.lastname@example.org. We'll try our best to help, or provide some advice.
Q: I found a bug in Zettair, where do i report it?
A: Please mail us at email@example.com we'll deal with it.
For detailed guides on using Zettair visit the tutorial page. Note: The tutorial is also included in the download, in the directory zettair/doc.
Quick Start Guides
Read our getting started with HTML documents guide. Getting Zettair up and running has seven simple steps!
It's also easy to get Zettair going for your TREC experiments. Here's a simple getting started with TREC guide.