Here are some existing test collections with queries and relevance judgements that can be downloaded from the internet. Perhaps we could improve the Lucene benchmark package so that it can easily download these collections and run evaluations against them?
Several small ones here: http://www.cs.utk.edu/~lsi/corpa.html
TREC-5 confusion: http://trec.nist.gov/data/t5_confusion.html
TREC-9 filtering: http://trec.nist.gov/data/t9_filtering.html
braun corpus: http://ilps.science.uva.nl/resources/hdr
- Note: this last link is currently not working because the author's quota is exceeded, but she sent me the corpus. Maybe we should contact her and ask for permission to host it elsewhere?
> So I hope this use of your corpus is acceptable to you (it is not for any
> commercial purpose, just to improve lucene).
Yes, that is all right. That is what my corpus was made for.