Tools for Information Retrieval Experiments

Here is a collection of tools I built and modified for use in IR experiments.

txt | trecbox | Lucene | Terrier

Terminology:

ap.tgz is a sample data; stats and results are shown below.

STATS
2250 Documents from the Associated Press (on TREC DISK 3).  
20   Queries from TREC-4 (Query IDs 201-250).  
167  Relevance judgments.
RESULTS
Lucne
---------------------------------    
RUN NAME                   MAP
---------------------------------
DEMO.a.s.bm25.20.D.x       0.4814
DEMO.a.s.bm25L.20.D.x      0.4335
DEMO.a.s.bm25e.20.D.x      0.4766
DEMO.a.s.tmpl.20.D.x       0.2402
DEMO.a.s.tmple.20.D.x      0.2402
---------------------------------

Terrier
---------------------------------    
RUN NAME                   MAP
---------------------------------
DEMO.a.s.bm25.20.D.x       0.4728
DEMO.a.s.tf_idf.20.D.x     0.4732
DEMO.a.s.tmpl.20.D.x       0.2141
---------------------------------