Evaluation Setup
Training data
3k corresponding path pairs from 10k QA
pairs (TREC-8, 9)
Test data
324 factoid questions from TREC-12 QA task
Passage retrieval on top 200 relevant
documents by TREC