Evaluation Setup
•
Training data
–
3k corresponding path pairs from 10k QA
pairs (TREC-8, 9)
•
Test data
–
324 factoid questions from TREC-12 QA task
•
Passage retrieval on top 200 relevant
documents by TREC