Coverage, Not Averages: Semantic Stratification for Trustworthy Retrieval Evaluation
by Andrew Klearman, Radu Revutchi, Rohin Garg +3