
#1400 of 2292 in Artificial Intelligence (All Time)
Evaluating Deep Research Agents on Expert Consulting Work: A Benchmark with Verifiers, Rubrics, and Cognitive Traps
Congratulate the authors
Know the authors? Send them a congratulation.

Know the authors? Send them a congratulation.