
#142 of 2292 in Artificial Intelligence (All Time)
MIRROR: A Hierarchical Benchmark for Metacognitive Calibration in Large Language Models
Congratulate the authors
Know the authors? Send them a congratulation.

Know the authors? Send them a congratulation.