June 1, 2026 ยท 19 min
SocratesBench: A Curriculum-Aware Adversarial Benchmark for Pedagogical AI Tutors
A white paper introducing SocratesBench - an adversarial, judge-based benchmark measuring pedagogical failure modes in LLM tutors across a fixed quadratic-equation task.