May 15, 2026 ยท 15 min
SocratesBench: A Curriculum-Aware Adversarial Benchmark for Pedagogical AI Tutors
A white paper introducing SocratesBench - an adversarial, judge-based benchmark measuring pedagogical failure modes in LLM tutors across a fixed quadratic-equation task.