A focused look at Claude Fable 5 on SocratesBench, and model comparisons.
A white paper introducing SocratesBench - an adversarial, judge-based benchmark measuring pedagogical failure modes in LLM tutors across a fixed quadratic-equation task.