AI and LLMs struggle with historical accuracy in advanced tests January 20, 2025 By admin Leading AI systems perform poorly on nuanced historical exams, achieving only 46% accuracy at best.