AI benchmarking platform is helping top companies rig their model performances, study claims

May 22, 2025 By admin

LMArena, a popular benchmark for large language models, has been accused of giving preferential treatment to AIs made by big tech firms, potentially enabling them to game their results.

Headlines, uncategorized

Post navigation

← iPhone design guru and OpenAI chief promise an AI device revolution

A safety institute advised against releasing an early version of Anthropic’s Claude Opus 4 AI model →

Search