AI benchmarks hampered by bad science November 7, 2025 By admin : Study finds many tests don’t measure the right things