AI Benchmarks: Useless, Personalized Agents Prevail