The Register on MSN · 1h
Why AI benchmarking sucks
Anyone remember when Volkswagen rigged its emissions results? Oh... AI model makers love to flex their benchmark scores. But how trustworthy are these numbers? What if the tests themselves are rigged ...
Moreover, Qualcomm believes that AI is becoming the new user interface thanks to the emerging trends in AI agents.