Ai Benchmark Results Dashboard

17h

Which AI agent is the best? This new leaderboard can tell you

On Wednesday, Galileo launched an Agent Leaderboard on Hugging Face, an open-source AI platform where users can build, train, access, and deploy AI models. The leaderboard is meant to help people ...

Yahoo Finance3d

HackerRank Introduces New Benchmark to Assess Advanced AI Models

CUPERTINO, Calif., Feb. 11, 2025 (GLOBE NEWSWIRE) -- HackerRank, the Developer Skills Company, today introduced its new ASTRA Benchmark. ASTRA, which stands for Assessment of Software Tasks in ...

Morningstar10d

Paritii Launches The Parity Benchmark: A Game-Changer in AI Fairness Evaluation

What the Results Reveal: AI Still Struggles with Bias and Reasoning Paritii's inaugural benchmark tested seven leading AI models, assessing their ability to handle both factual fairness questions ...

HackerRank Introduces New Benchmark to Assess Advanced AI Models

The ASTRA Benchmark consists of multi-file, project-based problems designed to mimic real-world coding tasks. The intent of the HackerRank ASTRA Benchmark is to determine the correctness and ...

Mena FN3d

Hackerrank Introduces New Benchmark To Assess Advanced AI Models

but Claude- -3.5-sonnet produced more consistent results. Ravisankar added,“By open sourcing our ASTRA Benchmark, we're offering the AI community the opportunity to run their models against a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results