Evals
Providers using this tag (4)
APIs with this tag (6)
APEX Benchmarks (AI Productivity Index) mercor
APEX-Agents Leaderboard mercor
APEX-SWE Leaderboard mercor
OpenAI Evals API openai
Surge RL Environments and Agents surge-ai
Surge Rubrics and Verifiers surge-ai
Score breakdown
Frequency
26.9
log-scaled weighted occurrences
Breadth
0.3
spread across providers
Quality lift
44.0
mean composite of providers using it
Cohesion
0.0
strength of nearest seed neighbor
Where this tag comes from
Provider tag1
Api tag6