SAN FRANCISCO, April 8, 2026 /PRNewswire/ -- KushoAI, an AI-native platform for API testing and software reliability, has introduced APIEval-20, an open benchmark designed to evaluate how effectively ...
-- No existing benchmark measured whether AI agents can find real API bugs from a schema and payload alone
-- 100+ downloads in first week by developers and contributors; freely available on ...
How well do you understand the world of words? Verbal ability includes the capacity to reason using language, and to use words appropriately and in context. This test measures some of these abilities.
Microsoft's Bing team has open-sourced Harrier, an embedding model family that tops the multilingual MTEB v2 benchmark under an MIT license.