AI Model Benchmark Leaderboard
BLXBench
Category-aware model rankings from local BLXBench runs, grouped by task domain, difficulty level, pass rate, and latency.
No recorded runs yet
Top score
Executed tests
Est. API spend
Top decode
Categories
Levels
No benchmark rows for this slice.
Run BLXBench for any category at any level to populate this view.