BLXBenchBLXBench UI
blxbench
BLXBenchBLXBench UI

Benchmark

Levels

Misc

DocsDownload blxbenchOur TestsPassSponsor / Partnership

Benchmarks

Levels

Misc

DocsDownload blxbenchOur TestsPassSponsor / Partnership
Updated Apr 24, 02:56 PM0 models372 fixtures
blxbench

AI Model Benchmark Leaderboard

BLXBench

Category-aware model rankings from local BLXBench runs, grouped by task domain, difficulty level, pass rate, and latency.

No recorded runs yet
Top score
No modelsn/a
Executed tests
372 available fixtures0
Est. API spend
Sum of per-model costs from overall_ranking$0.00
Top decode
No stream samplesn/a
Categories
Coding Ui / Debugging / Hallucination / Reasoning / Refactoring / Security / Speed7
Levels
easy / medium / hard / Leicht4

Benchmark

OverallAll levels

No benchmark rows for this slice.

Run BLXBench for any category at any level to populate this view.

No benchmark rows for this slice.

Run BLXBench for any category at any level to populate this view.

BLXBench

Community driven leaderboardPublic benchmark runner — run in your environment, share results with the community.

© 2026 BLXBench by bitslix.com

ProvenanceAggregated from user runs
Scope0 / 7 / 372
LatestNo runs
TermsPrivacy