Test fixture

Speed Perf Throughput Data Structures

Speed, hard, scorer: contains_all

Latency-sensitive tasks where concise correct output matters.

How it is scored

The model receives the user prompt and, optionally, a system message. The run is scored by the contains_all scorer using the JSON configuration below. Pass/fail and partial credit are determined entirely by that scorer applied to the model output; there is no human grading.

User prompt
Write an in-depth technical explanation of a least-recently-used (LRU) cache suitable for senior engineer interview prep. Requirements:
- Use exactly these headings in order: ## Motivation ## Structure ## Operations ## Complexity
- Include step-by-step Python-like pseudocode for get and put
- Discuss why combining a hash map with a doubly linked list achieves typical O(1) operations
- Write multiple paragraphs per section; be exhaustive, not terse.

Write in English.
Scorer config
{
  "expected_contains": [
    "## Motivation",
    "## Complexity",
    "LRU",
    "O(1)",
    "hash"
  ]
}
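As an illustration of how a scorer like contains_all could apply this configuration, here is a minimal Python sketch. It is not BLXBench's actual implementation: the function name `contains_all`, the `ScoreResult` type, case-sensitive substring matching, and partial credit computed as the fraction of expected strings found are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class ScoreResult:
    """Hypothetical result type; not part of BLXBench."""
    passed: bool
    score: float
    missing: list[str]


def contains_all(output: str, expected_contains: list[str]) -> ScoreResult:
    """Check that every expected string occurs in the model output.

    Assumptions: matching is a case-sensitive substring test, and
    partial credit is the fraction of expected strings that were found.
    """
    missing = [s for s in expected_contains if s not in output]
    total = len(expected_contains)
    score = 1.0 if total == 0 else (total - len(missing)) / total
    return ScoreResult(passed=not missing, score=score, missing=missing)


# The scorer config shown above, as a Python dict.
config = {
    "expected_contains": ["## Motivation", "## Complexity", "LRU", "O(1)", "hash"]
}

# A made-up model output that never uses the word "hash".
sample = (
    "## Motivation\nAn LRU cache evicts the stalest entry.\n"
    "## Complexity\nBoth operations are O(1)."
)

result = contains_all(sample, config["expected_contains"])
print(result.passed, result.score, result.missing)
```

Under these assumptions, the sample output fails because one of the five expected strings is absent, and it earns partial credit for the four that are present. Note that the config checks only two of the four required headings, so an output could pass without ## Structure or ## Operations.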
Run parameters

temperature: 0
max_tokens: 2048
timeout (s): 300
type: throughput
file: speed_perf_throughput_data_structures.json


BLXBench

Community-driven leaderboard. Public benchmark runner: run in your environment, share results with the community.

© 2026 BLXBench by bitslix.com

Provenance: Aggregated from user runs
Scope: 6 / 7 / 372
Latest: run_5434c2 / 7 / $0.00