BLXBench Docs
BLXBench Docs
LeaderboardOur TestsSponsor / PartnershipDocumentationInstallationQuick StartTUICommandsHeadless ModeConfigurationLeaderboardOur TestsAccountAboutFAQSupport

Commands

Complete reference for all blxbench commands.

This reference assumes the blxbench command is available (install globally as @bitslix/blxbench).

Interactive TUI

For the visual TUI interface, see TUI Guide.

Running Benchmarks

# Start interactive TUI
blxbench

From TUI

The TUI is command-driven. Type /help to list available commands and use Tab to complete suggestions.

CommandDescription
/showShow current run configuration
/set provider or /providerPick a provider interactively
`/set models [listid,id]or/models`
`/set categories [*a,b]`
/set levels easy,medium,hardFilter difficulty levels
/set limit NLimit tests per category
/set ratelimit RPM-or-offThrottle provider calls
`/set fail-fast [onoff]`
`/set report htmljson
`/report submit onoff`
/auth login, /auth logout, /auth whoamiManage web account credentials
/playwright status/install/uninstallManage Playwright Chromium
/runStart the benchmark

Headless Mode

Run benchmarks without the TUI when stdout is not a TTY, or force it with --headless. Pass options directly — there is no run subcommand:

blxbench --headless --provider <alias> --models <model-id> [more-model-ids...]

See Headless Mode for CI/CD integration.

Options

FlagDescriptionDefault
--providerProvider aliasopr (OpenRouter)
--modelsModel ID(s)(required)
--api-keySets BLXBENCH_API_KEY for this process—
--tests-dirPath to tests directoryBuilt-in tests
--categoryFilter categoriesAll
--levelFilter difficultyAll
--limitMax tests per categoryAll
--save-jsonOutput JSON pathAuto
--fail-fastStop on first failurefalse
--ratelimitRequests per minute60 (when flag has no value)
--dotenv-pathCustom .env file.env
--clearClear results dirfalse
--install-chromiumInstall Playwrightfalse
--skip-render-validationSkip UI render stage for coding_uifalse
--submitUpload report after runfalse

Utility Commands

Version

blxbench --version
blxbench -V

Prints blxbench <semver>. The TUI footer shows the same version as v<semver>.

Clear Results

blxbench --headless --clear

Removes generated artifacts while preserving ranking files.

Install Chromium

blxbench --headless --install-chromium

Downloads Playwright Chromium for UI rendering tests.

Environment Variables

VariableDescription
OPENROUTER_API_KEYOpenRouter (opr)
OPENAI_API_KEYOpenAI adapter (oai)
HF_TOKENHugging Face (hgf)
TOGETHER_API_KEYTogether (tgr)
PORTKEY_API_KEYPortkey (ptk)
CLOUDFLARE_API_TOKENCloudflare (cfr)
BLXBENCH_API_KEYBLXBench API key for headless submit
BLXBENCH_SUBMITSet to 1 or true to upload after a headless run

Examples

Run all tests (OpenRouter):

blxbench --headless --provider opr --models openai/gpt-5.4-mini

Run specific categories:

blxbench --headless --provider opr --models openai/gpt-5.4-mini --category speed reasoning

Limit test count:

blxbench --headless --provider opr --models openai/gpt-5.4-mini --limit 5

Upload results:

blxbench --headless --provider opr --models openai/gpt-5.4-mini --submit

TUI

BLXBench interactive Terminal User Interface.

Headless Mode

Running benchmarks in automated environments.

On this page

Interactive TUIRunning BenchmarksFrom TUIHeadless ModeOptionsUtility CommandsVersionClear ResultsInstall ChromiumEnvironment VariablesExamples