Name	Name	Last commit message	Last commit date
parent directory ..
solana-gym-env	solana-gym-env
AGENTS.md	AGENTS.md
CLAUDE.md	CLAUDE.md
README.md	README.md
__init__.py	__init__.py
eliza_agent.py	eliza_agent.py
eliza_explorer.py	eliza_explorer.py
exploration_strategy.py	exploration_strategy.py
instruction_catalog.py	instruction_catalog.py
setup.sh	setup.sh
skill_templates.py	skill_templates.py
test_solana_benchmark.py	test_solana_benchmark.py
trajectory.py	trajectory.py

Name

Last commit message

Last commit date

exploration_strategy.py

instruction_catalog.py

setup.sh

skill_templates.py

test_solana_benchmark.py

trajectory.py

Solana-Gym Benchmark

Evaluates an AI agent's ability to discover Solana on-chain instructions by constructing and submitting unsigned transactions against a Surfpool sandbox. The agent works through 8 Solana programs (System, Token, Token-2022, Memo, Compute Budget, Stake, ATA, Address Lookup Table) and tries to discover all 236 unique program/discriminator pairs. Score is discovered / 236.

Quick Start

# One-time setup (installs Python + Bun deps, checks for surfpool)
bash setup.sh

# Run with external Surfpool (start surfpool first)
surfpool start -u https://api.mainnet-beta.solana.com --no-tui &
USE_EXTERNAL_SURFPOOL=true \
  ENVIRONMENT_CONFIG=voyager/environments/basic_env.json \
  MODEL_NAME=anthropic/claude-sonnet-4.6 \
  python -m benchmarks.solana.eliza_explorer

# Via orchestrator
python -m benchmarks.orchestrator run --benchmarks solana --provider cerebras --model gpt-oss-120b

See AGENTS.md for full env-var reference, harness options, and test commands. The upstream gym environment lives in solana-gym-env/ with its own README.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Solana-Gym Benchmark

Quick Start

FilesExpand file tree

solana

Directory actions

More options

Directory actions

More options

Latest commit

History

solana

Folders and files

parent directory

README.md

Solana-Gym Benchmark

Quick Start