Smart Energy Arbitrage 2026

=====================

Operator Preview. Private Operator Preview. Defense

Commission proof pack for a Ukrainian BESS energy-arbitrage operator preview:

Start with the project intro:

This repository implements a source-backed DAM/IDM hourly recommendation preview for a human BESS operator. It combines Dagster/FastAPI/Postgres, a Nuxt operator dashboard, deterministic LP/V2+ evidence, and bounded DT/HF shadow comparisons for research context. The defended boundary is explicit: operator preview, read-model evidence, no market execution, no ProposedBid, and no autonomous dispatch.

Open first during a live review:

/operator for the primary demo surface.
/defense for evidence and explanation panels.
FastAPI /docs for read-model endpoint contracts.
Published thesis paper for the full academic write-up.
Software product evidence card for the 50-point software/experimental rubric.
Final defense runbook for the exact fallback path if the live stack is unavailable.

Public / Post-Defense Work

Public preview: Ukraine BESS Arbitrage Index.

This page is a post-defense public demo and portfolio surface built after the defended operator-preview submission. It is not the private operator dashboard, not the defended private operator functionality, and not a market-control system. It uses public/static JSON evidence for a no-execution BESS arbitrage index: no market bids, no dispatch commands, no private tenant controls, and no YASNO/DTEK integration claim.

Use the public page as a lead-generation and research portfolio entry. Use /operator, /defense, and the evidence links below for the original commission-facing operator-preview path.

Publication automation: the public JSON publisher runs from .github/workflows/public-bess-index.yml. After it commits fresh JSON under dashboard/public/data/bess-arbitrage-index, it triggers a Vercel production refresh for energy-index.full-iron.com. Configure either VERCEL_DEPLOY_HOOK_URL, or the CLI fallback secrets VERCEL_TOKEN, VERCEL_ORG_ID, and VERCEL_PROJECT_ID.

Social automation: .github/workflows/public-bess-social-daily.yml creates a daily X/Twitter post for the dedicated index account after the public JSON workflow succeeds. .github/workflows/public-bess-social-weekly.yml creates a weekly personal post for x.com/fefelov on Mondays. Both workflows generate a PNG/text outbox artifact even when API credentials are missing, and post only when the matching X secrets are configured. Required daily-account secrets are X_DAILY_API_KEY, X_DAILY_API_SECRET, X_DAILY_ACCESS_TOKEN, and X_DAILY_ACCESS_TOKEN_SECRET; personal-account secrets use the same names with the X_PERSONAL_ prefix.

Public / Post-Defense Work links the public demo while separating it from the private operator functionality.
Commission Review Checklist maps the project to the 50-point software/experimental rubric.
Commission 90-Second Path gives the shortest defense walkthrough.
What It Proves states the measured evidence and the non-execution boundary.
Demo Path shows the live /operator and /defense sequence.
Architecture explains the source-to-dashboard read model.
Evidence Visuals collects screenshots, diagrams, and regret/readiness figures.
Primary Vs Supporting Lanes separates the final product surface from archived research context.
Quickstart covers Windows, MacBook, and Linux local startup.
API Reference lists the read-model endpoints.
Verification gives the commands used for tests and audits.
Repository Map explains where reviewers should look.
Claim Boundaries lists approved wording.
Troubleshooting covers common demo blockers.
License And Academic Use explains repository usage terms.

Commission Review Checklist

Use this repository as a product/evidence walkthrough, not as a raw research workspace. The fastest way to grade the 50% software/experimental criterion is to inspect criteria coverage, evidence links, and verification output:

University criterion	What to inspect first	Evidence in this repo
Declared-task fit	`/operator` tenant/venue/date preview and fail-closed source blockers	Final defense runbook
Correct methods	LP/oracle comparator, source-first price context, Pydantic non-execution contracts	API reference, metrics atlas
Technical complexity	Dagster assets, forecast-store rows, FastAPI read models, Nuxt dashboard, multi-strategy evidence	Architecture/data flow
Code/experiment quality	Clean entry points, documented evidence, explicit primary-vs-supporting lanes	Repository checklist
Testing/validation	Typecheck, unit tests, Dagster definitions, final repo audit, smoke checks	Verification

For the 20% presentation/defense criterion, the deck follows the same path: problem -> evidence gates -> LP mechanism -> regret results -> DT/HF shadow limits -> demo bridge -> conclusion. The rubric mapping is in docs/technical/FINAL_UNIVERSITY_RUBRIC_MATRIX.md. For a tighter commission walkthrough, use the software product evidence card.

Commission 90-Second Path

State the product boundary: DAM/IDM hourly operator preview, read-model only.
Show the hero dashboard screenshot, then open /operator live if the stack is running.
Point to tenant, venue, date, source-readiness, and no-execution controls.
Show /defense or the evidence visuals below for regret, readiness, and guarded shadow evidence.
Open Verification or the GitHub Actions check and show that typecheck, unit tests, link checks, and final repo audit pass.
Close with: market_execution_enabled=false, no ProposedBid, no market payload, human review required.

What It Proves

The defendable product surface is an evidence system, not autonomous execution.

Layer	Current result	Status
Strict LP/oracle comparator	310.58 UAH mean regret	evaluator, not UI default
V2 forecast selector	206.37 UAH mean regret	historical baseline
Schedule/Value Learner V2+	174.77 UAH mean regret	headline/default evidence
DT/V2+ safe-switch	168.16 UAH mean regret, 4 switches / 86 abstentions	secondary shadow evidence
HF value-aligned shadow	158.71 UAH frozen mean regret signal, 20/32 non-fallback days, 8/8 readiness	manual shadow/demo preview

Boundary: market_execution_enabled=false, no ProposedBid, no market order payload, no production LP replacement, and no V13 training claim. The canonical scope document is docs/technical/CURRENT_GOAL_BOUNDARY_V13.md.

Demo Path

Start the local stack.
Open /operator.
Select tenant client_003_dnipro_factory.
Select DAM or IDM.
Compare latest official, today, tomorrow, and day+2.
Point to the first-viewport boundary strip: preview only, no ProposedBid, no market payload, human review required.
Select HF live safe-switch value-aligned shadow for the fixed proof/demo cases.
Select HFDT live shadow preview for an arbitrary future date; it ranks source-backed forecast candidates while the V2+ fallback still shows deterministic forecast-LP actions when HFDT abstains.
Show one non-HOLD preview and one guarded HOLD/abstention.
End with the boundary: preview/read model only, no market execution.

The full defense runbook is in docs/technical/FINAL_DEFENSE_RUNBOOK.md.

Architecture

flowchart LR
  A["OREE DAM/IDM rows"] --> B["Bronze source assets"]
  F["NBEATSx/TFT forecast-store rows"] --> B
  T["Tenant battery/SOC context"] --> B
  B --> C["Silver readiness and feature tables"]
  C --> D["Strict LP/oracle evaluator"]
  C --> E["Schedule/Value Learner V2+"]
  C --> G["DT safe-switch shadow"]
  C --> H["HF value-aligned shadow"]
  D --> I["Regret/value evidence packets"]
  E --> I
  G --> I
  H --> I
  E --> J["FastAPI read-model API"]
  H --> J
  J --> K["Nuxt operator dashboard"]
  I --> L["Thesis figures and final evidence index"]

The dashboard renders recommendations, strategy evidence, readiness state, and guardrails. It does not emit market bids or dispatch commands.

Evidence Visuals

Operator And Defense Surfaces

Data And Decision Pipeline

Regret Ladder

Architecture Progression

HF Readiness Matrix

HF Shadow Flow

Curated evidence is indexed in docs/technical/FINAL_EVIDENCE_INDEX.md.

Primary Vs Supporting Lanes

The primary defense path is the operator-preview product surface:

README first screen;
/operator;
/defense;
FastAPI /docs;
final defense runbook;
final evidence index.

The long DFL/DT/V13 documents, HF/DT shadow packets, and historical weekly reports are supporting evidence. They are useful for questions, but they are not the legacy workspace or unfinished research lanes to open as the main demo. _legacy_smart-energy-ai is extraction/archive context only, not the delivered product surface.

Final review helpers:

Quickstart

Prerequisites

Windows PowerShell or Bash on macOS/Linux
Docker Desktop on Windows/macOS, or Docker Engine on Linux
Python dependencies managed by uv
Node.js/npm for the Nuxt dashboard
Optional CUDA-capable GPU for heavier research runs

Install

uv sync --extra dev
npm -C dashboard install

Use uv sync --all-extras only for full SOTA forecast adapter refreshes or heavy research materializations.

Start The Demo Stack

Run these from the repository root after uv sync --extra dev and npm -C dashboard install.

Windows:

.\scripts\start-local-project.ps1 -ApiPort 8000 -DashboardPort 64163

MacBook/macOS or Linux:

bash ./scripts/start-local-project.sh --api-port 8000 --dashboard-port 64163

The helper probes Docker with docker info, attempts to start Docker Desktop/service when available, starts the Compose support services, then starts FastAPI and the Nuxt dashboard.

Open:

API health: http://127.0.0.1:8000/health
Operator dashboard: http://127.0.0.1:64163/operator
Defense dashboard: http://127.0.0.1:64163/defense
FastAPI docs: http://127.0.0.1:8000/docs
Dagster: http://127.0.0.1:3001
MLflow: http://127.0.0.1:5000

Dagster asset search shortcuts:

uv run dg list defs --assets "tag:operator_preview=true"
uv run dg list defs --assets "tag:hfdt_live_shadow_preview=true"
uv run dg list defs --assets "tag:read_model_boundary=not_market_execution"

In the Dagster UI, use tag:dfl_v2_plus=true, tag:dfl_dt_v2_plus=true, tag:dfl_hfdt=true, tag:hfdt_live_shadow_preview=true, tag:operator_preview=true, and tag:read_model_boundary=not_market_execution to find V2+, DT/V2+, HFDT shadow evidence, final operator-preview evidence assets, and explicit read-model/non-execution assets.

If a fresh dashboard returns 404 or 502 for current backend routes, suspect a stale API on :8000 first:

docker compose stop api
.\api\start-dev.ps1 -Port 8000

On macOS/Linux:

docker compose stop api
SMART_ARBITRAGE_API_PORT=8000 bash ./api/start-dev.sh

For container-based API testing after route changes:

docker compose up -d --build api

Seed Forecast-Store Rows

Use these only when unpublished DAM/IDM preview horizons are missing:

.\.venv\Scripts\python.exe scripts\materialize_operator_preview_forecast_store.py --market-venue DAM --horizon-hours 72 --nbeatsx-max-steps 1 --tft-max-epochs 1
.\.venv\Scripts\python.exe scripts\materialize_operator_preview_forecast_store.py --market-venue IDM --horizon-hours 72 --nbeatsx-max-steps 1 --tft-max-epochs 1

These rows keep claim_boundary=operator_preview_forecast_rows_not_market_execution and market_execution_enabled=false.

API Reference

Local base URL: http://127.0.0.1:8000.

Group	Endpoint examples	Purpose
System	`GET /health`	process liveness
Tenants/weather	`GET /tenants`, `POST /weather/run-config`, `POST /weather/materialize`	tenant registry and upstream weather/source materialization
Operator preview	`GET /dashboard/operator-recommendation`, `GET /dashboard/baseline-lp-preview`, `GET /dashboard/shadow-recommendation-preview`	DAM/IDM hourly preview and manual shadow read models
Battery/safety	`GET /dashboard/battery-state`, `POST /dashboard/projected-battery-state`, `GET /dashboard/gatekeeper-validation-status`	SOC/SOH/projection and deterministic guardrail evidence
Research read models	`GET /dashboard/future-stack-preview`, `GET /dashboard/academic-mvp-readiness`, `GET /dashboard/decision-policy-preview`	thesis and strategy evidence surfaces

The full endpoint contract is in docs/technical/API_ENDPOINTS.md.

Example HF value-aligned shadow request:

Invoke-RestMethod "http://127.0.0.1:8000/dashboard/shadow-recommendation-preview?tenant_id=client_003_dnipro_factory&preview_source=hf_live_safe_switch_value_aligned_shadow&market_venue=DAM&target_delivery_date=2026-06-04"

Required response guarantees:

market_execution_enabled=false
promotion_gate_passed=false
dt_lava_ready=false
no proposed_bid
no market order payload

Verification

Focused backend checks:

.\.venv\Scripts\python.exe -m pytest -q tests\dfl\test_hf_live_safe_switch_preview.py tests\dfl\test_hf_value_aligned_forecast_readiness_audit.py tests\dfl\test_hf_live_safe_switch_value_aligned_promotion_proof.py tests\api\test_main.py -k "hf_live_safe_switch or shadow_recommendation_preview or hf_value_aligned"

Dashboard checks:

npm -C dashboard run typecheck
npm -C dashboard run test:unit
npm -C dashboard run smoke:hf-value-aligned

Final repository audit:

.\scripts\final_repo_audit.ps1 -SkipFullVerify -SkipSmoke

Strict final submission gate after the final commit/stage boundary:

.\scripts\final_repo_audit.ps1 -RequireCleanWorkingTree -SkipFullVerify -SkipSmoke
git status --short

Full wrapper when runtime permits:

.\.venv\Scripts\Activate.ps1
.\scripts\verify.ps1

Repository Map

Path	Purpose
`api/`	FastAPI app and dashboard endpoints
`dashboard/`	Nuxt operator and defense UI
`src/smart_arbitrage/`	Core Python package: data, DFL, gatekeeper, services
`scripts/`	Materializers, audits, smoke runs, demo helpers
`configs/`	Benchmark, calibration, V13, and research configs
`docs/technical/`	API, demo, architecture, boundary, and final evidence docs
`docs/thesis/`	Thesis chapters, figures, appendices, weekly reports, defense assets
`tests/`	Python tests for API, DFL, gatekeeper, resources, and research slices

Generated outputs, runtime caches, local data, and build artifacts are ignored. Curated submission artifacts should live under docs/.

Claim Boundaries

Use precise project language:

Do say	Do not say
DAM/IDM hourly recommendation preview	live trading bot
source-backed official/forecast context	synthetic price fallback
HF value-aligned shadow/read-model challenger	production controller
V2+ fallback/comparator remains default	HF replaced V2+ in production
LP-free live shadow request path	full LP replacement
no `ProposedBid`, no market payload	market-submittable bid engine
V13 is source-readiness/acquisition	V13 training passed

Troubleshooting

HF Shows Only HOLD

This can be correct. HF only shows non-HOLD when the selected candidate passes the value guard, tail-risk cap, deterministic safety checks, and SOC feasibility. If a guard fails, the selected preview abstains to HOLD/V2+ fallback.

Dashboard Says Price Context Is Missing

Check whether the selected venue/date has official rows or forecast-store rows. Run the forecast materializer for DAM/IDM, then refresh the dashboard.

Far-Future Date Blocks

Expected. Unsupported future dates should block cleanly instead of rendering synthetic prices.

Port Already In Use

.\scripts\start-local-project.ps1 -ApiPort 8010 -DashboardPort 64164

License And Academic Use

Original code is licensed under MIT. Thesis text, third-party papers, external datasets, and generated presentation/media assets are governed by their source terms unless explicitly stated otherwise. See NOTICE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 427 Commits
.github/workflows		.github/workflows
api		api
configs		configs
dashboard		dashboard
docker		docker
docs		docs
runs		runs
scripts		scripts
simulations		simulations
src		src
tests		tests
tools/project-intro-video		tools/project-intro-video
.dockerignore		.dockerignore
.gitignore		.gitignore
CITATION.cff		CITATION.cff
CONTEXT.md		CONTEXT.md
LICENSE		LICENSE
NOTICE.md		NOTICE.md
README.md		README.md
docker-compose.yaml		docker-compose.yaml
pyproject.toml		pyproject.toml
sitecustomize.py		sitecustomize.py
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

Smart Energy Arbitrage 2026

Operator Preview. Private Operator Preview. Defense

Public / Post-Defense Work

Table Of Contents

Commission Review Checklist

Commission 90-Second Path

What It Proves

Demo Path

Architecture

Evidence Visuals

Operator And Defense Surfaces

Data And Decision Pipeline

Regret Ladder

Architecture Progression

HF Readiness Matrix

HF Shadow Flow

Primary Vs Supporting Lanes

Quickstart

Prerequisites

Install

Start The Demo Stack

Seed Forecast-Store Rows

API Reference

Verification

Repository Map

Claim Boundaries

Troubleshooting

HF Shows Only HOLD

Dashboard Says Price Context Is Missing

Far-Future Date Blocks

Port Already In Use

License And Academic Use

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages