Agent Operating Contract

Mission: Convert user intent into complete, verified software changes with minimal back-and-forth.

Documentation map: context/README.md — progressive disclosure, not encyclopedia. Start there, follow links for depth.

Default Execution Strategy

Use a superpowers-style iteration loop for all non-trivial work. The default mode is not "big-bang implementation"; it is rapid, evidence-driven convergence.

Core Loop

Frame — Restate the user goal as an observable outcome, not an implementation guess
Reduce — Shrink scope to the smallest vertical slice that proves progress
Locate — Find the spec, affected boundaries, existing contracts, and canonical docs
Test First — Add or identify the failing check that proves the gap
Implement — Change the minimum code needed to make the check pass
Verify — Run the strongest relevant verification, not the cheapest plausible command
Extract — Update docs, invariants, and follow-up risks before handoff

Iteration Rules

Prefer vertical slices — Ship one end-to-end behavior at a time instead of editing many layers speculatively
Prefer evidence over intuition — Logs, tests, contracts, and code paths outrank guesses
Prefer existing seams — Reuse current traits, managers, commands, and document entry points before introducing new structure
Prefer small batches — One user-visible outcome or one invariant per iteration
Checkpoint frequently — After each slice, reassess whether the remaining plan is still the right plan
Surface uncertainty early — State assumptions, missing specs, and external risks before they compound
Escalate before drift — If the task starts widening, stop and reframe instead of silently expanding scope

Planning Threshold

Create or update a plan before coding when work is multi-file, cross-layer, ambiguous, risky, or likely to take more than one tight iteration.

Use plans to:

define the target behavior and proof of completion
split work into atomic implementation units
record risks, dependencies, and deferred follow-ups
preserve momentum across agent handoffs

See context/plans/README.md for lifecycle and format.

Definition of Progress

Progress is only real when it leaves behind at least one of:

a passing failing-first test
a verified behavioral fix
a narrowed root cause with concrete evidence
a cleaned-up invariant or canonical document

Code churn without stronger evidence is not progress.

Brainstorm Output Directory

Brainstorm requirements documents go to context/brainstorms/ (not docs/brainstorms/).

Architecture Principles

Headless Backend Architecture

The application follows a strict backend-driven architecture where the Rust backend is the complete product implementation:

Layer	Responsibility	Constraint
Backend (Headless)	Complete product logic, all functionality, state management, business rules	Must work independently without frontend. Given correct config files, the backend should be fully functional.
Frontend (UI Layer)	State display, user interaction, visual feedback	Only renders what backend emits. No business logic, no state decisions, no functional control flow.

Implications:

Backend owns all state — Recording state, transcription state, settings, history, and any product state must be managed entirely in Rust. Frontend only observes via IPC events and queries.
Backend owns all control — Recording start/stop/cancel, transcription retry, model download, settings changes — all triggered via IPC commands, executed entirely in backend.
Frontend is reactive — UI components respond to backend events (RECORDING_STATE_CHANGED, TRANSCRIPTION_COMPLETE, etc.) and render accordingly. No proactive state mutations.
CLI/Headless capability — The backend should be usable from CLI or scripts without any frontend process. All features accessible via Tauri commands alone.
Config-driven independence — Given a valid config file and environment, backend should produce correct behavior without frontend guidance.

Anti-patterns to avoid:

Frontend deciding "recording is too short, skip saving" → Backend should handle this.
Frontend calculating "next retry delay" → Backend decides retry policy.
Frontend managing "current hotkey state" → Backend tracks hotkey registration lifecycle.
Frontend interpreting "error means show retry button" → Backend emits structured events, frontend renders.

Verification:

When adding a new feature, ask: "Can this work from CLI/headless mode?" If no, the logic belongs in frontend and violates this architecture.

Non-Negotiable Rules

#	Rule	Enforcement
1	Spec-first — Never code from vague intuition. Find the spec at `context/feat/[name]/[ver]/prd/erd.md` first.	Blocker
2	TDD/BDD — `spec → failing test → implement → refactor → verify`. Regressions ship with test first.	Blocker
3	No fabrication — Never invent APIs, files, commands, test results, or behavior.	Blocker
4	No fake completion — Never claim done without running verification. Never present mocks as finished.	Blocker
5	English-only — All code, identifiers, comments, test names, commit messages in English.	CI
6	No Git operations — Do not perform any git operations (commit, push, checkout, modify history) without explicit user request.	Manual
7	No type suppression — `as any`, `@ts-ignore`, `@ts-expect-error`, empty catch blocks forbidden.	CI
8	Continuous Documentation — Post-task: extract workflow, compare with docs, and optimize/correct unreasonable parts based on first principles.	Manual

Product priority: STT accuracy > STT stability > user experience > speed.

Volcengine interface: Always bigmodel_nostream. Bidirectional interfaces have lower accuracy — never use unless user explicitly acknowledges the tradeoff.

Architecture boundary rules: See context/architecture/layers.md for layer dependency and module boundary constraints (including recording pipeline separation).

Recovery Protocol

3 consecutive failures → STOP, REVERT, DOCUMENT, ESCALATE.

Forbidden: as any, @ts-ignore, empty catch blocks, deleting failing tests, background_cancel(all=true).

When recovery is triggered:

Capture the failed hypothesis, commands, and observed evidence
Revert only the unverified change set that caused the regression
Narrow the problem frame before attempting another iteration
Ask for help or direction instead of continuing blind

Verification Commands

# Rust
cd apps/desktop/src-tauri && cargo test && cargo clippy --all-features -- -D warnings && cargo fmt -- --check

# Frontend
pnpm --filter @ariatype/desktop build && pnpm --filter @ariatype/shared typecheck && pnpm check:i18n

# Website
pnpm --filter @ariatype/website build && pnpm --filter @ariatype/website lint

Where to Find Things

Need	Document
Project principles and documentation entry points	`context/README.md`
New contributor/agent onboarding	`context/guides/onboarding.md`
System architecture and layers	`context/architecture/README.md`
Data flow and state machines	`context/architecture/data-flow.md`
Architecture decisions (ADRs)	`context/architecture/decisions/README.md`
Test pyramid and coverage gates	`context/spec/testing.md`
How to write and run tests	`context/guides/testing.md`
Engine API contract testing	`context/spec/engine-api-contract.md`
Logging standard	`context/spec/logs.md`
Debugging and log investigation	`context/guides/debugging.md`
Adding a new STT provider	`context/guides/adding-stt-provider.md`
Adding a new Polish provider	`context/guides/adding-polish-provider.md`
Rust coding style	`context/conventions/rust-style.md`
TypeScript/React coding style	`context/conventions/typescript-style.md`
Design system and UI patterns	`context/conventions/design-system.md`
Quality grades by domain	`context/quality/README.md`
Doc gardening process	`context/quality/gardening.md`
STT provider API reference	`context/reference/providers/stt.md`
Polish provider API reference	`context/reference/providers/polish.md`
Boundary rules and module constraints	`context/architecture/layers.md`
Feature specifications	`context/feat/<name>/<version>/prd/erd.md`
Execution plans	`context/plans/README.md`
Package-specific guides	`apps/desktop/CONTRIBUTING.md`, `packages/*/CONTRIBUTING.md`

Boundaries

src-tauri/capabilities/ — never modify without asking
lib.rs — all commands registered here
src/lib/tauri.ts — all new IPC calls go here (typed wrappers only, never raw invoke())

Three Principles of Testing

E2E test cases are black-box and should not concern themselves with internal system logic and implementation details.
Avoid putting the cart before the horse. Functional code must never be modified for the convenience of test cases, nor should it be adjusted to fit test case requirements.
When test cases fail, analysis and judgment shall be made in accordance with the above two principles.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent Operating Contract

Default Execution Strategy

Core Loop

Iteration Rules

Planning Threshold

Definition of Progress

Brainstorm Output Directory

Architecture Principles

Headless Backend Architecture

Non-Negotiable Rules

Recovery Protocol

Verification Commands

Where to Find Things

Boundaries

Three Principles of Testing

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

Agent Operating Contract

Default Execution Strategy

Core Loop

Iteration Rules

Planning Threshold

Definition of Progress

Brainstorm Output Directory

Architecture Principles

Headless Backend Architecture

Non-Negotiable Rules

Recovery Protocol

Verification Commands

Where to Find Things

Boundaries

Three Principles of Testing