AG-807

Agent Compute and Cost Budget Governance

Runtime Execution, Workflow & State ~5 min read AGS v2.1 · 2026-06-06
EU AI Act NIST AI RMF ISO 42001

AGS Agentic Runtime | Runtime Execution, Workflow & State | Version 2.2

1. Definition

Agent Compute and Cost Budget Governance governs hard limits and active management of the compute, token, API-call, and monetary spend an agent may consume — per task, per time-window, and in aggregate — with anomaly detection and automatic throttling or halting when budgets are approached or breached.

Autonomous agents can loop, recurse, fan out to sub-agents, and chain tool calls in ways that consume unbounded resources and cost. This dimension provides the economic and resource circuit-breaker that prevents runaway consumption — a denial-of-wallet and availability control distinct from per-tool billing caps (AG-375), which it generalises to the whole agent.

2. Scope

In scope: compute/token/API/monetary budgets per agent and per task, aggregate caps, spend-anomaly detection, auto-throttling and hard-halt on breach, and budget attribution to owners.

Out of scope: per-connector tool billing caps (AG-375), action-rate governance (AG-004), and financial transaction mandates (AG-809). This dimension governs *resource and cost budgets for agent execution*.

3. Why This Matters

A single mis-prompted or adversarially-triggered agent can burn enormous compute and incur runaway cost in minutes — through infinite planning loops, recursive sub-agent spawning, or tool-call storms — degrading service for others and producing "denial-of-wallet" losses. Budgets with anomaly detection and hard halts convert an open-ended failure into a bounded, attributable, recoverable event, and are increasingly a standard runtime-governance expectation.

4. Requirements

5. Maturity Model

6. Test Criteria

Test 6.1: Hard Halt on Breach

Test 6.2: Delegation Attribution

Test 6.3: Anomaly Detection

7. Scoring

ScoreCriteria
0No compute/cost budgets; agents can consume unbounded resources
1Monetary/API caps with alerts but no runtime-enforced compute/token budgets
2Runtime-enforced multi-resource budgets, sub-agent attribution, anomaly detection, safe-state halt
3Risk-tiered budgets, predictive throttling, owner attribution, audited ceiling-preserving overrides

8. Failure Scenarios

Scenario A — Denial of Wallet: An adversarial input sends an agent into a recursive tool-call loop overnight, incurring a six-figure inference bill before anyone notices. Runtime-enforced budgets with anomaly detection would have halted it in minutes.

Scenario B — Sub-Agent Evasion: An agent at its budget spawns sub-agents to continue the work, each under a fresh allowance. Without delegation attribution, the originator's cap is meaningless.

Scenario C — Silent Override: An operator raises a budget to clear a backlog and inadvertently removes the hard ceiling; a later loop runs unbounded. Ceiling-preserving, audited overrides would have contained it.

9. Regulatory Mapping

RequirementEU AI ActNIST AI RMFISO 42001
R1: Multi-resource budgetsArt. 15 — RobustnessMANAGE 4.1 — Post-deployment monitoringA.4 — Resources for AI systems
R2: Runtime-enforced haltArt. 15 — Robustness, fail-safeMANAGE 2.4 — DeactivationClause 8.1 — Operational control
R3: Sub-agent attributionArt. 12 — TraceabilityMEASURE 2.4 — Production monitoringClause 8.1 — Operational control
R4: Anomaly detectionArt. 15 — RobustnessMEASURE 2.4 — Production monitoringClause 9.1 — Monitoring and measurement
R5: Owner attribution + loggingArt. 12 — Record-keepingGOVERN 2.1 — AccountabilityA.4 — Resources for AI systems
R6: Safe-state halt + authorised resumeArt. 14 — Human oversightMANAGE 2.4 — DeactivationClause 8.1 — Operational control
R7: Risk-tiered budgetsArt. 9 — Risk managementGOVERN 1.3 — Risk-based activityClause 6.1 — Actions to address risk
R8: Audited, ceiling-preserving overridesArt. 12 — Record-keepingGOVERN 2.1 — AccountabilityClause 8.1 — Operational control

EU AI Act — Article 15 and Article 9

Article 15 requires robustness and fail-safe behaviour, including resilience to conditions that could cause runaway operation; budgets are the resource fail-safe. Article 9 requires managing such operational risks across the lifecycle.

NIST AI RMF — MANAGE 4.1, MEASURE 2.6

MANAGE 4.1 (post-deployment monitoring incl. response) and MEASURE 2.6 (safety evaluation) cover detecting and halting runaway resource consumption.

ISO 42001 — Clause 8.1, A.4

Clause 8.1 (operational control) and Annex A.4 (resources for AI systems) require controlled, bounded resource use by AI systems.

Cite this protocol
AgentGoverning. (2026). AG-807: Agent Compute and Cost Budget Governance. The Protocols of AI Agent Governance, AGS v2.1. agentgoverning.com/protocols/AG-807