AGS Agentic Runtime | Runtime Execution, Workflow & State | Version 2.2
Agent Compute and Cost Budget Governance governs hard limits and active management of the compute, token, API-call, and monetary spend an agent may consume — per task, per time-window, and in aggregate — with anomaly detection and automatic throttling or halting when budgets are approached or breached.
Autonomous agents can loop, recurse, fan out to sub-agents, and chain tool calls in ways that consume unbounded resources and cost. This dimension provides the economic and resource circuit-breaker that prevents runaway consumption — a denial-of-wallet and availability control distinct from per-tool billing caps (AG-375), which it generalises to the whole agent.
In scope: compute/token/API/monetary budgets per agent and per task, aggregate caps, spend-anomaly detection, auto-throttling and hard-halt on breach, and budget attribution to owners.
Out of scope: per-connector tool billing caps (AG-375), action-rate governance (AG-004), and financial transaction mandates (AG-809). This dimension governs *resource and cost budgets for agent execution*.
A single mis-prompted or adversarially-triggered agent can burn enormous compute and incur runaway cost in minutes — through infinite planning loops, recursive sub-agent spawning, or tool-call storms — degrading service for others and producing "denial-of-wallet" losses. Budgets with anomaly detection and hard halts convert an open-ended failure into a bounded, attributable, recoverable event, and are increasingly a standard runtime-governance expectation.
Test 6.1: Hard Halt on Breach
Test 6.2: Delegation Attribution
Test 6.3: Anomaly Detection
| Score | Criteria |
|---|---|
| 0 | No compute/cost budgets; agents can consume unbounded resources |
| 1 | Monetary/API caps with alerts but no runtime-enforced compute/token budgets |
| 2 | Runtime-enforced multi-resource budgets, sub-agent attribution, anomaly detection, safe-state halt |
| 3 | Risk-tiered budgets, predictive throttling, owner attribution, audited ceiling-preserving overrides |
Scenario A — Denial of Wallet: An adversarial input sends an agent into a recursive tool-call loop overnight, incurring a six-figure inference bill before anyone notices. Runtime-enforced budgets with anomaly detection would have halted it in minutes.
Scenario B — Sub-Agent Evasion: An agent at its budget spawns sub-agents to continue the work, each under a fresh allowance. Without delegation attribution, the originator's cap is meaningless.
Scenario C — Silent Override: An operator raises a budget to clear a backlog and inadvertently removes the hard ceiling; a later loop runs unbounded. Ceiling-preserving, audited overrides would have contained it.
| Requirement | EU AI Act | NIST AI RMF | ISO 42001 |
|---|---|---|---|
| R1: Multi-resource budgets | Art. 15 — Robustness | MANAGE 4.1 — Post-deployment monitoring | A.4 — Resources for AI systems |
| R2: Runtime-enforced halt | Art. 15 — Robustness, fail-safe | MANAGE 2.4 — Deactivation | Clause 8.1 — Operational control |
| R3: Sub-agent attribution | Art. 12 — Traceability | MEASURE 2.4 — Production monitoring | Clause 8.1 — Operational control |
| R4: Anomaly detection | Art. 15 — Robustness | MEASURE 2.4 — Production monitoring | Clause 9.1 — Monitoring and measurement |
| R5: Owner attribution + logging | Art. 12 — Record-keeping | GOVERN 2.1 — Accountability | A.4 — Resources for AI systems |
| R6: Safe-state halt + authorised resume | Art. 14 — Human oversight | MANAGE 2.4 — Deactivation | Clause 8.1 — Operational control |
| R7: Risk-tiered budgets | Art. 9 — Risk management | GOVERN 1.3 — Risk-based activity | Clause 6.1 — Actions to address risk |
| R8: Audited, ceiling-preserving overrides | Art. 12 — Record-keeping | GOVERN 2.1 — Accountability | Clause 8.1 — Operational control |
Article 15 requires robustness and fail-safe behaviour, including resilience to conditions that could cause runaway operation; budgets are the resource fail-safe. Article 9 requires managing such operational risks across the lifecycle.
MANAGE 4.1 (post-deployment monitoring incl. response) and MEASURE 2.6 (safety evaluation) cover detecting and halting runaway resource consumption.
Clause 8.1 (operational control) and Annex A.4 (resources for AI systems) require controlled, bounded resource use by AI systems.