AG-823

Capability-Gain Rate Limiting and Improvement Audit

Meta-Governance & Assurance ~5 min read AGS v2.1 · 2026-06-06
EU AI Act NIST AI RMF ISO 42001

AGS Frontier Autonomy (Group K) | Meta-Governance & Assurance | Version 3.0

1. Definition

Capability-Gain Rate Limiting and Improvement Audit governs the requirement that increases in an agent's (or its successors') capability proceed in bounded, monitored, auditable increments with safety checkpoints between them — preventing an uncontrolled, rapid capability jump (an "intelligence explosion") from outrunning evaluation and oversight.

Where AG-822 controls *whether* an agent may self-modify, this dimension controls *how fast* capability is allowed to grow when improvement is permitted, ensuring each increment is evaluated and gated before the next.

2. Scope

In scope: bounding the size of capability-improvement steps; safety checkpoints/evaluation between increments; auditable records of each improvement step ("improvement-operator cards"); applying to AI-assisted and self-improvement loops.

Out of scope: authorisation of self-modification itself (AG-822) and the capability tripwire (AG-821). This dimension governs *the rate and auditability of capability growth*.

3. Why This Matters

The distinctive danger of recursive self-improvement is speed: capability can compound faster than evaluation, gating, and oversight can keep up, leaving an organisation governing a system several generations behind the one actually running. Rate limiting with mandatory inter-step evaluation keeps each capability increment within the reach of the safety pipeline, and an improvement audit ensures the gains are real, understood, and reversible.

4. Requirements

5. Maturity Model

6. Test Criteria

Test 6.1: Bounded Steps with Checkpoints

Test 6.2: Acceleration Pause

Test 6.3: Replayable Audit & Rollback

7. Scoring

ScoreCriteria
0Capability can increase rapidly with no rate limiting or inter-step evaluation
1Improvements evaluated before broad deployment but step size/rate unbounded
2Bounded steps, inter-step checkpoints, replayable records, acceleration pause
3Reversible increments, cumulative-trajectory gating, agent-isolated config, authority disclosure

8. Failure Scenarios

Scenario A — Outrun Evaluation: An AI-assisted training loop advances capability across several generations between scheduled reviews; the deployed system is far more capable than the last one evaluated, and its gating is stale.

Scenario B — Compounding Under Radar: Each step is individually small and "below threshold," but cumulative gains cross a critical capability level that trajectory tracking would have caught.

Scenario C — No Rollback: A capability increment introduces a dangerous behaviour, but without checkpointed states the organisation cannot revert to the last safe version and must take the system down entirely.

9. Regulatory Mapping

RequirementEU AI ActNIST AI RMFISO 42001
R1: Bounded improvement incrementsArt. 55 — Risk mitigationGOVERN 1.3 — Risk-based activityClause 6.1 — Actions to address risk
R2: Inter-step safety checkpointsArt. 55 — Model evaluationMANAGE 4.1 — Post-deployment monitoringClause 8.3 — Verification
R3: Replayable improvement auditArt. 12 — Record-keepingGOVERN 2.1 — AccountabilityClause 9.1 — Monitoring and measurement
R4: Acceleration monitoring + pauseArt. 55 — Systemic-risk monitoringMEASURE 3.1 — Emergent-risk trackingClause 9.1 — Monitoring and measurement
R5: Reversible/checkpointed incrementsArt. 15 — Robustness, fail-safeMANAGE 2.3 — RecoveryClause 8.1 — Operational control
R6: Cumulative-trajectory gatingArt. 51 — Capability classificationGOVERN 1.3 — Risk-based activityClause 6.1 — Actions to address risk
R8: Authority disclosureArt. 55 — ReportingGOVERN 4.3 — Information sharing

EU AI Act — Article 55 and Article 9

Article 55 requires ongoing systemic-risk assessment and mitigation; uncontrolled capability gain is the systemic risk that most directly defeats such assessment. Article 9 requires lifecycle risk management of the improvement process itself.

NIST AI RMF — GOVERN 1.3, MANAGE 4.1

GOVERN 1.3 (risk-based activity) and MANAGE 4.1 (post-deployment monitoring with response) require keeping capability growth within the reach of the safety pipeline.

ISO 42001 — Clause 6.1, Clause 9.1

Clause 6.1 (actions to address risks) and Clause 9.1 (monitoring) require bounding and monitoring the rate of capability change as a managed risk.

Cite this protocol
AgentGoverning. (2026). AG-823: Capability-Gain Rate Limiting and Improvement Audit. The Protocols of AI Agent Governance, AGS v2.1. agentgoverning.com/protocols/AG-823