AGS Frontier Autonomy (Group K) | Meta-Governance & Assurance | Version 3.0
Capability-Gain Rate Limiting and Improvement Audit governs the requirement that increases in an agent's (or its successors') capability proceed in bounded, monitored, auditable increments with safety checkpoints between them — preventing an uncontrolled, rapid capability jump (an "intelligence explosion") from outrunning evaluation and oversight.
Where AG-822 controls *whether* an agent may self-modify, this dimension controls *how fast* capability is allowed to grow when improvement is permitted, ensuring each increment is evaluated and gated before the next.
In scope: bounding the size of capability-improvement steps; safety checkpoints/evaluation between increments; auditable records of each improvement step ("improvement-operator cards"); applying to AI-assisted and self-improvement loops.
Out of scope: authorisation of self-modification itself (AG-822) and the capability tripwire (AG-821). This dimension governs *the rate and auditability of capability growth*.
The distinctive danger of recursive self-improvement is speed: capability can compound faster than evaluation, gating, and oversight can keep up, leaving an organisation governing a system several generations behind the one actually running. Rate limiting with mandatory inter-step evaluation keeps each capability increment within the reach of the safety pipeline, and an improvement audit ensures the gains are real, understood, and reversible.
Test 6.1: Bounded Steps with Checkpoints
Test 6.2: Acceleration Pause
Test 6.3: Replayable Audit & Rollback
| Score | Criteria |
|---|---|
| 0 | Capability can increase rapidly with no rate limiting or inter-step evaluation |
| 1 | Improvements evaluated before broad deployment but step size/rate unbounded |
| 2 | Bounded steps, inter-step checkpoints, replayable records, acceleration pause |
| 3 | Reversible increments, cumulative-trajectory gating, agent-isolated config, authority disclosure |
Scenario A — Outrun Evaluation: An AI-assisted training loop advances capability across several generations between scheduled reviews; the deployed system is far more capable than the last one evaluated, and its gating is stale.
Scenario B — Compounding Under Radar: Each step is individually small and "below threshold," but cumulative gains cross a critical capability level that trajectory tracking would have caught.
Scenario C — No Rollback: A capability increment introduces a dangerous behaviour, but without checkpointed states the organisation cannot revert to the last safe version and must take the system down entirely.
| Requirement | EU AI Act | NIST AI RMF | ISO 42001 |
|---|---|---|---|
| R1: Bounded improvement increments | Art. 55 — Risk mitigation | GOVERN 1.3 — Risk-based activity | Clause 6.1 — Actions to address risk |
| R2: Inter-step safety checkpoints | Art. 55 — Model evaluation | MANAGE 4.1 — Post-deployment monitoring | Clause 8.3 — Verification |
| R3: Replayable improvement audit | Art. 12 — Record-keeping | GOVERN 2.1 — Accountability | Clause 9.1 — Monitoring and measurement |
| R4: Acceleration monitoring + pause | Art. 55 — Systemic-risk monitoring | MEASURE 3.1 — Emergent-risk tracking | Clause 9.1 — Monitoring and measurement |
| R5: Reversible/checkpointed increments | Art. 15 — Robustness, fail-safe | MANAGE 2.3 — Recovery | Clause 8.1 — Operational control |
| R6: Cumulative-trajectory gating | Art. 51 — Capability classification | GOVERN 1.3 — Risk-based activity | Clause 6.1 — Actions to address risk |
| R8: Authority disclosure | Art. 55 — Reporting | GOVERN 4.3 — Information sharing | — |
Article 55 requires ongoing systemic-risk assessment and mitigation; uncontrolled capability gain is the systemic risk that most directly defeats such assessment. Article 9 requires lifecycle risk management of the improvement process itself.
GOVERN 1.3 (risk-based activity) and MANAGE 4.1 (post-deployment monitoring with response) require keeping capability growth within the reach of the safety pipeline.
Clause 6.1 (actions to address risks) and Clause 9.1 (monitoring) require bounding and monitoring the rate of capability change as a managed risk.