AG-827

Pre-Committed Halt and Pause Conditions

Meta-Governance & Assurance ~6 min read AGS v2.1 · 2026-06-06
EU AI Act NIST AI RMF ISO 42001

AGS Frontier Autonomy (Group K) | Meta-Governance & Assurance | Version 3.0

1. Definition

Pre-Committed Halt and Pause Conditions governs an organisation's explicit, advance commitment to withhold, pause, or roll back development or deployment of a frontier agent when defined conditions are met — insufficient mitigations for an evaluated capability, a crossed threshold, a failed control-protocol test, or an unresolved safety concern — with the response specified before the situation arises.

This is the organisation-level analogue of an individual kill switch (AG-070) and the policy that makes capability gating (AG-801) binding: it pre-commits to *not proceeding* under stated conditions, removing the discretion to ship under deadline pressure.

2. Scope

In scope: the pre-committed conditions under which development/deployment is halted or paused; the responses; the decision authority and escalation; the prohibition on ad-hoc waivers of safety-critical conditions.

Out of scope: the technical kill switch (AG-070), capability gating mechanics (AG-801), and incident response (AG-026 and related). This dimension governs *the advance commitment to stop and the conditions that trigger it*.

3. Why This Matters

Safety decisions made under launch pressure tend toward proceeding. Pre-committing — while calm and before the specific product is at stake — to halt under defined conditions is what makes "we'll stop if it's unsafe" credible. It converts safety thresholds from aspirations into binding commitments, gives staff explicit authority to halt, and provides regulators and the public a concrete account of when the organisation will not proceed.

4. Requirements

5. Maturity Model

6. Test Criteria

Test 6.1: Condition Triggers Committed Response

Test 6.2: Override Is Non-Routine

Test 6.3: Protected Halt Channel

7. Scoring

ScoreCriteria
0No pre-committed halt/pause conditions; proceed-decisions are ad hoc
1Some conditions documented but no pre-specified response or authority
2Conditions + responses + authority + protected halt channel + non-routine overrides
3Independently-reviewed overrides, authority disclosure, safe-hold/resume evidence, strengthened over time

8. Failure Scenarios

Scenario A — Deadline Override: An evaluation flags an unmitigated dangerous capability days before launch; without a pre-commitment, leadership ships anyway "to be revisited." A pre-committed halt would have made not-shipping the default.

Scenario B — No Halt Channel: An engineer sees a failed control test but has no way to pause that doesn't route through the team racing to launch; the concern is overruled. A protected escalation channel would have forced a pause.

Scenario C — Routine Waiver: Halt conditions exist but are waived as a matter of course, so they never actually stop anything; non-routine, independently-reviewed overrides would have preserved their force.

9. Regulatory Mapping

RequirementEU AI ActNIST AI RMFISO 42001
R1: Pre-committed halt/pause policyArt. 55 — Risk mitigationMANAGE 1.3 — High-priority responseClause 6.1 — Actions to address risk
R2: Minimum trigger conditionsArt. 9 — Risk managementGOVERN 1.3 — Risk-based activityClause 6.1 — Actions to address risk
R3: Pre-specified response + authorityArt. 55 — GovernanceGOVERN 2.1 — AccountabilityClause 5.3 — Roles and authorities
R4: Non-routine override of safety conditionsArt. 55 — Risk mitigationGOVERN 2.1 — AccountabilityClause 9.3 — Management review
R5: Protected halt channelArt. 14 — Human oversight (stop)GOVERN 4.2 — Safety-first cultureA.3 — Internal organization
R6: Logged invocations + reviewArt. 12 — Record-keepingMANAGE 4.3 — Incident communicationClause 9.1 — Monitoring and measurement
R7: Strengthen + discloseArt. 55 — ReportingGOVERN 4.3 — Information sharingClause 10.1 — Continual improvement

EU AI Act — Article 55 and Article 9

Article 55 requires systemic-risk mitigation and the ability to act when risks materialise; a pre-committed halt is the most decisive such action. Article 9's lifecycle risk management requires defined stop conditions.

NIST AI RMF — MANAGE 1.3, GOVERN 2.1

MANAGE 1.3 (planned high-priority response, including stopping) and GOVERN 2.1 (documented roles and accountability for halt authority) require pre-committed, accountable halt conditions.

ISO 42001 — Clause 6.1, Clause 10.1

Clause 6.1 (actions to address risks) and Clause 10.1 (continual improvement) require defined, improving conditions under which the organisation will not proceed.

Cite this protocol
AgentGoverning. (2026). AG-827: Pre-Committed Halt and Pause Conditions. The Protocols of AI Agent Governance, AGS v2.1. agentgoverning.com/protocols/AG-827