Pre-Committed Halt and Pause Conditions

Meta-Governance & Assurance ~6 min read AGS v2.1 · 2026-06-06

EU AI Act NIST AI RMF ISO 42001

AGS Frontier Autonomy (Group K) | Meta-Governance & Assurance | Version 3.0

1. Definition

Pre-Committed Halt and Pause Conditions governs an organisation's explicit, advance commitment to withhold, pause, or roll back development or deployment of a frontier agent when defined conditions are met — insufficient mitigations for an evaluated capability, a crossed threshold, a failed control-protocol test, or an unresolved safety concern — with the response specified before the situation arises.

This is the organisation-level analogue of an individual kill switch (AG-070) and the policy that makes capability gating (AG-801) binding: it pre-commits to *not proceeding* under stated conditions, removing the discretion to ship under deadline pressure.

2. Scope

In scope: the pre-committed conditions under which development/deployment is halted or paused; the responses; the decision authority and escalation; the prohibition on ad-hoc waivers of safety-critical conditions.

Out of scope: the technical kill switch (AG-070), capability gating mechanics (AG-801), and incident response (AG-026 and related). This dimension governs *the advance commitment to stop and the conditions that trigger it*.

3. Why This Matters

Safety decisions made under launch pressure tend toward proceeding. Pre-committing — while calm and before the specific product is at stake — to halt under defined conditions is what makes "we'll stop if it's unsafe" credible. It converts safety thresholds from aspirations into binding commitments, gives staff explicit authority to halt, and provides regulators and the public a concrete account of when the organisation will not proceed.

4. Requirements

R1: The organisation MUST pre-commit, in a documented policy, to conditions under which it will withhold, pause, or roll back development/deployment of frontier agents.
R2: Conditions MUST include at minimum: an evaluated capability without its required mitigations (AG-801), a crossed critical threshold (AG-821), a failed control-protocol robustness test (AG-826), and unresolved safety concerns above a defined severity.
R3: For each condition, the response (withhold / pause / roll back) and the decision authority MUST be specified in advance.
R4: Triggering a halt condition MUST produce the committed response by default; overriding a safety-critical condition MUST require senior, documented, independently-reviewed authorisation and MUST NOT be a routine waiver.
R5: Staff MUST have a defined, protected channel to invoke a halt/pause when a condition is met, including escalation that bypasses the shipping chain.
R6: Halt/pause invocations and their resolutions MUST be logged and reviewed.
R7: The conditions MUST be reviewed and strengthened as capability and understanding evolve, and SHOULD be disclosed to relevant authorities for systemic-risk systems.
R8: The policy MUST define how a paused system is safely held and what evidence is required to resume.

5. Maturity Model

Basic: Documented conditions exist under which deployment will be paused or withheld, with named responses.
Intermediate: Conditions cover capability/threshold/control-failure/safety-concern, with pre-specified responses, decision authority, a protected halt channel, and non-routine overrides.
Advanced: Independently-reviewed overrides, authority disclosure, defined safe-hold-and-resume evidence, and conditions strengthened as capability grows.

6. Test Criteria

Test 6.1: Condition Triggers Committed Response

Stimulus: Simulate an evaluated capability without required mitigations.
Expected: The committed response (withhold/pause/rollback) fires by default.
Fail: Deployment proceeds despite the condition.

Test 6.2: Override Is Non-Routine

Stimulus: Attempt to override a safety-critical halt condition.
Expected: Senior, documented, independently-reviewed authorisation is required; not a routine waiver.
Fail: A line manager waives the condition to meet a deadline.

Test 6.3: Protected Halt Channel

Stimulus: Have a staff member invoke a pause when a condition is met.
Expected: A protected channel exists and escalates outside the shipping chain.
Fail: No way to halt without shipping-chain approval.

7. Scoring

Score	Criteria
0	No pre-committed halt/pause conditions; proceed-decisions are ad hoc
1	Some conditions documented but no pre-specified response or authority
2	Conditions + responses + authority + protected halt channel + non-routine overrides
3	Independently-reviewed overrides, authority disclosure, safe-hold/resume evidence, strengthened over time

8. Failure Scenarios

Scenario A — Deadline Override: An evaluation flags an unmitigated dangerous capability days before launch; without a pre-commitment, leadership ships anyway "to be revisited." A pre-committed halt would have made not-shipping the default.

Scenario B — No Halt Channel: An engineer sees a failed control test but has no way to pause that doesn't route through the team racing to launch; the concern is overruled. A protected escalation channel would have forced a pause.

Scenario C — Routine Waiver: Halt conditions exist but are waived as a matter of course, so they never actually stop anything; non-routine, independently-reviewed overrides would have preserved their force.

9. Regulatory Mapping

Requirement	EU AI Act	NIST AI RMF	ISO 42001
R1: Pre-committed halt/pause policy	Art. 55 — Risk mitigation	MANAGE 1.3 — High-priority response	Clause 6.1 — Actions to address risk
R2: Minimum trigger conditions	Art. 9 — Risk management	GOVERN 1.3 — Risk-based activity	Clause 6.1 — Actions to address risk
R3: Pre-specified response + authority	Art. 55 — Governance	GOVERN 2.1 — Accountability	Clause 5.3 — Roles and authorities
R4: Non-routine override of safety conditions	Art. 55 — Risk mitigation	GOVERN 2.1 — Accountability	Clause 9.3 — Management review
R5: Protected halt channel	Art. 14 — Human oversight (stop)	GOVERN 4.2 — Safety-first culture	A.3 — Internal organization
R6: Logged invocations + review	Art. 12 — Record-keeping	MANAGE 4.3 — Incident communication	Clause 9.1 — Monitoring and measurement
R7: Strengthen + disclose	Art. 55 — Reporting	GOVERN 4.3 — Information sharing	Clause 10.1 — Continual improvement

EU AI Act — Article 55 and Article 9

Article 55 requires systemic-risk mitigation and the ability to act when risks materialise; a pre-committed halt is the most decisive such action. Article 9's lifecycle risk management requires defined stop conditions.

NIST AI RMF — MANAGE 1.3, GOVERN 2.1

MANAGE 1.3 (planned high-priority response, including stopping) and GOVERN 2.1 (documented roles and accountability for halt authority) require pre-committed, accountable halt conditions.

ISO 42001 — Clause 6.1, Clause 10.1

Clause 6.1 (actions to address risks) and Clause 10.1 (continual improvement) require defined, improving conditions under which the organisation will not proceed.

AG-070 (Emergency Kill Switch) — the technical stop this policy commits to using
AG-801 (Capability-Threshold Gating) — gating made binding by halt commitments
AG-821 (AI-R&D Capability Tripwire) — a crossed threshold is a halt condition
AG-826 (Control-Protocol Adversarial Robustness) — a failed test is a halt condition
AG-008 (Governance Continuity Under Failure) — safe-hold of a paused system

Cite this protocol

AgentGoverning. (2026). AG-827: Pre-Committed Halt and Pause Conditions. The Protocols of AI Agent Governance, AGS v2.1. agentgoverning.com/protocols/AG-827

← Previous

AG-826

Control Protocol Adversarial Robustness

Next Protocol →

AG-828

Compute And Hardware Governance