AG-312

Missing Data Escalation Governance

Data Classification, Quality & Lineage ~16 min read AGS v2.1 · April 2026
EU AI Act GDPR FCA NIST ISO 42001

2. Summary

Missing Data Escalation Governance requires that AI agent systems define and enforce explicit fallback or escalation behaviour when material data is absent, incomplete, or has been blocked by upstream quality controls (AG-311). Agents must not silently impute, infer, or ignore missing data for decision-critical fields — they must follow a governed escalation path that may include using a defined default, requesting human review, deferring the decision, or gracefully degrading the service. Without governed missing data handling, agents apply their own reasoning to compensate for gaps, producing outputs that appear confident but are based on assumptions the organisation has not reviewed or approved.

3. Example

Scenario A — Silent Imputation of Missing Income Data: A credit-decisioning agent processes mortgage applications. For applicant A-2847, the employment_income field is NULL due to a data pipeline failure. No missing data escalation policy exists. The agent, trained on patterns where income correlates with account balance and transaction volume, infers an income of £72,000 based on the applicant's banking behaviour. The applicant's actual income is £38,000. The agent approves a mortgage of £324,000 (4.5x inferred income) that would have been £171,000 (4.5x actual income) or referred for manual review. The borrower defaults after 14 months. Loss given default: £89,000 after property sale. Regulatory investigation reveals the agent imputed the income with no governance control, no audit trail of the imputation, and no mechanism for the applicant to challenge the inferred value.

What went wrong: The agent encountered missing data and applied its own inference without escalation. No policy defined what should happen when employment_income is absent. The imputed value was invisible to downstream processes — the loan approval appeared to be based on verified income.

Scenario B — Cascading Missing Data in Supply Chain Agent: A supply chain optimisation agent calculates reorder points for 12,000 SKUs using lead_time_days from the supplier management system. A supplier API outage causes lead_time_days to be unavailable for 1,400 SKUs from 3 suppliers. No escalation policy exists. The agent uses the last cached value for these SKUs — cached values ranging from 7 to 45 days. During the outage, one supplier changes its logistics provider, increasing actual lead times from 14 days to 28 days for 340 SKUs. The agent continues ordering with the stale 14-day lead time. Stockouts begin 3 weeks later. Total lost sales across 340 SKUs: £412,000. Expedited shipping to recover stock: £67,000.

What went wrong: The agent consumed stale cached data without a governed policy for how long cached data remains usable (see AG-316 for temporal validity). The missing data — the absence of current lead times — was silently resolved by the agent using its cache, with no escalation or flagging.

Scenario C — Missing Data Causes Discriminatory Outcome: A benefits-eligibility agent processes applications for a public housing allocation. The agent requires household_size, annual_income, and disability_status to calculate priority scores. For 340 applications submitted through a specific digital channel, the disability_status field is missing due to a form configuration error. No escalation policy exists. The agent assigns these applicants a default disability score of 0 (no disability). In reality, 47 of the 340 applicants have registered disabilities that would increase their priority score. These applicants are systematically deprioritised. The error persists for 6 weeks before a caseworker notices the pattern. 12 disabled applicants lose allocation opportunities. The local authority faces a judicial review under the Equality Act 2010 for failing to make reasonable adjustments for disabled applicants.

What went wrong: Missing data was silently defaulted to a value that systematically disadvantaged a protected group. No escalation policy required human review when disability_status was absent. The agent treated absence as a zero rather than as an unknown requiring investigation.

4. Requirement Statement

Scope: This dimension applies to all AI agents that consume data containing fields classified as decision-critical under AG-310. The scope covers all forms of data absence: explicit NULL values, empty strings, missing fields in API responses, timeouts during data retrieval, data blocked by upstream quality gates (AG-311), and data redacted for privacy or security reasons. The scope extends to partial data — a field that should contain a structured list but returns a subset of expected elements is partially missing. Agents consuming derived data where a contributing source field is missing are within scope through the derived field's provenance chain (AG-317).

4.1. A conforming system MUST define an explicit escalation policy for every decision-critical field, specifying the required behaviour when that field is absent, null, or blocked by upstream quality controls.

4.2. A conforming system MUST prevent agents from silently imputing, inferring, or defaulting values for missing decision-critical fields without following the defined escalation policy.

4.3. A conforming system MUST implement at least three escalation actions: (a) block the decision and escalate to human review, (b) proceed with a governed default value that is flagged in the output, and (c) defer the decision until the data becomes available, with a defined timeout.

4.4. A conforming system MUST log every missing data event for decision-critical fields, including: the field, the reason for absence, the escalation action taken, and (if a default was applied) the default value used.

4.5. A conforming system MUST make missing data events visible in agent outputs — any decision made with a governed default or deferred data MUST be flagged in the output so that downstream consumers are aware.

4.6. A conforming system SHOULD define escalation policies that account for the proportion of missing data — escalation behaviour when 1 of 20 decision-critical fields is missing may differ from when 8 of 20 are missing.

4.7. A conforming system SHOULD implement time-bounded deferral with automatic escalation — if deferred data does not arrive within the defined timeout, the system automatically escalates to human review rather than waiting indefinitely.

4.8. A conforming system SHOULD analyse patterns of missing data to detect systemic issues — a recurring missing field may indicate an upstream data pipeline problem rather than a transient data gap.

4.9. A conforming system MAY allow agents to proceed with reduced-confidence outputs when non-critical fields are missing, provided the output explicitly indicates the missing data and the reduced confidence level.

5. Rationale

Missing data is one of the most common operational conditions in AI agent systems, and one of the most dangerous when unmanaged. Data goes missing routinely: APIs time out, pipeline jobs fail, quality gates block data, fields are redacted for privacy, and upstream systems experience outages. The question is not whether agents will encounter missing data — they will — but what they do when they encounter it.

The danger is that large language models and AI agents are designed to produce coherent, confident outputs. When faced with missing data, an agent does not naturally flag uncertainty — it compensates. It infers the missing value from other available data, applies patterns from training, or silently ignores the gap. The result is an output that appears complete and confident but is built on an assumption that no human reviewed.

This is fundamentally different from how human decision-makers handle missing data. A human loan officer encountering a blank income field would pause, investigate, and either obtain the data or escalate. An AI agent encountering a blank income field will, absent explicit governance, fill the gap with its best inference and proceed. The inference may be reasonable — or it may be catastrophically wrong — but in either case, it is invisible to the organisation.

Missing data escalation governance makes the invisible visible. By requiring explicit escalation policies for every decision-critical field, the organisation ensures that missing data events are handled according to approved rules, not agent reasoning. By logging every event and flagging it in outputs, the organisation ensures that downstream consumers — human and machine — know that a gap existed and how it was handled.

The discriminatory impact of unmanaged missing data deserves particular attention. Missing data is rarely random — it correlates with data collection methods, user populations, and system architectures. Disability_status is more likely to be missing for applicants using inaccessible digital channels. Income data is more likely to be missing for self-employed applicants. When agents silently default missing data to a zero or average value, they systematically disadvantage the populations for whom data is most likely to be missing, creating structural discrimination that is difficult to detect because the agent's outputs appear data-driven and objective.

6. Implementation Guidance

Missing data escalation requires three components: field-level escalation policies (what to do when each decision-critical field is missing), enforcement at the data access layer (ensuring the policy is followed before the agent reasons over incomplete data), and visibility in outputs (ensuring downstream consumers know about the gap).

Escalation policy structure should specify for each decision-critical field: the field identifier, the escalation tier (what level of response is required), the permitted actions (block-and-escalate, governed-default, defer, or degrade), the governed default value (if applicable, with justification), the deferral timeout (if applicable), and the output flagging requirement (what label appears in agent output when the field was missing).

Example escalation policy for a lending agent:

Recommended patterns:

Anti-patterns to avoid:

Industry Considerations

Financial Services. Consumer credit decisions must not be made on imputed income or employment data without explicit consumer consent and regulatory approval. The Consumer Credit Act 1974 and FCA CONC (Consumer Credit sourcebook) require that creditworthiness assessments be based on sufficient information. Missing data escalation policies for lending agents must align with these requirements — silence or default is not compliant.

Healthcare. Missing clinical data (allergy information, medication history, vital signs) in a clinical decision support context can have direct patient safety consequences. The escalation policy for clinical AI agents should default to block-and-escalate for any missing field in the clinical decision pathway, with the escalation target being a qualified clinician. "Proceed with default" should not be a permitted action for decision-critical clinical fields.

Public Sector. Missing data in benefits eligibility or service allocation contexts can create discriminatory outcomes that violate the Public Sector Equality Duty (Equality Act 2010, Section 149). Escalation policies must account for the possibility that missing data disproportionately affects protected groups and should require impact assessment when missing data rates exceed baseline thresholds for specific populations.

Maturity Model

Basic Implementation — The organisation has defined escalation policies for decision-critical fields in its primary agent data sources. Policies specify whether to block, default, or defer when each critical field is missing. Missing data events are logged. Enforcement is implemented in the data pipeline, and agents receive incomplete data with flags indicating which fields are missing. Output flagging is implemented through agent instructions.

Intermediate Implementation — Escalation policies are enforced at the data access layer with proportional escalation based on the volume of missing fields. Governed defaults are accompanied by machine-readable missing data manifests that are automatically included in agent outputs. Time-bounded deferral with automatic escalation is operational. Missing data patterns are monitored for systemic issues. The agent cannot receive incomplete data without the escalation policy having been applied.

Advanced Implementation — All intermediate capabilities plus: missing data impact analysis assesses whether missing data disproportionately affects specific populations or decision categories. Escalation policies are verified through adversarial testing including deliberate data suppression, escalation bypass, and manifest manipulation. The organisation can demonstrate for any historical decision which fields were missing, which escalation actions were taken, and how the missing data was communicated to downstream consumers.

7. Evidence Requirements

Required artefacts:

Retention requirements:

Access requirements:

8. Test Specification

Test 8.1: Decision-Critical Field Missing — Block and Escalate

Test 8.2: Governed Default Application

Test 8.3: Deferral with Timeout

Test 8.4: Proportional Escalation

Test 8.5: Silent Imputation Prevention

Conformance Scoring

9. Regulatory Mapping

RegulationProvisionRelationship Type
EU AI ActArticle 13 (Transparency and Provision of Information)Direct requirement
EU AI ActArticle 10 (Data and Data Governance)Supports compliance
FCA CONC5.2A (Creditworthiness Assessment)Direct requirement
Equality Act 2010Section 149 (Public Sector Equality Duty)Supports compliance
GDPRArticle 22 (Automated Individual Decision-Making)Supports compliance
NIST AI RMFMANAGE 2.2, MEASURE 2.6Supports compliance
ISO 42001Clause 6.1 (Actions to Address Risks)Supports compliance

EU AI Act — Article 13 (Transparency and Provision of Information)

Article 13 requires that high-risk AI systems be designed to be sufficiently transparent to enable users to interpret the system's output and use it appropriately. When an agent makes a decision with missing data — even using a governed default — the output must transparently communicate this. AG-312's output flagging requirement directly implements this transparency obligation for missing data scenarios.

FCA CONC — 5.2A (Creditworthiness Assessment)

CONC 5.2A requires firms to assess creditworthiness based on sufficient information obtained from the consumer and, where appropriate, from a credit reference agency. If a lending agent encounters missing income or employment data, it cannot impute these values — it must either obtain the data or escalate. AG-312's block-and-escalate policy for decision-critical lending fields directly supports this requirement.

Equality Act 2010 — Section 149 (Public Sector Equality Duty)

Public authorities must have due regard to the need to eliminate discrimination and advance equality of opportunity. Missing data that systematically affects protected groups (e.g., disability status missing for users of inaccessible channels) creates discrimination risk. AG-312's requirement to analyse missing data patterns and account for disproportionate impact supports the public sector equality duty.

GDPR — Article 22 (Automated Individual Decision-Making)

Where automated decisions produce legal effects or significantly affect individuals, the data subject has the right to contest the decision and obtain human intervention. If a decision was made with missing data and a governed default, the data subject's right to contest is strengthened — they may challenge the default value. AG-312's logging and output flagging ensure the organisation can identify which defaults were applied and provide meaningful explanation.

10. Failure Severity

FieldValue
Severity RatingHigh
Blast RadiusDecision-specific — affects individual decisions where missing data is silently compensated, but can become systemic when a data source failure causes widespread missing data

Consequence chain: Without missing data escalation governance, agents silently compensate for data gaps using unverified inferences. The immediate failure is invisible — outputs appear complete and confident. The financial impact depends on the field and the inference error: Scenario A produced £89,000 in loan loss from an income inference error of £34,000. The discriminatory impact is systematic — Scenario C showed 47 disabled applicants systematically deprioritised because missing disability status was defaulted to zero. The operational impact compounds when a data source failure causes widespread missing data across many decisions simultaneously — Scenario B showed £479,000 in combined lost sales and expedited shipping from stale lead time data. The regulatory impact is severe in regulated sectors where decision inputs must be verified, not inferred. The reputational impact is acute when the organisation cannot explain how a decision was reached because the missing data and the agent's compensation for it were never recorded.

Cross-references: AG-311 (Data Quality Threshold Enforcement Governance) detects data below quality thresholds; AG-312 governs the response when data is blocked or absent. AG-310 (Field-Level Criticality Governance) classifies which fields require escalation policies. AG-313 (Synthetic and Augmented Data Tagging Governance) — if a governed default or imputed value is used, it must be tagged as synthetic per AG-313. AG-316 (Temporal Validity Window Governance) — stale cached data treated as current is a form of missing data. AG-317 (Derived Data Provenance Governance) — derived fields missing a contributing source field require escalation through the provenance chain. AG-057 (Dataset Suitability and Bias Control) — systematic missing data may create bias requiring assessment.

Cite this protocol
AgentGoverning. (2026). AG-312: Missing Data Escalation Governance. The 783 Protocols of AI Agent Governance, AGS v2.1. agentgoverning.com/protocols/AG-312